Update model support in readme (#370)

Ying Sheng
2024-04-17 00:16:32 -07:00
committed by GitHub
parent 2b6d999191
commit ca4f1ab89c

@@ -378,7 +378,12 @@ python -m sglang.launch_server --model-path meta-llama/Llama-2-7b-chat-hf --port
 - `python3 -m sglang.launch_server --model-path liuhaotian/llava-v1.6-34b --tokenizer-path liuhaotian/llava-v1.6-34b-tokenizer --port 3000`
 - Yi-VL
 - see [srt_example_yi_vl.py](examples/quick_start/srt_example_yi_vl.py).
-- AWQ/GPTQ quantization
+- StableLM
+- Command-R
+- DBRX
+- AWQ/GPTQ/Marlin quantization
 Instructions for supporting a new model are [here](https://github.com/sgl-project/sglang/blob/main/docs/model_support.md).
 ## Benchmark And Performance