diff --git a/README.md b/README.md
index 018d1f008..ecbdbbfde 100644
--- a/README.md
+++ b/README.md
@@ -378,7 +378,12 @@ python -m sglang.launch_server --model-path meta-llama/Llama-2-7b-chat-hf --port
   - `python3 -m sglang.launch_server --model-path liuhaotian/llava-v1.6-34b --tokenizer-path liuhaotian/llava-v1.6-34b-tokenizer --port 3000`
 - Yi-VL
   - see [srt_example_yi_vl.py](examples/quick_start/srt_example_yi_vl.py).
-- AWQ/GPTQ quantization
+- StableLM
+- Command-R
+- DBRX
+- AWQ/GPTQ/Marlin quantization
+
+Instructions for supporting a new model are [here](https://github.com/sgl-project/sglang/blob/main/docs/model_support.md).
 
 ## Benchmark And Performance
 