Suppport qwen model and solve some problems (#75)
This commit is contained in:
@@ -316,6 +316,7 @@ python -m sglang.launch_server --model-path meta-llama/Llama-2-7b-chat-hf --port
|
||||
- Mixtral
|
||||
- LLaVA
|
||||
- `python3 -m sglang.launch_server --model-path liuhaotian/llava-v1.5-7b --tokenizer-path llava-hf/llava-1.5-7b-hf --port 30000`
|
||||
- Qwen
|
||||
- AWQ quantization
|
||||
|
||||
## Benchmark And Performance
|
||||
|
||||
Reference in New Issue
Block a user