提交vllm0.11.0开发分支
This commit is contained in:
@@ -4,30 +4,12 @@
|
||||
|
||||
| Model | Support | W8A8 | LoRA | Tensor Parallel | Expert Parallel | Data Parallel | Piecewise Kunlun Graph |
|
||||
| :------------ | :------------ | :--- | :--- | :-------------- | :-------------- | :------------ | :--------------------- |
|
||||
| Qwen2 | ✅ | | ✅ | ✅ | | ✅ | ✅ |
|
||||
| Qwen2.5 | ✅ | | ✅ | ✅ | | ✅ | ✅ |
|
||||
| Qwen3 | ✅ | | ✅ | ✅ | | ✅ | ✅ |
|
||||
| Qwen3-Moe | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
|
||||
| Qwen3-Coder | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
|
||||
| QwQ-32B | ✅ | | | ✅ | | ✅ | ✅ |
|
||||
| LLama2 | ✅ | | | ✅ | | ✅ | ✅ |
|
||||
| LLama3 | ✅ | | | ✅ | | ✅ | ✅ |
|
||||
| LLama3.1 | ✅ | | | ✅ | | ✅ | ✅ |
|
||||
| GLM-4.5 | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
|
||||
| GLM-4.5-Air | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
|
||||
| Qwen3-next | 🔜Comming soon | | | | | | |
|
||||
| gpt-oss | 🔜Comming soon | | | | | | |
|
||||
| DeepSeek-V3 | 🔜Comming soon | | | | | | |
|
||||
| DeepSeek-V3.2 | 🔜Comming soon | | | | | | |
|
||||
| Qwen3-Next | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
|
||||
|
||||
|
||||
## Multimodal Language Models
|
||||
| Model | Support | W8A8 | LoRA | Tensor Parallel | Expert Parallel | Data Parallel | Piecewise Kunlun Graph |
|
||||
| :----------- | :------------ | :--- | :--- | :-------------- | :-------------- | :------------ | :--------------------- |
|
||||
|Qianfan-VL | ✅ | | | ✅| |✅ |✅|
|
||||
| Qwen2.5VL | ✅ | | | ✅ | | ✅ | ✅ |
|
||||
| InternVL2.5 | ✅ | | | ✅ | | ✅ | ✅ |
|
||||
| InternVL3 | ✅ | | | ✅ | | ✅ | ✅ |
|
||||
| InternVL3.5 | ✅ | | | ✅ | | ✅ | ✅ |
|
||||
| InternS1 | ✅ | | | ✅ | | ✅ | ✅ |
|
||||
| Qwen2.5-Omni | 🔜Comming soon | | | | | | |
|
||||
| Qwen3-VL | 🔜Comming soon | | | | | | |
|
||||
| Qwen3-VL | ✅ | | | ✅ | | ✅ | ✅ |
|
||||
|
||||
Reference in New Issue
Block a user