[Doc] Update user guide for suported models (#2263)

### What this PR does / why we need it?
 Update user guide for suported models 

- vLLM version: v0.10.0
- vLLM main:
4be02a3776

---------

Signed-off-by: hfadzxy <starmoon_zhang@163.com>
This commit is contained in:
zhangxinyuehfad
2025-08-07 14:39:51 +08:00
committed by GitHub
parent 440d28a138
commit 92eebc0c9b

View File

@@ -5,49 +5,75 @@ Get the newest info here: https://github.com/vllm-project/vllm-ascend/issues/160
## Text-only Language Models
### Generative Models
| Model | Supported | Note |
|-------|-----------|------|
| DeepSeek v3 | ✅ | |
| DeepSeek R1 | ✅ | |
| DeepSeek Distill (Qwen/LLama) | ✅ | |
| Qwen3 | ✅ | |
| Qwen3-Moe | ✅ | |
| Qwen2.5 | ✅ | |
| QwQ-32B | ✅ | |
| LLama3.1/3.2 | ✅ | |
| Internlm | ✅ | |
| Baichuan | ✅ | |
| Phi-4-mini | ✅ | |
| MiniCPM | ✅ | |
| MiniCPM3 | ✅ | |
| LLama4 | ✅ | |
| Mistral | | Need test |
| DeepSeek v2.5 | |Need test |
| Gemma-2 | | Need test |
| Mllama | |Need test|
| Gemma-3 | ❌ | [#496](https://github.com/vllm-project/vllm-ascend/issues/496) |
| ChatGLM | ❌ | [#554](https://github.com/vllm-project/vllm-ascend/issues/554) |
| Model | Supported | Note |
|-------------------------------|-----------|----------------------------------------------------------------------|
| DeepSeek v3 | ✅ | |
| DeepSeek R1 | ✅ | |
| DeepSeek Distill (Qwen/LLama) | ✅ | |
| Qwen3 | ✅ | |
| Qwen3-based | ✅ | |
| Qwen3-Coder | ✅ | |
| Qwen3-Moe | ✅ | |
| Qwen2.5 | ✅ | |
| Qwen2 | ✅ | |
| Qwen2-based | ✅ | |
| QwQ-32B | ✅ | |
| LLama2/3/3.1 | ✅ | |
| Internlm | ✅ | [#1962](https://github.com/vllm-project/vllm-ascend/issues/1962) |
| Baichuan | ✅ | |
| Baichuan2 | ✅ | |
| Phi-4-mini | ✅ | |
| MiniCPM | ✅ | |
| MiniCPM3 | ✅ | |
| Ernie4.5 | ✅ | |
| Ernie4.5-Moe | ✅ | |
| Gemma-2 | ✅ | |
| Gemma-3 | ✅ | |
| Phi-3/4 | ✅ | |
| Mistral/Mistral-Instruct | ✅ | |
| GLM-4.5 | ✅ | |
| GLM-4 | ❌ | [#2255](https://github.com/vllm-project/vllm-ascend/issues/2255) |
| GLM-4-0414 | ❌ | [#2258](https://github.com/vllm-project/vllm-ascend/issues/2258) |
| ChatGLM | ❌ | [#554](https://github.com/vllm-project/vllm-ascend/issues/554) |
| DeepSeek v2.5 | 🟡 | Need test |
| Mllama | 🟡 | Need test |
| MiniMax-Text | 🟡 | Need test |
### Pooling Models
| Model | Supported | Note |
|-------|---------|------|
| XLM-RoBERTa-based | ✅ | |
| Molmo | ✅ | |
| Model | Supported | Note |
|-------------------------------|-----------|----------------------------------------------------------------------|
| Qwen3-Embedding | ✅ | |
| Molmo | ✅ | [1942](https://github.com/vllm-project/vllm-ascend/issues/1942) |
| XLM-RoBERTa-based | ❌ | [1960](https://github.com/vllm-project/vllm-ascend/issues/1960) |
## Multimodal Language Models
### Generative Models
| Model | Supported | Note |
|-------|-----------|------|
| Qwen2-VL | ✅ | |
| Qwen2.5-VL | ✅ | |
| LLaVA 1.5 | ✅ | |
| LLaVA 1.6 | ✅ | [#553](https://github.com/vllm-project/vllm-ascend/issues/553) |
| InternVL2 | ✅ | |
| InternVL2.5 | ✅ | |
| Qwen2-Audio | ✅ | |
| LLaVA-Next | | Need test |
| LLaVA-Next-Video | | Need test |
| Phi-3-Vison/Phi-3.5-Vison | | Need test |
| GLM-4v | | Need test |
| Ultravox | | Need test |
| Model | Supported | Note |
|--------------------------------|---------------|----------------------------------------------------------------------|
| Qwen2-VL | ✅ | |
| Qwen2.5-VL | ✅ | |
| Qwen2.5-Omni | ✅ | [1760](https://github.com/vllm-project/vllm-ascend/issues/1760) |
| QVQ | ✅ | |
| LLaVA 1.5/1.6 | ✅ | [1962](https://github.com/vllm-project/vllm-ascend/issues/1962) |
| InternVL2 | ✅ | |
| InternVL2.5 | ✅ | |
| Qwen2-Audio | ✅ | |
| Aria | ✅ | |
| LLaVA-Next | ✅ | |
| LLaVA-Next-Video | ✅ | |
| MiniCPM-V | ✅ | |
| Mistral3 | ✅ | |
| Phi-3-Vison/Phi-3.5-Vison | ✅ | |
| Gemma3 | ✅ | |
| LLama4 | ❌ | [1972](https://github.com/vllm-project/vllm-ascend/issues/1972) |
| LLama3.2 | ❌ | [1972](https://github.com/vllm-project/vllm-ascend/issues/1972) |
| Keye-VL-8B-Preview | ❌ | [1963](https://github.com/vllm-project/vllm-ascend/issues/1963) |
| Florence-2 | ❌ | [2259](https://github.com/vllm-project/vllm-ascend/issues/2259) |
| GLM-4V | ❌ | [2260](https://github.com/vllm-project/vllm-ascend/issues/2260) |
| InternVL2.0/2.5/3.0<br>InternVideo2.5/Mono-InternVL | ❌ | [2064](https://github.com/vllm-project/vllm-ascend/issues/2064) |
| Whisper | ❌ | [2262](https://github.com/vllm-project/vllm-ascend/issues/2262) |
| Ultravox | 🟡 Need test | |