[Doc] Update user guide for suported models (#2263)

### What this PR does / why we need it? Update user guide for suported models - vLLM version: v0.10.0 - vLLM main: 4be02a3776 --------- Signed-off-by: hfadzxy <starmoon_zhang@163.com>
2025-08-07 14:39:51 +08:00
parent 440d28a138
commit 92eebc0c9b
1 changed files with 66 additions and 40 deletions
--- a/docs/source/user_guide/support_matrix/supported_models.md
+++ b/docs/source/user_guide/support_matrix/supported_models.md
@@ -5,49 +5,75 @@ Get the newest info here: https://github.com/vllm-project/vllm-ascend/issues/160
 ## Text-only Language Models
 ### Generative Models
-| Model | Supported | Note |
+
-|-------|-----------|------|
+| Model                         | Supported | Note                                                                 |
-| DeepSeek v3 | ✅ | |
+|-------------------------------|-----------|----------------------------------------------------------------------|
-| DeepSeek R1 | ✅ | |
+| DeepSeek v3                   | ✅        |                                                                      |
-| DeepSeek Distill (Qwen/LLama) | ✅ | |
+| DeepSeek R1                   | ✅        |                                                                      |
-| Qwen3 | ✅ | |
+| DeepSeek Distill (Qwen/LLama) | ✅        |                                                                      |
-| Qwen3-Moe | ✅ | |
+| Qwen3                         | ✅        |                                                                      |
-| Qwen2.5 | ✅ | |
+| Qwen3-based                   | ✅        |                                                                      |
-| QwQ-32B | ✅ | |
+| Qwen3-Coder                   | ✅        |                                                                      |
-| LLama3.1/3.2 | ✅ | |
+| Qwen3-Moe                     | ✅        |                                                                      |
-| Internlm | ✅ | |
+| Qwen2.5                       | ✅        |                                                                      |
-| Baichuan | ✅ | |
+| Qwen2                         | ✅        |                                                                      |
-| Phi-4-mini | ✅ | |
+| Qwen2-based                   | ✅        |                                                                      |
-| MiniCPM | ✅ | |
+| QwQ-32B                       | ✅        |                                                                      |
-| MiniCPM3 | ✅ | |
+| LLama2/3/3.1                  | ✅        |                                                                      |
-| LLama4 | ✅ | |
+| Internlm                      | ✅        | [#1962](https://github.com/vllm-project/vllm-ascend/issues/1962)     |
-| Mistral | | Need test |
+| Baichuan                      | ✅        |                                                                      |
-| DeepSeek v2.5 | |Need test |
+| Baichuan2                     | ✅        |                                                                      |
-| Gemma-2 | | Need test |
+| Phi-4-mini                    | ✅        |                                                                      |
-| Mllama |  |Need test|
+| MiniCPM                       | ✅        |                                                                      |
-| Gemma-3 | ❌ | [#496](https://github.com/vllm-project/vllm-ascend/issues/496) |
+| MiniCPM3                      | ✅        |                                                                      |
-| ChatGLM | ❌ | [#554](https://github.com/vllm-project/vllm-ascend/issues/554) |
+| Ernie4.5                      | ✅        |                                                                      |
 | Ernie4.5-Moe                  | ✅        |                                                                      |
 | Gemma-2                       | ✅        |                                                                      |
 | Gemma-3                       | ✅        |                                                                      |
 | Phi-3/4                       | ✅        |                                                                      |
 | Mistral/Mistral-Instruct      | ✅        |                                                                      |
 | GLM-4.5                       | ✅            |                                                                  |
 | GLM-4                         | ❌        | [#2255](https://github.com/vllm-project/vllm-ascend/issues/2255)     |
 | GLM-4-0414                    | ❌        | [#2258](https://github.com/vllm-project/vllm-ascend/issues/2258)     |
 | ChatGLM                       | ❌        | [#554](https://github.com/vllm-project/vllm-ascend/issues/554)       |
 | DeepSeek v2.5                 | 🟡        | Need test                                                            |
 | Mllama                        | 🟡        | Need test                                                            |
 | MiniMax-Text                  | 🟡        | Need test                                                            |
 ### Pooling Models
-| Model | Supported | Note |
+
-|-------|---------|------|
+| Model                         | Supported | Note                                                                 |
-| XLM-RoBERTa-based | ✅ |  |
+|-------------------------------|-----------|----------------------------------------------------------------------|
-| Molmo | ✅ |  |
+| Qwen3-Embedding               | ✅        |                                                                      |
 | Molmo                         | ✅        | [1942](https://github.com/vllm-project/vllm-ascend/issues/1942)      |
 | XLM-RoBERTa-based             | ❌        | [1960](https://github.com/vllm-project/vllm-ascend/issues/1960)      |
 ## Multimodal Language Models
 ### Generative Models
-| Model | Supported | Note |
+
-|-------|-----------|------|
+| Model                          | Supported     | Note                                                                 |
-| Qwen2-VL | ✅ | |
+|--------------------------------|---------------|----------------------------------------------------------------------|
-| Qwen2.5-VL | ✅ | |
+| Qwen2-VL                       | ✅            |                                                                      |
-| LLaVA 1.5 | ✅ | |
+| Qwen2.5-VL                     | ✅            |                                                                      |
-| LLaVA 1.6 | ✅ | [#553](https://github.com/vllm-project/vllm-ascend/issues/553) |
+| Qwen2.5-Omni                   | ✅            | [1760](https://github.com/vllm-project/vllm-ascend/issues/1760)      |
-| InternVL2 | ✅ | |
+| QVQ                            | ✅            |                                                                      |
-| InternVL2.5 | ✅ | |
+| LLaVA 1.5/1.6                  | ✅            | [1962](https://github.com/vllm-project/vllm-ascend/issues/1962)      |
-| Qwen2-Audio | ✅ |  |
+| InternVL2                      | ✅            |                                                                      |
-| LLaVA-Next |  | Need test |
+| InternVL2.5                    | ✅            |                                                                      |
-| LLaVA-Next-Video | | Need test |
+| Qwen2-Audio                    | ✅            |                                                                      |
-| Phi-3-Vison/Phi-3.5-Vison | | Need test |
+| Aria                           | ✅            |                                                                      |
-| GLM-4v | | Need test |
+| LLaVA-Next                     | ✅            |                                                                      |
-| Ultravox |  | Need test |
+| LLaVA-Next-Video               | ✅            |                                                                      |
 | MiniCPM-V                      | ✅            |                                                                      |
 | Mistral3                       | ✅            |                                                                      |
 | Phi-3-Vison/Phi-3.5-Vison      | ✅            |                                                                      |
 | Gemma3                         | ✅            |                                                                      |
 | LLama4                         | ❌            | [1972](https://github.com/vllm-project/vllm-ascend/issues/1972)      |
 | LLama3.2                       | ❌            | [1972](https://github.com/vllm-project/vllm-ascend/issues/1972)      |
 | Keye-VL-8B-Preview             | ❌            | [1963](https://github.com/vllm-project/vllm-ascend/issues/1963)      |
 | Florence-2                     | ❌            | [2259](https://github.com/vllm-project/vllm-ascend/issues/2259)      |
 | GLM-4V                         | ❌            | [2260](https://github.com/vllm-project/vllm-ascend/issues/2260)      |
 | InternVL2.0/2.5/3.0<br>InternVideo2.5/Mono-InternVL | ❌ | [2064](https://github.com/vllm-project/vllm-ascend/issues/2064) |
 | Whisper                        | ❌            | [2262](https://github.com/vllm-project/vllm-ascend/issues/2262)      |
 | Ultravox                       | 🟡 Need test  |                                                                      |