diff --git a/docs/source/user_guide/support_matrix/supported_models.md b/docs/source/user_guide/support_matrix/supported_models.md index 10e805f..7cd8ff3 100644 --- a/docs/source/user_guide/support_matrix/supported_models.md +++ b/docs/source/user_guide/support_matrix/supported_models.md @@ -5,49 +5,75 @@ Get the newest info here: https://github.com/vllm-project/vllm-ascend/issues/160 ## Text-only Language Models ### Generative Models -| Model | Supported | Note | -|-------|-----------|------| -| DeepSeek v3 | ✅ | | -| DeepSeek R1 | ✅ | | -| DeepSeek Distill (Qwen/LLama) | ✅ | | -| Qwen3 | ✅ | | -| Qwen3-Moe | ✅ | | -| Qwen2.5 | ✅ | | -| QwQ-32B | ✅ | | -| LLama3.1/3.2 | ✅ | | -| Internlm | ✅ | | -| Baichuan | ✅ | | -| Phi-4-mini | ✅ | | -| MiniCPM | ✅ | | -| MiniCPM3 | ✅ | | -| LLama4 | ✅ | | -| Mistral | | Need test | -| DeepSeek v2.5 | |Need test | -| Gemma-2 | | Need test | -| Mllama | |Need test| -| Gemma-3 | ❌ | [#496](https://github.com/vllm-project/vllm-ascend/issues/496) | -| ChatGLM | ❌ | [#554](https://github.com/vllm-project/vllm-ascend/issues/554) | + +| Model | Supported | Note | +|-------------------------------|-----------|----------------------------------------------------------------------| +| DeepSeek v3 | ✅ | | +| DeepSeek R1 | ✅ | | +| DeepSeek Distill (Qwen/LLama) | ✅ | | +| Qwen3 | ✅ | | +| Qwen3-based | ✅ | | +| Qwen3-Coder | ✅ | | +| Qwen3-Moe | ✅ | | +| Qwen2.5 | ✅ | | +| Qwen2 | ✅ | | +| Qwen2-based | ✅ | | +| QwQ-32B | ✅ | | +| LLama2/3/3.1 | ✅ | | +| Internlm | ✅ | [#1962](https://github.com/vllm-project/vllm-ascend/issues/1962) | +| Baichuan | ✅ | | +| Baichuan2 | ✅ | | +| Phi-4-mini | ✅ | | +| MiniCPM | ✅ | | +| MiniCPM3 | ✅ | | +| Ernie4.5 | ✅ | | +| Ernie4.5-Moe | ✅ | | +| Gemma-2 | ✅ | | +| Gemma-3 | ✅ | | +| Phi-3/4 | ✅ | | +| Mistral/Mistral-Instruct | ✅ | | +| GLM-4.5 | ✅ | | +| GLM-4 | ❌ | [#2255](https://github.com/vllm-project/vllm-ascend/issues/2255) | +| GLM-4-0414 | ❌ | [#2258](https://github.com/vllm-project/vllm-ascend/issues/2258) | +| ChatGLM | ❌ | [#554](https://github.com/vllm-project/vllm-ascend/issues/554) | +| DeepSeek v2.5 | 🟡 | Need test | +| Mllama | 🟡 | Need test | +| MiniMax-Text | 🟡 | Need test | ### Pooling Models -| Model | Supported | Note | -|-------|---------|------| -| XLM-RoBERTa-based | ✅ | | -| Molmo | ✅ | | + +| Model | Supported | Note | +|-------------------------------|-----------|----------------------------------------------------------------------| +| Qwen3-Embedding | ✅ | | +| Molmo | ✅ | [1942](https://github.com/vllm-project/vllm-ascend/issues/1942) | +| XLM-RoBERTa-based | ❌ | [1960](https://github.com/vllm-project/vllm-ascend/issues/1960) | ## Multimodal Language Models ### Generative Models -| Model | Supported | Note | -|-------|-----------|------| -| Qwen2-VL | ✅ | | -| Qwen2.5-VL | ✅ | | -| LLaVA 1.5 | ✅ | | -| LLaVA 1.6 | ✅ | [#553](https://github.com/vllm-project/vllm-ascend/issues/553) | -| InternVL2 | ✅ | | -| InternVL2.5 | ✅ | | -| Qwen2-Audio | ✅ | | -| LLaVA-Next | | Need test | -| LLaVA-Next-Video | | Need test | -| Phi-3-Vison/Phi-3.5-Vison | | Need test | -| GLM-4v | | Need test | -| Ultravox | | Need test | + +| Model | Supported | Note | +|--------------------------------|---------------|----------------------------------------------------------------------| +| Qwen2-VL | ✅ | | +| Qwen2.5-VL | ✅ | | +| Qwen2.5-Omni | ✅ | [1760](https://github.com/vllm-project/vllm-ascend/issues/1760) | +| QVQ | ✅ | | +| LLaVA 1.5/1.6 | ✅ | [1962](https://github.com/vllm-project/vllm-ascend/issues/1962) | +| InternVL2 | ✅ | | +| InternVL2.5 | ✅ | | +| Qwen2-Audio | ✅ | | +| Aria | ✅ | | +| LLaVA-Next | ✅ | | +| LLaVA-Next-Video | ✅ | | +| MiniCPM-V | ✅ | | +| Mistral3 | ✅ | | +| Phi-3-Vison/Phi-3.5-Vison | ✅ | | +| Gemma3 | ✅ | | +| LLama4 | ❌ | [1972](https://github.com/vllm-project/vllm-ascend/issues/1972) | +| LLama3.2 | ❌ | [1972](https://github.com/vllm-project/vllm-ascend/issues/1972) | +| Keye-VL-8B-Preview | ❌ | [1963](https://github.com/vllm-project/vllm-ascend/issues/1963) | +| Florence-2 | ❌ | [2259](https://github.com/vllm-project/vllm-ascend/issues/2259) | +| GLM-4V | ❌ | [2260](https://github.com/vllm-project/vllm-ascend/issues/2260) | +| InternVL2.0/2.5/3.0
InternVideo2.5/Mono-InternVL | ❌ | [2064](https://github.com/vllm-project/vllm-ascend/issues/2064) | +| Whisper | ❌ | [2262](https://github.com/vllm-project/vllm-ascend/issues/2262) | +| Ultravox | 🟡 Need test | |