Initial commit for vLLM-Kunlun Plugin

2025-12-10 12:05:39 +08:00
commit c728e52505
131 changed files with 28816 additions and 0 deletions
--- a/docs/source/user_guide/support_matrix/index.md
+++ b/docs/source/user_guide/support_matrix/index.md
@@ -0,0 +1,10 @@
+# Features and Models
+
+This section provides a detailed matrix supported by vLLM-Kunlun.
+
+:::{toctree}
+:caption: Support Matrix
+:maxdepth: 1
+supported_models
+supported_features
+:::
--- a/docs/source/user_guide/support_matrix/supported_features.md
+++ b/docs/source/user_guide/support_matrix/supported_features.md
@@ -0,0 +1,14 @@
+# Supported Features
+
+The feature support principle of vLLM-KunLun is: **aligned with the vLLM**. We are also actively collaborating with the community to accelerate support.
+
+You can check the [support status of vLLM V1 Engine][v1_user_guide]. Below is the feature support status of vLLM-KunLun:
+
+## Features Supported
+|Feature|Status|Note|
+|-|-|-|
+|Tensor Parallel|🟢 Functional||
+|Experts Parallel|🟢 Functional||
+|Graph Mode|🟢 Functional||
+|Quantization| 🟢 Functional||
+|LoRA|⚠️ Need Test|Only LLM models|
--- a/docs/source/user_guide/support_matrix/supported_models.md
+++ b/docs/source/user_guide/support_matrix/supported_models.md
@@ -0,0 +1,33 @@
+# Supported Models
+
+## Generative Models
+
+| Model         | Support       | W8A8 | LoRA | Tensor Parallel | Expert Parallel | Data Parallel | Piecewise Kunlun Graph |
+| :------------ | :------------ | :--- | :--- | :-------------- | :-------------- | :------------ | :--------------------- |
+| Qwen2         | ✅             |      | ✅    | ✅               |                 | ✅             | ✅                      |
+| Qwen2.5       | ✅             |      | ✅    | ✅               |                 | ✅             | ✅                      |
+| Qwen3         | ✅             |      | ✅    | ✅               |                 | ✅             | ✅                      |
+| Qwen3-Moe     | ✅             | ✅    | ✅    | ✅               | ✅               | ✅             | ✅                      |
+| Qwen3-Coder   | ✅             | ✅    | ✅    | ✅               | ✅               | ✅             | ✅                      |
+| QwQ-32B       | ✅             |      |      | ✅               |                 | ✅             | ✅                      |
+| LLama2        | ✅             |      |      | ✅               |                 | ✅             | ✅                      |
+| LLama3        | ✅             |      |      | ✅               |                 | ✅             | ✅                      |
+| LLama3.1      | ✅             |      |      | ✅               |                 | ✅             | ✅                      |
+| GLM-4.5       | ✅             | ✅    | ✅    | ✅               | ✅               | ✅             | ✅                      |
+| GLM-4.5-Air   | ✅             | ✅    | ✅    | ✅               | ✅               | ✅             | ✅                      |
+| Qwen3-next    | 🔜Comming soon |      |      |                 |                 |               |                        |
+| gpt-oss       | 🔜Comming soon |      |      |                 |                 |               |                        |
+| DeepSeek-V3   | 🔜Comming soon |      |      |                 |                 |               |                        |
+| DeepSeek-V3.2 | 🔜Comming soon |      |      |                 |                 |               |                        |
+
+## Multimodal Language Models
+| Model        | Support       | W8A8 | LoRA | Tensor Parallel | Expert Parallel | Data Parallel | Piecewise Kunlun Graph |
+| :----------- | :------------ | :--- | :--- | :-------------- | :-------------- | :------------ | :--------------------- |
+|Qianfan-VL    | ✅     |       |      |       ✅|               |✅               |✅|
+| Qwen2.5VL    | ✅             |      |      | ✅               |                 | ✅             | ✅                      |
+| InternVL2.5  | ✅             |      |      | ✅               |                 | ✅             | ✅                      |
+| InternVL3    | ✅             |      |      | ✅               |                 | ✅             | ✅                      |
+| InternVL3.5  | ✅             |      |      | ✅               |                 | ✅             | ✅                      |
+| InternS1     | ✅             |      |      | ✅               |                 | ✅             | ✅                      |
+| Qwen2.5-Omni | 🔜Comming soon |      |      |                 |                 |               |                        |
+| Qwen3-VL     | 🔜Comming soon |      |      |                 |                 |               |                        |