Initial commit for vLLM-Kunlun Plugin

2025-12-10 12:05:39 +08:00
commit c728e52505
131 changed files with 28816 additions and 0 deletions
--- a/docs/source/developer_guide/evaluation/accuracy_report/GLM-4.5-Air.md
+++ b/docs/source/developer_guide/evaluation/accuracy_report/GLM-4.5-Air.md
@@ -0,0 +1,18 @@
+# GLM-Air-4.5
+
+* vLLM Version: vLLM: 0.10.1.1 , vLLM-KunLun Version: v0.10.1.1
+* Software Environment:OS: Ubuntu 22.04, PyTorch ≥ 2.5.1
+* Hardware Environment: KunLun P800
+* Parallel mode:TP8
+
+```bash
+-------------+----------+---------------+---------+-----+--------+---------+
+| Model       | Dataset  | Metric        | Subset  | Num | Score  | Cat.0   |
+-------------+----------+---------------+---------+-----+--------+---------+
+| GLM-4.5-Air | math_500 | AveragePass@1 | Level 1 | 43  | 0.9302 | default |
+| GLM-4.5-Air | math_500 | AveragePass@1 | Level 2 | 90  | 0.9222 | default |
+| GLM-4.5-Air | math_500 | AveragePass@1 | Level 3 | 105 | 0.8762 | default |
+| GLM-4.5-Air | math_500 | AveragePass@1 | Level 4 | 128 | 0.8984 | default |
+| GLM-4.5-Air | math_500 | AveragePass@1 | Level 5 | 134 | 0.8955 | default |
+-------------+----------+---------------+---------+-----+--------+---------+
+```
--- a/docs/source/developer_guide/evaluation/accuracy_report/GLM-4.5.md
+++ b/docs/source/developer_guide/evaluation/accuracy_report/GLM-4.5.md
@@ -0,0 +1,18 @@
+# GLM-4.5
+
+* vLLM Version: vLLM: 0.10.1.1 , vLLM-KunLun Version: v0.10.1.1
+* Software Environment:OS: Ubuntu 22.04, PyTorch ≥ 2.5.1
+* Hardware Environment: KunLun P800
+* Parallel mode:TP8
+
+```bash
+---------+----------+---------------+---------+-----+--------+---------+
+| Model   | Dataset  | Metric        | Subset  | Num | Score  | Cat.0   |
+---------+----------+---------------+---------+-----+--------+---------+
+| GLM-4.5 | math_500 | AveragePass@1 | Level 1 |  43 | 0.9302 | default |
+| GLM-4.5 | math_500 | AveragePass@1 | Level 2 |  90 | 0.8111 | default |
+| GLM-4.5 | math_500 | AveragePass@1 | Level 3 | 105 | 0.7143 | default |
+| GLM-4.5 | math_500 | AveragePass@1 | Level 4 | 128 | 0.6172 | default |
+| GLM-4.5 | math_500 | AveragePass@1 | Level 5 | 134 | 0.5149 | default |
+---------+----------+---------------+---------+-----+--------+---------+
+```
--- a/docs/source/developer_guide/evaluation/accuracy_report/InternVL3_5-30B-A3B.md
+++ b/docs/source/developer_guide/evaluation/accuracy_report/InternVL3_5-30B-A3B.md
@@ -0,0 +1,18 @@
+# InternVL3_5-30B-A3B
+
+* vLLM Version: vLLM: 0.10.1.1 , vLLM-KunLun Version: v0.10.1.1
+* Software Environment:OS: Ubuntu 22.04, PyTorch ≥ 2.5.1
+* Hardware Environment: KunLun P800
+* Parallel mode:TP8
+
+```
+-------------+---------------------+--------------+---------------+-------+
+|  task_type  |       metric        | dataset_name | average_score | count |
+-------------+---------------------+--------------+---------------+-------+
+|    exam     |         acc         |   mmmu_pro   |    0.5449     |  334  |
+|    math     |         acc         |  math_vista  |    0.6847     |  333  |
+|    exam     |         acc         |   mmlu_pro   |    0.6126     |  111  |
+| instruction | prompt_level_strict |    ifeval    |    0.7658     |  111  |
+|    math     |         acc         |    gsm8k     |    0.9369     |  111  |
+-------------+---------------------+--------------+---------------+-------+
+```
--- a/docs/source/developer_guide/evaluation/accuracy_report/Qwen2.5-VL-7B-Instruct.md
+++ b/docs/source/developer_guide/evaluation/accuracy_report/Qwen2.5-VL-7B-Instruct.md
@@ -0,0 +1,18 @@
+# Qwen2.5-VL-7B-Instruct
+
+* vLLM Version: vLLM: 0.10.1.1 , vLLM-KunLun Version: v0.10.1.1
+* Software Environment:OS: Ubuntu 22.04, PyTorch ≥ 2.5.1
+* Hardware Environment: KunLun P800
+* Parallel mode:TP1
+
+```
+-------------+---------------------+--------------+---------------+-------+
+|  task_type  |       metric        | dataset_name | average_score | count |
+-------------+---------------------+--------------+---------------+-------+
+|    exam     |         acc         |   mmmu_pro   |     0.521     |  334  |
+|    math     |         acc         |  math_vista  |    0.6066     |  333  |
+|    exam     |         acc         |   mmlu_pro   |    0.5405     |  111  |
+| instruction | prompt_level_strict |    ifeval    |    0.6937     |  111  |
+|    math     |         acc         |    gsm8k     |    0.8288     |  111  |
+-------------+---------------------+--------------+---------------+-------+
+```
--- a/docs/source/developer_guide/evaluation/accuracy_report/index.md
+++ b/docs/source/developer_guide/evaluation/accuracy_report/index.md
@@ -0,0 +1,10 @@
+# Accuracy Report
+
+:::{toctree}
+:caption: Accuracy Report
+:maxdepth: 1
+Qwen2.5-VL-7B-Instruct
+InternVL3_5-30B-A3B
+GLM-4.5
+GLM-4.5-Air
+:::