Initial commit for vLLM-Kunlun Plugin
This commit is contained in:
@@ -0,0 +1,18 @@
|
||||
# GLM-Air-4.5
|
||||
|
||||
* vLLM Version: vLLM: 0.10.1.1 , vLLM-KunLun Version: v0.10.1.1
|
||||
* Software Environment:OS: Ubuntu 22.04, PyTorch ≥ 2.5.1
|
||||
* Hardware Environment: KunLun P800
|
||||
* Parallel mode:TP8
|
||||
|
||||
```bash
|
||||
+-------------+----------+---------------+---------+-----+--------+---------+
|
||||
| Model | Dataset | Metric | Subset | Num | Score | Cat.0 |
|
||||
+-------------+----------+---------------+---------+-----+--------+---------+
|
||||
| GLM-4.5-Air | math_500 | AveragePass@1 | Level 1 | 43 | 0.9302 | default |
|
||||
| GLM-4.5-Air | math_500 | AveragePass@1 | Level 2 | 90 | 0.9222 | default |
|
||||
| GLM-4.5-Air | math_500 | AveragePass@1 | Level 3 | 105 | 0.8762 | default |
|
||||
| GLM-4.5-Air | math_500 | AveragePass@1 | Level 4 | 128 | 0.8984 | default |
|
||||
| GLM-4.5-Air | math_500 | AveragePass@1 | Level 5 | 134 | 0.8955 | default |
|
||||
+-------------+----------+---------------+---------+-----+--------+---------+
|
||||
```
|
||||
@@ -0,0 +1,18 @@
|
||||
# GLM-4.5
|
||||
|
||||
* vLLM Version: vLLM: 0.10.1.1 , vLLM-KunLun Version: v0.10.1.1
|
||||
* Software Environment:OS: Ubuntu 22.04, PyTorch ≥ 2.5.1
|
||||
* Hardware Environment: KunLun P800
|
||||
* Parallel mode:TP8
|
||||
|
||||
```bash
|
||||
+---------+----------+---------------+---------+-----+--------+---------+
|
||||
| Model | Dataset | Metric | Subset | Num | Score | Cat.0 |
|
||||
+---------+----------+---------------+---------+-----+--------+---------+
|
||||
| GLM-4.5 | math_500 | AveragePass@1 | Level 1 | 43 | 0.9302 | default |
|
||||
| GLM-4.5 | math_500 | AveragePass@1 | Level 2 | 90 | 0.8111 | default |
|
||||
| GLM-4.5 | math_500 | AveragePass@1 | Level 3 | 105 | 0.7143 | default |
|
||||
| GLM-4.5 | math_500 | AveragePass@1 | Level 4 | 128 | 0.6172 | default |
|
||||
| GLM-4.5 | math_500 | AveragePass@1 | Level 5 | 134 | 0.5149 | default |
|
||||
+---------+----------+---------------+---------+-----+--------+---------+
|
||||
```
|
||||
@@ -0,0 +1,18 @@
|
||||
# InternVL3_5-30B-A3B
|
||||
|
||||
* vLLM Version: vLLM: 0.10.1.1 , vLLM-KunLun Version: v0.10.1.1
|
||||
* Software Environment:OS: Ubuntu 22.04, PyTorch ≥ 2.5.1
|
||||
* Hardware Environment: KunLun P800
|
||||
* Parallel mode:TP8
|
||||
|
||||
```
|
||||
+-------------+---------------------+--------------+---------------+-------+
|
||||
| task_type | metric | dataset_name | average_score | count |
|
||||
+-------------+---------------------+--------------+---------------+-------+
|
||||
| exam | acc | mmmu_pro | 0.5449 | 334 |
|
||||
| math | acc | math_vista | 0.6847 | 333 |
|
||||
| exam | acc | mmlu_pro | 0.6126 | 111 |
|
||||
| instruction | prompt_level_strict | ifeval | 0.7658 | 111 |
|
||||
| math | acc | gsm8k | 0.9369 | 111 |
|
||||
+-------------+---------------------+--------------+---------------+-------+
|
||||
```
|
||||
@@ -0,0 +1,18 @@
|
||||
# Qwen2.5-VL-7B-Instruct
|
||||
|
||||
* vLLM Version: vLLM: 0.10.1.1 , vLLM-KunLun Version: v0.10.1.1
|
||||
* Software Environment:OS: Ubuntu 22.04, PyTorch ≥ 2.5.1
|
||||
* Hardware Environment: KunLun P800
|
||||
* Parallel mode:TP1
|
||||
|
||||
```
|
||||
+-------------+---------------------+--------------+---------------+-------+
|
||||
| task_type | metric | dataset_name | average_score | count |
|
||||
+-------------+---------------------+--------------+---------------+-------+
|
||||
| exam | acc | mmmu_pro | 0.521 | 334 |
|
||||
| math | acc | math_vista | 0.6066 | 333 |
|
||||
| exam | acc | mmlu_pro | 0.5405 | 111 |
|
||||
| instruction | prompt_level_strict | ifeval | 0.6937 | 111 |
|
||||
| math | acc | gsm8k | 0.8288 | 111 |
|
||||
+-------------+---------------------+--------------+---------------+-------+
|
||||
```
|
||||
@@ -0,0 +1,10 @@
|
||||
# Accuracy Report
|
||||
|
||||
:::{toctree}
|
||||
:caption: Accuracy Report
|
||||
:maxdepth: 1
|
||||
Qwen2.5-VL-7B-Instruct
|
||||
InternVL3_5-30B-A3B
|
||||
GLM-4.5
|
||||
GLM-4.5-Air
|
||||
:::
|
||||
Reference in New Issue
Block a user