From ab34025f004889016e54aa0ed54defb5f43467b0 Mon Sep 17 00:00:00 2001 From: luopingyi Date: Tue, 21 Oct 2025 11:10:03 +0800 Subject: [PATCH] Update README.md MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit 添加截止到10.21的模型测试结果 --- README.md | 43 +++++++++++++++++++++++++++++++++++++------ 1 file changed, 37 insertions(+), 6 deletions(-) diff --git a/README.md b/README.md index d8aec83..2204faa 100644 --- a/README.md +++ b/README.md @@ -71,11 +71,42 @@ curl -X POST http://localhost:10086/v1/chat/completions \ }' ``` -## 开始使用 +## 测试数据集 -推荐您使用以下版本快速开始使用: +视觉多模态任务数据集见 vlm-dataset -| Version | Release type | Doc | -|------------|--------------|--------------------------------------| -|v0.11.0rc0| 最新RC版本 |请查看[快速开始](https://vllm-ascend.readthedocs.io/en/latest/quick_start.html)和[安装指南](https://vllm-ascend.readthedocs.io/en/latest/installation.html)了解更多| -|v0.9.1| 最新正式/稳定版本 |[快速开始](https://vllm-ascend.readthedocs.io/en/v0.9.1-dev/quick_start.html) and [安装指南](https://vllm-ascend.readthedocs.io/en/v0.9.1-dev/installation.html)了解更多| +## 昇腾-910系列上模型运行测试结果 +在昇腾-910系列上对部分模型进行适配,测试方式为在 Nvidia A100 和 昇腾-910B4 加速卡上对对应数据集进行测试,获取运行时间 + +### 视觉多模态 + +| 模型名称 | 昇腾-910B4运行时间/s | Nvidia A100运行时间/s | +| ----------------------- | -------------- | ----------------- | +| Qwen2.5-VL-3B-Instruct | 7.5688 | 3.4735 | +| Qwen2.5-VL-7B-Instruct | 10.6117 | 4.5430 | +| Qwen2-VL-7B-Instruct | 4.3974 | 2.7123 | +| Qwen2-VL-2B-Instruct | 7.9134 | 2.6749 | +| InternVL2_5-1B-MPO | 3.6658 | 1.9166 | +| InternVL2-8B | 15.8963 | 3.7747 | +| InternVL2_5-2B | 11.3071 | 2.3767 | +| InternVL2_5-1B | 10.9917 | 2.0399 | +| InternVL2_5-4B | 11.0892 | 2.6751 | +| InternVL2-1B | 4.6318 | 2.0094 | +| InternVL2_5-8B-MPO | 10.7414 | 2.6034 | +| Phi-3.5-vision-instruct | 14.5275 | 3.4563 | +| MiniCPM-V-4_5 | 31.9896 | 3.4504 | +| InternVL3-1B-hf | 19.9975 | 2.8482 | +| InternVL3-8B-Instruct | 9.6205 | 2.4711 | +| InternVL3-2B-hf | 17.7860 | 3.0497 | +| InternVL3-9B | 13.1422 | 3.7643 | +| MiniCPM-V-4 | 13.7100 | 3.7743 | +| llava-1.5-7b-hf | 8.8733 | 2.5678 | +| MiMo-VL-7B-RL | 28.3977 | 8.8021 | +| InternVL2-4B | 29.3529 | 7.0642 | + +### 统一多模态(暂时用视觉多模态的数据集测试) + +| 模型名称 | 昇腾-910B4运行时间/s | Nvidia A100运行时间/s | +| ----------------------- | -------------- | ----------------- | +| Qwen2.5-Omni-3B | 13.9121 | 10.6149 | +| Qwen2.5-Omni-7B | 12.8182 | 4.3004 | \ No newline at end of file