forked from EngineX-Ascend/enginex-ascend-910-vllm
Update README.md
添加截止到10.21的模型测试结果
This commit is contained in:
43
README.md
43
README.md
@@ -71,11 +71,42 @@ curl -X POST http://localhost:10086/v1/chat/completions \
|
||||
}'
|
||||
```
|
||||
|
||||
## 开始使用
|
||||
## 测试数据集
|
||||
|
||||
推荐您使用以下版本快速开始使用:
|
||||
视觉多模态任务数据集见 vlm-dataset
|
||||
|
||||
| Version | Release type | Doc |
|
||||
|------------|--------------|--------------------------------------|
|
||||
|v0.11.0rc0| 最新RC版本 |请查看[快速开始](https://vllm-ascend.readthedocs.io/en/latest/quick_start.html)和[安装指南](https://vllm-ascend.readthedocs.io/en/latest/installation.html)了解更多|
|
||||
|v0.9.1| 最新正式/稳定版本 |[快速开始](https://vllm-ascend.readthedocs.io/en/v0.9.1-dev/quick_start.html) and [安装指南](https://vllm-ascend.readthedocs.io/en/v0.9.1-dev/installation.html)了解更多|
|
||||
## 昇腾-910系列上模型运行测试结果
|
||||
在昇腾-910系列上对部分模型进行适配,测试方式为在 Nvidia A100 和 昇腾-910B4 加速卡上对对应数据集进行测试,获取运行时间
|
||||
|
||||
### 视觉多模态
|
||||
|
||||
| 模型名称 | 昇腾-910B4运行时间/s | Nvidia A100运行时间/s |
|
||||
| ----------------------- | -------------- | ----------------- |
|
||||
| Qwen2.5-VL-3B-Instruct | 7.5688 | 3.4735 |
|
||||
| Qwen2.5-VL-7B-Instruct | 10.6117 | 4.5430 |
|
||||
| Qwen2-VL-7B-Instruct | 4.3974 | 2.7123 |
|
||||
| Qwen2-VL-2B-Instruct | 7.9134 | 2.6749 |
|
||||
| InternVL2_5-1B-MPO | 3.6658 | 1.9166 |
|
||||
| InternVL2-8B | 15.8963 | 3.7747 |
|
||||
| InternVL2_5-2B | 11.3071 | 2.3767 |
|
||||
| InternVL2_5-1B | 10.9917 | 2.0399 |
|
||||
| InternVL2_5-4B | 11.0892 | 2.6751 |
|
||||
| InternVL2-1B | 4.6318 | 2.0094 |
|
||||
| InternVL2_5-8B-MPO | 10.7414 | 2.6034 |
|
||||
| Phi-3.5-vision-instruct | 14.5275 | 3.4563 |
|
||||
| MiniCPM-V-4_5 | 31.9896 | 3.4504 |
|
||||
| InternVL3-1B-hf | 19.9975 | 2.8482 |
|
||||
| InternVL3-8B-Instruct | 9.6205 | 2.4711 |
|
||||
| InternVL3-2B-hf | 17.7860 | 3.0497 |
|
||||
| InternVL3-9B | 13.1422 | 3.7643 |
|
||||
| MiniCPM-V-4 | 13.7100 | 3.7743 |
|
||||
| llava-1.5-7b-hf | 8.8733 | 2.5678 |
|
||||
| MiMo-VL-7B-RL | 28.3977 | 8.8021 |
|
||||
| InternVL2-4B | 29.3529 | 7.0642 |
|
||||
|
||||
### 统一多模态(暂时用视觉多模态的数据集测试)
|
||||
|
||||
| 模型名称 | 昇腾-910B4运行时间/s | Nvidia A100运行时间/s |
|
||||
| ----------------------- | -------------- | ----------------- |
|
||||
| Qwen2.5-Omni-3B | 13.9121 | 10.6149 |
|
||||
| Qwen2.5-Omni-7B | 12.8182 | 4.3004 |
|
||||
Reference in New Issue
Block a user