From 64fdf4cbef2752931826aae0aa409a3b322a39f9 Mon Sep 17 00:00:00 2001 From: Li Wang Date: Thu, 17 Apr 2025 14:56:51 +0800 Subject: [PATCH] [Doc]Update faq (#536) ### What this PR does / why we need it? update performance and accuracy faq Signed-off-by: wangli --- docs/source/faqs.md | 14 +++++++++++++- 1 file changed, 13 insertions(+), 1 deletion(-) diff --git a/docs/source/faqs.md b/docs/source/faqs.md index 6b052a4..77260c6 100644 --- a/docs/source/faqs.md +++ b/docs/source/faqs.md @@ -116,4 +116,16 @@ Currently, running on v0.7.3, we should run w8a8 with vllm + vllm-ascend + mindi ### 13. There is not output in log when loading models using vllm-ascend, How to solve it? -If you're using vllm 0.7.3 version, this is a known progress bar display issue in VLLM, which has been resolved in [this PR](https://github.com/vllm-project/vllm/pull/12428), please cherry-pick it locally by yourself. Otherwise, please fill up an issue. \ No newline at end of file +If you're using vllm 0.7.3 version, this is a known progress bar display issue in VLLM, which has been resolved in [this PR](https://github.com/vllm-project/vllm/pull/12428), please cherry-pick it locally by yourself. Otherwise, please fill up an issue. + +### 14. How vllm-ascend is tested + +vllm-ascend is tested by functionnal test, performance test and accuracy test. + +- **Functionnal test**: we added CI, includes portion of vllm's native unit tests and vllm-ascend's own unit tests,on vllm-ascend's test, we test basic functional usability for popular models, include `Qwen2.5-7B-Instruct`、 `Qwen2.5-VL-7B-Instruct`、`Qwen2.5-VL-32B-Instruct`、`QwQ-32B`. + +- **Performance test**: we provide [benchmark](https://github.com/vllm-project/vllm-ascend/tree/main/benchmarks) tools for end-to-end performance benchmark which can easily to re-route locally, we'll publish a perf website like [vllm](https://simon-mo-workspace.observablehq.cloud/vllm-dashboard-v0/perf) does to show the performance test results for each pull request + +- **Accuracy test**: we're working on adding accuracy test to CI as well. + +Finnall, for each release, we'll publish the performance test and accuracy test report in the future.