[Doc] Update doc (#3836)

### What this PR does / why we need it? Update doc ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? - vLLM version: v0.11.0rc3 - vLLM main: https://github.com/vllm-project/vllm/commit/releases/v0.11.1 Signed-off-by: hfadzxy <starmoon_zhang@163.com>
2025-10-29 11:03:39 +08:00
parent 1e31b07fa7
commit 789ba4c5c2
47 changed files with 583 additions and 566 deletions
--- a/docs/source/developer_guide/evaluation/using_evalscope.md
+++ b/docs/source/developer_guide/evaluation/using_evalscope.md
@@ -2,7 +2,7 @@

 This document will guide you have model inference stress testing and accuracy testing using [EvalScope](https://github.com/modelscope/evalscope).

-## 1. Online serving
+## 1. Online server

 You can run docker container to start the vLLM server on a single NPU:

@@ -32,7 +32,7 @@ docker run --rm \
 vllm serve Qwen/Qwen2.5-7B-Instruct --max_model_len 26240
 ```

-If your service start successfully, you can see the info shown below:
+If the vLLM server is started successfully, you can see information shown below:

 ```
 INFO:     Started server process [6873]
@@ -40,7 +40,7 @@ INFO:     Waiting for application startup.
 INFO:     Application startup complete.
 ```

-Once your server is started, you can query the model with input prompts in new terminal:
+Once your server is started, you can query the model with input prompts in a new terminal:

 ```
 curl http://localhost:8000/v1/completions \
@@ -55,7 +55,7 @@ curl http://localhost:8000/v1/completions \

 ## 2. Install EvalScope using pip

-You can install EvalScope by using:
+You can install EvalScope as follows:

 ```bash
 python3 -m venv .venv-evalscope
@@ -63,9 +63,9 @@ source .venv-evalscope/bin/activate
 pip install gradio plotly evalscope
 ```

-## 3. Run gsm8k accuracy test using EvalScope
+## 3. Run GSM8K using EvalScope for accuracy testing

-You can `evalscope eval` run gsm8k accuracy test:
+You can use `evalscope eval` to run GSM8K for accuracy testing:

 ```
 evalscope eval \
@@ -77,7 +77,7 @@ evalscope eval \
 --limit 10
 ```

-After 1-2 mins, the output is as shown below:
+After 1 to 2 minutes, the output is shown below:

 ```shell
 +---------------------+-----------+-----------------+----------+-------+---------+---------+
@@ -87,7 +87,7 @@ After 1-2 mins, the output is as shown below:
 +---------------------+-----------+-----------------+----------+-------+---------+---------+
 ```

-See more detail in: [EvalScope doc - Model API Service Evaluation](https://evalscope.readthedocs.io/en/latest/get_started/basic_usage.html#model-api-service-evaluation).
+See more detail in [EvalScope doc - Model API Service Evaluation](https://evalscope.readthedocs.io/en/latest/get_started/basic_usage.html#model-api-service-evaluation).

 ## 4. Run model inference stress testing using EvalScope

@@ -99,7 +99,7 @@ pip install evalscope[perf] -U

 ### Basic usage

-You can use `evalscope perf` run perf test:
+You can use `evalscope perf` to run perf testing:

 ```
 evalscope perf \
@@ -114,7 +114,7 @@ evalscope perf \

 ### Output results

-After 1-2 mins, the output is as shown below:
+After 1 to 2 minutes, the output is shown below:

 ```shell
 Benchmarking summary:
@@ -173,4 +173,4 @@ Percentile results:
 +------------+----------+---------+-------------+--------------+---------------+----------------------+
 ```

-See more detail in: [EvalScope doc - Model Inference Stress Testing](https://evalscope.readthedocs.io/en/latest/user_guides/stress_test/quick_start.html#basic-usage).
+See more detail in [EvalScope doc - Model Inference Stress Testing](https://evalscope.readthedocs.io/en/latest/user_guides/stress_test/quick_start.html#basic-usage).