[feature] vllm-ascend support msprobe (eager mode dump) (#4241)

### What this PR does / why we need it?
vllm-ascend needs to dump data during model execution to debug precision
problems. msprobe provides this capability, so this PR integrates msprobe
into vllm-ascend to make debugging easier.

### Does this PR introduce _any_ user-facing change?
```
'dump_config': '/path/to/config.json'
```
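As a minimal sketch of how the new key might be supplied, the snippet below builds an `additional_config` mapping containing `dump_config` and serializes it to JSON. The helper name `build_additional_config` and the path are hypothetical. The commented-out `LLM(...)` call is an assumption about typical usage, based on vLLM's `additional_config` and `enforce_eager` arguments, not something this PR shows. The contents of the msprobe config file itself are not covered here.

```python
import json

# Hypothetical path; point this at your own msprobe configuration file.
DUMP_CONFIG_PATH = "/path/to/config.json"


def build_additional_config(dump_config_path: str) -> dict:
    """Return an additional_config mapping carrying the msprobe dump config path.

    Hypothetical helper for illustration only; not part of vllm-ascend.
    """
    return {"dump_config": dump_config_path}


additional_config = build_additional_config(DUMP_CONFIG_PATH)

# JSON form of the same mapping, e.g. for passing it as a single string.
additional_config_json = json.dumps(additional_config)
print(additional_config_json)

# Assumed usage with vLLM's offline API (not executed here; needs vllm and
# vllm-ascend installed, and eager mode because the dump targets eager mode):
# from vllm import LLM
# llm = LLM(model="<your-model>", enforce_eager=True,
#           additional_config=additional_config)
```

Keeping the path in one mapping means the same value can be reused for both the Python API and any JSON-based configuration entry point.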



- vLLM version: v0.11.0
- vLLM main: 2918c1b49c

---------

Signed-off-by: Tjh-UKN <2559659915@qq.com>
Tjh-UKN
2025-11-24 21:58:31 +08:00
committed by GitHub
parent 5b1a7514eb
commit 00ea61ec88
17 changed files with 1385 additions and 159 deletions


@@ -42,6 +42,7 @@ The following table lists additional configuration options available in vLLM Asc
| `num_wait_worker_iterations` | int | `30` | Number of forward iterations after which the EPLB worker is expected to have finished its CPU tasks. In our tests, the default value of 30 covers most cases. |
| `expert_map_record_path` | str | `None` | When dynamic EPLB is completed, save the current expert load heatmap to the specified path. |
| `init_redundancy_expert` | int | `0` | Specify redundant experts during initialization. |
| `dump_config` | str | `None` | Configuration file path for msprobe dump (eager mode). |

The details of each configuration option are as follows: