[Bugfix] Fix the error "cur batch_size is invalid" during profile_run in the torchair scenario (#3243)

### What this PR does / why we need it?
Fix the error "cur batch_size is invalid" during profile_run in the
torchair scenario.

### Does this PR introduce _any_ user-facing change?

### How was this patch tested?

- vLLM version: v0.10.2
- vLLM main:
https://github.com/vllm-project/vllm/commit/releases/v0.11.0

Signed-off-by: WithHades <244036962@qq.com>
This commit is contained in:
无脸男
2025-09-29 11:51:07 +08:00
committed by GitHub
parent 8870966031
commit 373f84a193

View File

@@ -2513,7 +2513,7 @@ class NPUModelRunner(LoRAModelRunnerMixin):
if self._select_moe_comm_method(
self.mc2_tokens_capacity,
with_prefill=True) == MoECommType.MC2:
self._dummy_run(self.mc2_tokens_capacity)
self._dummy_run(self.mc2_tokens_capacity, with_prefill=True)
output = None
if get_pp_group().is_last_rank: