### What this PR does / why we need it? #4443 introduces a precision issue in scenarios where MTP >= 3 + deepseek v3.1, and this pr reverts it - vLLM version: release/v0.13.0 - vLLM main: bc0a5a0c08 Signed-off-by: GDzhu01 <809721801@qq.com>
bc0a5a0c08
with_prefill
set_ascend_forward_context