### What this PR does / why we need it? The GPQA dataset accuracy in the PD separation scenario of testing is 33.2, which does not meet the paper's requirement of 70. Resolve this accuracy issue. ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? qpqa has accuracy issues, but modifying the code can ensure the accuracy meets the standard - vLLM version: v0.11.0rc3 - vLLM main: https://github.com/vllm-project/vllm/commit/v0.11.0 --------- Signed-off-by: fjw <2270923832@qq.com>