LI SHENGYONG
cd59323e40
[Bugfix] Revert pr4214 multi-stream collect expert hotpot (#5529)
### What this PR does / why we need it?
PR4214 was intended to collect expert heat by processing multiple
streams, which could lead to memory overwriting and accuracy issues.
After communicating with the PR submitter, this PR has been reverted.
### Does this PR introduce _any_ user-facing change?
### How was this patch tested?
qwen3-moe dynamic eplb
Befor revert
| dataset | version | metric | mode | vllm-api-general-chat |
|----- | ----- | ----- | ----- | -----|
| aime2024 | 604a78 | accuracy | gen | 43.33 |
After revert
| dataset | version | metric | mode | vllm-api-general-chat |
|----- | ----- | ----- | ----- | -----|
| aime2024 | 604a78 | accuracy | gen | 86.67 |
baseline (without eplb)
| dataset | version | metric | mode | vllm-api-general-chat |
|----- | ----- | ----- | ----- | -----|
| aime2024 | 604a78 | accuracy | gen | 86.67 |
- vLLM version: v0.13.0
- vLLM main:
45c1ca1ca1
Signed-off-by: shenchuxiaofugui <1311027364@qq.com>
2026-01-07 11:26:47 +08:00
..
2026-01-07 11:26:47 +08:00
2026-01-07 09:11:26 +08:00
2025-12-17 08:53:44 +08:00
2025-11-26 14:28:55 +08:00
2025-12-18 20:25:44 +08:00
2026-01-06 16:41:39 +08:00
2025-12-25 10:43:24 +08:00
2026-01-06 16:41:39 +08:00
2025-12-27 18:42:46 +08:00
2025-12-29 15:28:34 +08:00
2026-01-05 20:12:41 +08:00
2026-01-06 16:41:39 +08:00
2025-12-12 14:41:20 +08:00
2025-10-31 17:16:31 +08:00