LI SHENGYONG
611e223b7d
[EPLB][Bugfix] EPLB support fp/bf16 (#5531)
### What this PR does / why we need it?
EPLB support dtype of fp/bf16.
### Does this PR introduce _any_ user-facing change?
### How was this patch tested?
w8a8_dynamic Baseline:
| dataset | version | metric | mode | vllm-api-general-chat |
|----- | ----- | ----- | ----- | -----|
| aime2024 | 604a78 | accuracy | gen | 86.67 |
w8a8_dynamic eplb:
| dataset | version | metric | mode | vllm-api-general-chat |
|----- | ----- | ----- | ----- | -----|
| aime2024 | 604a78 | accuracy | gen | 86.67 |
The fp16 conversation is normal.
The fp16 test is in progress.
Baseline fp16
| dataset | version | metric | mode | vllm-api-general-chat |
|----- | ----- | ----- | ----- | -----|
| aime2024 | 604a78 | accuracy | gen | 86.67 |
eplb fp16
| dataset | version | metric | mode | vllm-api-general-chat |
|----- | ----- | ----- | ----- | -----|
| aime2024 | 604a78 | accuracy | gen | 83.33 |
- vLLM version: v0.13.0
- vLLM main:
45c1ca1ca1
Signed-off-by: shenchuxiaofugui <1311027364@qq.com>
2026-01-26 14:28:16 +08:00
..
2026-01-26 14:28:16 +08:00
2026-01-26 09:15:06 +08:00
2025-12-17 08:53:44 +08:00
2026-01-24 20:34:29 +08:00
2026-01-10 22:57:57 +08:00
2026-01-08 09:05:02 +08:00
2026-01-24 20:34:29 +08:00
2026-01-23 14:13:47 +08:00
2025-12-25 10:43:24 +08:00
2026-01-23 09:45:08 +08:00
2026-01-26 10:20:24 +08:00
2026-01-21 22:01:22 +08:00
2026-01-23 09:45:08 +08:00
2025-12-12 14:41:20 +08:00
2025-10-31 17:16:31 +08:00