LI SHENGYONG
611e223b7d
[EPLB][Bugfix] EPLB support fp/bf16 (#5531)
### What this PR does / why we need it?
EPLB support dtype of fp/bf16.
### Does this PR introduce _any_ user-facing change?
### How was this patch tested?
w8a8_dynamic Baseline:
| dataset | version | metric | mode | vllm-api-general-chat |
|----- | ----- | ----- | ----- | -----|
| aime2024 | 604a78 | accuracy | gen | 86.67 |
w8a8_dynamic eplb:
| dataset | version | metric | mode | vllm-api-general-chat |
|----- | ----- | ----- | ----- | -----|
| aime2024 | 604a78 | accuracy | gen | 86.67 |
The fp16 conversation is normal.
The fp16 test is in progress.
Baseline fp16
| dataset | version | metric | mode | vllm-api-general-chat |
|----- | ----- | ----- | ----- | -----|
| aime2024 | 604a78 | accuracy | gen | 86.67 |
eplb fp16
| dataset | version | metric | mode | vllm-api-general-chat |
|----- | ----- | ----- | ----- | -----|
| aime2024 | 604a78 | accuracy | gen | 83.33 |
- vLLM version: v0.13.0
- vLLM main:
45c1ca1ca1
Signed-off-by: shenchuxiaofugui <1311027364@qq.com>
2026-01-26 14:28:16 +08:00
..
2026-01-26 14:12:33 +08:00
2025-11-28 18:06:39 +08:00
2026-01-26 09:04:54 +08:00
2026-01-26 09:04:54 +08:00
2026-01-24 22:10:18 +08:00
2026-01-19 08:59:46 +08:00
2026-01-19 08:59:46 +08:00
2026-01-26 09:04:54 +08:00
2026-01-26 14:28:16 +08:00
2026-01-24 22:45:38 +08:00
2026-01-24 22:45:38 +08:00
2026-01-24 22:08:33 +08:00
2026-01-26 14:28:16 +08:00
2026-01-24 22:08:33 +08:00
2026-01-23 14:13:47 +08:00
2026-01-26 09:08:42 +08:00
2026-01-26 09:04:54 +08:00
2026-01-26 14:05:23 +08:00
2026-01-22 15:46:59 +08:00
2026-01-16 20:57:46 +08:00
2026-01-24 22:49:33 +08:00
2026-01-25 17:45:29 +08:00
2026-01-19 08:59:46 +08:00
2026-01-16 20:57:46 +08:00
2026-01-22 09:26:39 +08:00
2026-01-16 20:57:46 +08:00
2026-01-16 20:57:46 +08:00
2026-01-25 17:39:19 +08:00
2026-01-16 20:57:46 +08:00
2026-01-24 20:34:29 +08:00