[Readme] EPLB Support Scenarios (#4315)

### What this PR does / why we need it?
Add information on the scope of EPLB support.

---------

Signed-off-by: shenchuxiaofugui <1311027364@qq.com>
LI SHENGYONG
2025-11-21 14:25:39 +08:00
committed by GitHub
parent 9c6d0b422c
commit c94b38c82e
2 changed files with 14 additions and 0 deletions

@@ -12,6 +12,13 @@ Expert balancing for MoE models in LLM serving is essential for optimal performa
- Adaptive Scaling: Automatically adjusts to workload fluctuations while maintaining stable performance.
- Fault Tolerance: Redundant expert placement ensures system resilience during hardware failures.
## Support Scenarios
### Models:
DeepseekV3/V3.1/R1, Qwen3-MOE
### MoE QuantType:
W8A8-dynamic
## How to Use EPLB
### Dynamic EPLB