xc-llm-ascend

Files

LI SHENGYONG 4b2f0130bc [V0.18.0][EPLB][BugFix] Fix moe_load precision in allgather (#7890 )

### What this PR does / why we need it?
Fixed the bug of incorrect reshape usage.
For example:
ori_tensor: [[1, 2, 3], [4, 5, 6]]
after reshape:
[[1, 2], [3, 4], [5, 6]]
after permute:
[[1, 4], [2, 5], [3, 6]]
Now, we will directly use squeeze for a more intuitive understanding.
pr for main:
#7887 

### Does this PR introduce _any_ user-facing change?
The actual peak-to-average ratio has successfully decreased.

Signed-off-by: shenchuxiaofugui <1311027364@qq.com>

2026-04-02 09:20:31 +08:00

adaptor

[EPLB][bugfix] Bugfix for fused mc2 (#6794 )

2026-03-09 11:26:57 +08:00

core

[EPLB] Reduce the memory used for batch_isend_irecv (#7344 )

2026-03-20 12:25:58 +08:00

__init__.py

Dynamic Expert Load Balance with Zero-like-overhead (#2956 )

2025-09-17 10:36:43 +08:00

eplb_updator.py

[V0.18.0][EPLB][BugFix] Fix moe_load precision in allgather (#7890 )

2026-04-02 09:20:31 +08:00

utils.py

[EPLB] Avoiding eplb's dependency on a specified model (#6528 )

2026-02-10 15:58:44 +08:00