Commit Graph

6 Commits

Author SHA1 Message Date
HouseWest
ca47e24f5d [Feature] improve TBO: two chunk overlap (#8144) 2025-08-05 21:11:01 -07:00
Cheng Wan
6c88f6c8d9 [5/N] MoE Refactor: Update MoE parallelism arguments (#8658) 2025-08-01 01:20:03 -07:00
Emmanuel Ferdman
f40942ad63 Migrate to assertEqual (#6741) Signed-off-by: Emmanuel Ferdman <emmanuelferdman@gmail.com> 2025-06-09 16:47:39 -07:00
Yi Zhang
f9bab3d591 qwen3moe support two batch overlap (#6598) 2025-05-25 23:08:16 -07:00
fzyzcjy
a191a0e47c Improve performance of two batch overlap in some imbalanced cases (#6593) 2025-05-25 22:36:18 -07:00
fzyzcjy
0d47788025 Support overlapping two batches (#4068) 2025-05-24 17:39:07 -07:00