Commit Graph

16 Commits

Author SHA1 Message Date
Chang Su
28684f909d [router] upgrade kernel version in pd ci (#9720) 2025-08-27 16:02:41 -07:00
Shangming Cai
8ca07bd948 [CI] Fix sgl-router disaggregation test (#9222)
Signed-off-by: Shangming Cai <csmthu@gmail.com>
2025-08-15 02:24:44 -07:00
Jiaqi Gu
c9ee738515 Fuse writing KV buffer into rope kernel (part 2: srt) (#9014)
Co-authored-by: fzyzcjy <5236035+fzyzcjy@users.noreply.github.com>
2025-08-12 13:15:30 -07:00
Lianmin Zheng
2c7f01bc89 Reorganize CI and test files (#9027) 2025-08-10 12:30:06 -07:00
Simo Lin
3817a37d87 [router] upgrade to latest sgl kernel for router ci (#9019) 2025-08-09 21:49:18 -07:00
Lianmin Zheng
706bd69cc5 Clean up server_args.py to have a dedicated function for model specific adjustments (#8983) 2025-08-08 19:56:50 -07:00
Simo Lin
16a4c66d25 [router] update pd router ci summary step with new threshold (#8916) 2025-08-07 07:15:38 -07:00
Simo Lin
89e6521c61 [router] re-enable pd router benchmark CI (#8912) 2025-08-07 06:29:36 -07:00
Liangsheng Yin
7a27e798ca [CI] Do not trigger pd-disaggregation CI in draft PR (#8737) 2025-08-04 05:12:20 +08:00
Simo Lin
aee0ef52f5 [router] update router pypi version (#8628) 2025-07-31 11:24:12 -07:00
Simo Lin
ae807774f5 [ci] fix genai-bench execution cmd (#8629) 2025-07-31 10:40:54 -07:00
Keyang Ru
7c9697178e [CI]Add genai-bench Performance Validation for PD Router (#8477)
Co-authored-by: key4ng <rukeyang@gamil.com>
2025-07-28 16:58:23 -07:00
Shangming Cai
70e37b97bf chore: upgrade mooncake 0.3.5 (#8341)
Signed-off-by: Shangming Cai <caishangming@linux.alibaba.com>
2025-07-25 01:17:26 -07:00
Simo Lin
c8f31042a8 [router] Refactor router and policy traits with dependency injection (#7987)
Co-authored-by: Jin Pan <jpan236@wisc.edu>
Co-authored-by: Keru Yang <rukeyang@gmail.com>
Co-authored-by: Yingyi Huang <yingyihuang2000@outlook.com>
Co-authored-by: Philip Zhu <phlipzhux@gmail.com>
2025-07-18 14:24:24 -07:00
Shangming Cai
2ff572e28c [CI][Router] Fix bench_one_batch_server for pd router test (#7731)
Signed-off-by: Shangming Cai <caishangming@linux.alibaba.com>
2025-07-02 23:18:24 -07:00
Keyang Ru
29bd4c8135 [CI] Add CI Testing for Prefill-Decode Disaggregation with Router (#7540) 2025-06-27 00:18:56 -07:00