Commit Graph

390 Commits

Author SHA1 Message Date
Yineng Zhang
1ac16add8b chore: support blackwell cu129 image (#8928) 2025-08-07 14:24:57 -07:00
Simo Lin
16a4c66d25 [router] update pd router ci summary step with new threshold (#8916) 2025-08-07 07:15:38 -07:00
Simo Lin
89e6521c61 [router] re-enable pd router benchmark CI (#8912) 2025-08-07 06:29:36 -07:00
fzyzcjy
b114a8105b Support B200 in CI (#8861) 2025-08-06 21:42:44 +08:00
Yineng Zhang
aeac900ca2 fix: resolve ci issue (#8859) 2025-08-06 02:28:14 -07:00
Yineng Zhang
3ae8e3ea8f chore: upgrade torch 2.8.0 (#8836) 2025-08-05 17:32:01 -07:00
kk
32d9e39a29 Fix potential memory fault issue and ncclSystemError in CI test (#8681)
Co-authored-by: wunhuang <wunhuang@amd.com>
2025-08-05 12:19:37 -07:00
Yineng Zhang
194561f27a feat: support sgl-kernel cu129 (#8800) 2025-08-05 02:33:47 -07:00
Even Zhou
fee0ab0fba [CI] Ascend NPU CI enhancement (#8294)
Co-authored-by: ronnie_zheng <zl19940307@163.com>
2025-08-03 22:16:38 -07:00
Liangsheng Yin
7a27e798ca [CI] Do not trigger pd-disaggregation CI in draft PR (#8737) 2025-08-04 05:12:20 +08:00
Yineng Zhang
5ce5093b97 chore: bump sgl-kernel 0.3.0 with torch 2.8.0 (#8718) 2025-08-03 02:31:50 -07:00
li chaoran
fe5086fd8b chore: speedup NPU CI by cache (#8270)
Signed-off-by: mywaaagh_admin <pkwarcraft@gmail.com>
Co-authored-by: ronnie_zheng <zl19940307@163.com>
2025-07-31 17:29:50 -07:00
Simo Lin
aee0ef52f5 [router] update router pypi version (#8628) 2025-07-31 11:24:12 -07:00
Simo Lin
ae807774f5 [ci] fix genai-bench execution cmd (#8629) 2025-07-31 10:40:54 -07:00
yihong
09f1a247ce fix: fork should not run pypi router (#8604)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
2025-07-31 02:37:13 -07:00
Simo Lin
a9fd80336d [router] allow longer time out for router e2e (#8560) 2025-07-29 23:43:37 -07:00
Lianmin Zheng
263c9236a0 Always trigger pr-test (#8527) 2025-07-29 04:05:19 -07:00
Lianmin Zheng
69712e6f55 Rename the last step in pr-test.yml as pr-test-finish (#8486) 2025-07-28 19:06:13 -07:00
Keyang Ru
7c9697178e [CI]Add genai-bench Performance Validation for PD Router (#8477)
Co-authored-by: key4ng <rukeyang@gamil.com>
2025-07-28 16:58:23 -07:00
kyleliang-nv
5922c0cbf6 Remove zstd compression for building Dockerfile.gb200 (#8442) 2025-07-27 22:58:53 -07:00
Stefan He
4ad9737045 chore: bump transformer to 4.54.0 (#8416)
Co-authored-by: Binyao Jiang <byjiang1996@gmail.com>
Co-authored-by: Lifu Huang <lifu.hlf@gmail.com>
2025-07-27 21:27:25 -07:00
kyleliang-nv
bb81daefb8 Fix docker buildx push error (#8425) 2025-07-27 17:59:38 -07:00
kyleliang-nv
95217a9b4d Change to use native arm runner (#8414) 2025-07-27 12:48:12 -07:00
kyleliang-nv
62a6b7c773 Add docker release flow for gb200 (#8394) 2025-07-26 21:25:07 -07:00
Lifu Huang
5c705b1dce Add perf tests for LoRA (#8314) 2025-07-26 14:55:22 -07:00
Shangming Cai
70e37b97bf chore: upgrade mooncake 0.3.5 (#8341)
Signed-off-by: Shangming Cai <caishangming@linux.alibaba.com>
2025-07-25 01:17:26 -07:00
ronnie_zheng
93d124ef5a [feature] enable NPU CI (#7935)
Co-authored-by: Even Zhou <14368888+iforgetmyname@users.noreply.github.com>
2025-07-20 13:12:42 -07:00
Lianmin Zheng
9c7a46180c [Doc] Steps to add a new attention backend (#8155) 2025-07-18 16:38:26 -07:00
Simo Lin
c8f31042a8 [router] Refactor router and policy traits with dependency injection (#7987)
Co-authored-by: Jin Pan <jpan236@wisc.edu>
Co-authored-by: Keru Yang <rukeyang@gmail.com>
Co-authored-by: Yingyi Huang <yingyihuang2000@outlook.com>
Co-authored-by: Philip Zhu <phlipzhux@gmail.com>
2025-07-18 14:24:24 -07:00
Cheng Wan
02404a1e35 [ci] recover 8-gpu deepep test (#8105) 2025-07-17 00:46:40 -07:00
Simo Lin
8a7a7770e5 [ci] limit cmake build nproc (#8100) 2025-07-16 18:09:28 -07:00
Sai Enduri
f06bd210c0 Update amd docker image. (#8045)
Co-authored-by: Hubert Lu <55214931+hubertlu-tw@users.noreply.github.com>
2025-07-15 15:09:56 -07:00
Sai Enduri
5dc5866e8e Setup workflow for releasing mi300x and mi350x dockers. (#8035) 2025-07-14 21:51:43 -07:00
Cheng Wan
475a249bb8 temporarily disable deepep-8-gpu and activate two small tests (#7961) 2025-07-11 14:22:05 -07:00
Cheng Wan
d487555f84 [CI] Add deepep tests to CI (#7872) 2025-07-09 01:49:47 -07:00
Yineng Zhang
625018d259 fix: free disk space (#7803) 2025-07-05 18:52:25 -07:00
Shangming Cai
2ff572e28c [CI][Router] Fix bench_one_batch_server for pd router test (#7731)
Signed-off-by: Shangming Cai <caishangming@linux.alibaba.com>
2025-07-02 23:18:24 -07:00
Hubert Lu
3b3f1e3aeb [AMD] Add unit-test-sgl-kernel-amd to AMD CI (#7539) 2025-06-29 15:50:09 -07:00
Simo Lin
7c0db3a6c5 [bugfix] Remove PR comment posting from Rust benchmark workflow (#7625) 2025-06-28 22:10:01 -07:00
Keyang Ru
29bd4c8135 [CI] Add CI Testing for Prefill-Decode Disaggregation with Router (#7540) 2025-06-27 00:18:56 -07:00
Simo Lin
3abc30364d [ci] add router benchmark script and CI (#7498) 2025-06-25 01:28:25 -07:00
Lianmin Zheng
55e03b10c4 Fix a bug in BatchTokenIDOut & Misc style and dependency updates (#7457) 2025-06-23 06:20:39 -07:00
Yineng Zhang
4d8d9b8efd chore: upgrade mooncake-transfer-engine 0.3.4 (#7401) 2025-06-20 16:38:54 -07:00
ybyang
906dbc34f1 [Docker] optimize dockerfile remove deepep and blackwell merge it to… (#7343)
Co-authored-by: Yineng Zhang <me@zhyncs.com>
2025-06-19 17:42:40 -07:00
DiweiSun
8a10c4c3d9 update ci node for xeon (#7265) 2025-06-16 23:44:08 -07:00
Sai Enduri
62a7aa2efc Update CI flakes. (#7244) 2025-06-16 15:19:32 -07:00
Yineng Zhang
7df7c679b6 feat: use zstd for docker (#7205) 2025-06-14 23:13:29 -07:00
Yineng Zhang
4473320380 chore: bump v0.1.8.post2 (#7189) 2025-06-14 17:01:48 -07:00
Arthur Cheng
baa6624d7c [CI] Add CI workflow for sgl-router docker build (#7027) 2025-06-09 23:16:44 -07:00
Yineng Zhang
1c8b42c84c chore: update pr test xeon (#7018) 2025-06-09 17:36:25 -07:00