Lianmin Zheng | 263c9236a0 | Always trigger pr-test (#8527) | 2025-07-29 04:05:19 -07:00
Lianmin Zheng | 69712e6f55 | Rename the last step in pr-test.yml as pr-test-finish (#8486) | 2025-07-28 19:06:13 -07:00
Keyang Ru | 7c9697178e | [CI]Add genai-bench Performance Validation for PD Router (#8477); Co-authored-by: key4ng <rukeyang@gamil.com> | 2025-07-28 16:58:23 -07:00
kyleliang-nv | 5922c0cbf6 | Remove zstd compression for building Dockerfile.gb200 (#8442) | 2025-07-27 22:58:53 -07:00
Stefan He | 4ad9737045 | chore: bump transformer to 4.54.0 (#8416); Co-authored-by: Binyao Jiang <byjiang1996@gmail.com>; Co-authored-by: Lifu Huang <lifu.hlf@gmail.com> | 2025-07-27 21:27:25 -07:00
kyleliang-nv | bb81daefb8 | Fix docker buildx push error (#8425) | 2025-07-27 17:59:38 -07:00
kyleliang-nv | 95217a9b4d | Change to use native arm runner (#8414) | 2025-07-27 12:48:12 -07:00
kyleliang-nv | 62a6b7c773 | Add docker release flow for gb200 (#8394) | 2025-07-26 21:25:07 -07:00
Lifu Huang | 5c705b1dce | Add perf tests for LoRA (#8314) | 2025-07-26 14:55:22 -07:00
Shangming Cai | 70e37b97bf | chore: upgrade mooncake 0.3.5 (#8341); Signed-off-by: Shangming Cai <caishangming@linux.alibaba.com> | 2025-07-25 01:17:26 -07:00
ronnie_zheng | 93d124ef5a | [feature] enable NPU CI (#7935); Co-authored-by: Even Zhou <14368888+iforgetmyname@users.noreply.github.com> | 2025-07-20 13:12:42 -07:00
Lianmin Zheng | 9c7a46180c | [Doc] Steps to add a new attention backend (#8155) | 2025-07-18 16:38:26 -07:00
Simo Lin | c8f31042a8 | [router] Refactor router and policy traits with dependency injection (#7987); Co-authored-by: Jin Pan <jpan236@wisc.edu>; Co-authored-by: Keru Yang <rukeyang@gmail.com>; Co-authored-by: Yingyi Huang <yingyihuang2000@outlook.com>; Co-authored-by: Philip Zhu <phlipzhux@gmail.com> | 2025-07-18 14:24:24 -07:00
Cheng Wan | 02404a1e35 | [ci] recover 8-gpu deepep test (#8105) | 2025-07-17 00:46:40 -07:00
Simo Lin | 8a7a7770e5 | [ci] limit cmake build nproc (#8100) | 2025-07-16 18:09:28 -07:00
Sai Enduri | f06bd210c0 | Update amd docker image. (#8045); Co-authored-by: Hubert Lu <55214931+hubertlu-tw@users.noreply.github.com> | 2025-07-15 15:09:56 -07:00
Sai Enduri | 5dc5866e8e | Setup workflow for releasing mi300x and mi350x dockers. (#8035) | 2025-07-14 21:51:43 -07:00
Cheng Wan | 475a249bb8 | temporarily disable deepep-8-gpu and activate two small tests (#7961) | 2025-07-11 14:22:05 -07:00
Cheng Wan | d487555f84 | [CI] Add deepep tests to CI (#7872) | 2025-07-09 01:49:47 -07:00
Yineng Zhang | 625018d259 | fix: free disk space (#7803) | 2025-07-05 18:52:25 -07:00
Shangming Cai | 2ff572e28c | [CI][Router] Fix bench_one_batch_server for pd router test (#7731); Signed-off-by: Shangming Cai <caishangming@linux.alibaba.com> | 2025-07-02 23:18:24 -07:00
Hubert Lu | 3b3f1e3aeb | [AMD] Add unit-test-sgl-kernel-amd to AMD CI (#7539) | 2025-06-29 15:50:09 -07:00
Simo Lin | 7c0db3a6c5 | [bugfix] Remove PR comment posting from Rust benchmark workflow (#7625) | 2025-06-28 22:10:01 -07:00
Keyang Ru | 29bd4c8135 | [CI] Add CI Testing for Prefill-Decode Disaggregation with Router (#7540) | 2025-06-27 00:18:56 -07:00
Simo Lin | 3abc30364d | [ci] add router benchmark script and CI (#7498) | 2025-06-25 01:28:25 -07:00
Lianmin Zheng | 55e03b10c4 | Fix a bug in BatchTokenIDOut & Misc style and dependency updates (#7457) | 2025-06-23 06:20:39 -07:00
Yineng Zhang | 4d8d9b8efd | chore: upgrade mooncake-transfer-engine 0.3.4 (#7401) | 2025-06-20 16:38:54 -07:00
ybyang | 906dbc34f1 | [Docker] optimize dockerfile remove deepep and blackwell merge it to… (#7343); Co-authored-by: Yineng Zhang <me@zhyncs.com> | 2025-06-19 17:42:40 -07:00
DiweiSun | 8a10c4c3d9 | update ci node for xeon (#7265) | 2025-06-16 23:44:08 -07:00
Sai Enduri | 62a7aa2efc | Update CI flakes. (#7244) | 2025-06-16 15:19:32 -07:00
Yineng Zhang | 7df7c679b6 | feat: use zstd for docker (#7205) | 2025-06-14 23:13:29 -07:00
Yineng Zhang | 4473320380 | chore: bump v0.1.8.post2 (#7189) | 2025-06-14 17:01:48 -07:00
Arthur Cheng | baa6624d7c | [CI] Add CI workflow for sgl-router docker build (#7027) | 2025-06-09 23:16:44 -07:00
Yineng Zhang | 1c8b42c84c | chore: update pr test xeon (#7018) | 2025-06-09 17:36:25 -07:00
Yineng Zhang | 7059ae16fb | chore: update pr test xeon (#7008) | 2025-06-09 10:08:44 -07:00
Yineng Zhang | 56ccd3c22c | chore: upgrade flashinfer v0.2.6.post1 jit (#6958); Co-authored-by: alcanderian <alcanderian@gmail.com>; Co-authored-by: Qiaolin Yu <qy254@cornell.edu>; Co-authored-by: Baizhou Zhang <sobereddiezhang@gmail.com>; Co-authored-by: Mick <mickjagger19@icloud.com>; Co-authored-by: ispobock <ispobaoke@gmail.com> | 2025-06-09 09:22:39 -07:00
Sai Enduri | 2c18642502 | Enable more unit tests for AMD CI. (#6983) | 2025-06-08 19:41:55 -07:00
Yineng Zhang | 6c0a48282a | chore: bump sgl-kernel v0.1.7 (#6963) | 2025-06-08 02:43:15 -07:00
Hubert Lu | 4740288303 | [AMD] Add more tests to per-commit-amd (#6926) | 2025-06-08 01:08:37 -07:00
Sai Enduri | 77e928d00e | Update server timeout time in AMD CI. (#6953) | 2025-06-07 15:10:27 -07:00
HAI | b819381fec | AITER backend extension and workload optimizations (#6838); Co-authored-by: wunhuang <wunhuang@amd.com>; Co-authored-by: Hubert Lu <Hubert.Lu@amd.com> | 2025-06-05 23:00:18 -07:00
Zaili Wang | 562f279a2d | [CPU] enable CI for PRs, add Dockerfile and auto build task (#6458); Co-authored-by: diwei sun <diwei.sun@intel.com>; Co-authored-by: Yineng Zhang <me@zhyncs.com> | 2025-06-05 13:43:54 -07:00
fzyzcjy | 0166403c20 | Support Blackwell DeepEP docker images (#6868) | 2025-06-05 00:07:53 -07:00
Junrong Lin | 2103b80607 | [CI] update verlengine ci to 4-gpu test (#6007) | 2025-05-27 14:32:23 -07:00
Yineng Zhang | fc419b62e8 | Revert "Tiny fix lint CI does not trigger on master (#6609)" (#6610) | 2025-05-25 22:52:34 -07:00
fzyzcjy | 84147254c9 | Tiny fix lint CI does not trigger on master (#6609) | 2025-05-25 22:47:03 -07:00
Shenggui Li | 3f23d8cdf1 | added support for tied weights in qwen pipeline parallelism (#6546) | 2025-05-25 00:00:56 -07:00
kk | 7a5e6ce1cb | Fix GPU OOM (#6564); Co-authored-by: michael <michael.zhang@amd.com> | 2025-05-24 16:38:39 -07:00
Sai Enduri | 24c035f2e3 | Temporarily disable MI325x 8 gpu testing. (#6576) | 2025-05-24 16:37:22 -07:00
fzyzcjy | 505eec4dc9 | Tiny make Lint CI show diff (#6445) | 2025-05-21 02:06:25 -07:00