Lianmin Zheng
|
3e95aa1a09
|
Remove pull_request trigger from CI monitor workflow (#10932)
|
2025-09-25 19:40:38 -07:00 |
|
Xiaoyu Zhang
|
c4197e99bb
|
[ci] add ci-monitor workflow (#10898)
|
2025-09-25 19:29:47 -07:00 |
|
ishandhanani
|
adba172fd1
|
ci: free space on workers for build (#10786)
Co-authored-by: zhyncs <me@zhyncs.com>
|
2025-09-24 02:58:22 -07:00 |
|
Lianmin Zheng
|
b1f0fc1c0b
|
Add CI timeout guidelines (#10829)
|
2025-09-23 22:08:02 -07:00 |
|
Shangming Cai
|
23632d350c
|
Fix latest main ci (#10799)
Signed-off-by: Shangming Cai <csmthu@gmail.com>
|
2025-09-23 12:46:13 -07:00 |
|
Simo Lin
|
ddab4fc7c7
|
[router] fix cache aware routing strategy and lock contention (#10773)
|
2025-09-23 08:53:49 -07:00 |
|
ishandhanani
|
b06db198ba
|
followup: clean up dockerfiles and release yamls (#10783)
|
2025-09-23 00:19:46 -07:00 |
|
ishandhanani
|
1c82d9db28
|
feat: unify dockerfiles (#10705)
Co-authored-by: Yineng Zhang <me@zhyncs.com>
|
2025-09-22 23:23:48 -07:00 |
|
Simo Lin
|
c3a1d7759f
|
[router] remove pd router draining channel (#10767)
|
2025-09-22 20:49:33 -07:00 |
|
Simo Lin
|
7ca1bea63d
|
[router] update ci so only execute benchmarks when labels are added (#10757)
|
2025-09-22 13:23:07 -07:00 |
|
sglang-bot
|
fc3e542009
|
Update release-docs.yml (#10706)
|
2025-09-21 00:22:21 -07:00 |
|
Yineng Zhang
|
ba94b82986
|
fix: update run_suite (#10685)
|
2025-09-20 01:22:06 -07:00 |
|
Shangming Cai
|
74cd6e3902
|
chore: upgrade mooncake 0.3.6.post1 to fix gb200 dockerfile (#10681)
Signed-off-by: Shangming Cai <csmthu@gmail.com>
|
2025-09-20 00:12:26 -07:00 |
|
Yineng Zhang
|
6f993e8b9e
|
chore: cleanup docker image (#10671)
|
2025-09-19 16:56:49 -07:00 |
|
Shangming Cai
|
60fc5b51f6
|
chore: upgrade mooncake 0.3.6 (#10596)
Signed-off-by: Shangming Cai <csmthu@gmail.com>
|
2025-09-18 00:19:30 -07:00 |
|
kyleliang-nv
|
e1d45bc280
|
Fix decord dependency for aarch64 docker build (#10529)
|
2025-09-16 17:34:37 -07:00 |
|
fzyzcjy
|
ae4be601c2
|
Fix CI when sgl-kernel is changed but srt is not changed (#10515)
|
2025-09-16 02:49:54 -07:00 |
|
Yineng Zhang
|
5207424014
|
chore: bump v0.3.10 sgl-kernel (#10478)
|
2025-09-15 15:20:09 -07:00 |
|
Sahithi Chigurupati
|
79acec4fe7
|
[CI] Fix runner for sgl-kernel (#9887)
Signed-off-by: Sahithi Chigurupati <chigurupati.sahithi@gmail.com>
|
2025-09-15 10:55:48 -07:00 |
|
Yineng Zhang
|
5afd036533
|
feat: support pip install sglang (#10465)
|
2025-09-15 03:09:17 -07:00 |
|
Lianmin Zheng
|
50dc0c1e9c
|
Run tests based on labels (#10456)
|
2025-09-15 00:29:20 -07:00 |
|
Lianmin Zheng
|
f73aae0bfc
|
Update GITHUB_TOKEN secret for documentation push (#10458)
|
2025-09-14 21:59:13 -07:00 |
|
Lianmin Zheng
|
b354e3c90d
|
[CI] Fix token key in label-pr.yml workflow (#10452)
|
2025-09-14 20:45:53 -07:00 |
|
Lianmin Zheng
|
65e6f48ce4
|
Update permissions in label-pr.yml (#10450)
|
2025-09-14 20:41:43 -07:00 |
|
Lianmin Zheng
|
0ec580a86c
|
Fix label PR (#10445)
|
2025-09-14 20:33:09 -07:00 |
|
Lianmin Zheng
|
8f6a175803
|
Fix label pr for ci (#10441)
|
2025-09-14 19:48:06 -07:00 |
|
Lianmin Zheng
|
b7d385e812
|
automatically label pr for ci (#10435)
|
2025-09-14 19:13:11 -07:00 |
|
Jintao Zhang
|
f9ee6ae17a
|
[router]: Add Embedding routing logic (#10129)
Signed-off-by: Jintao Zhang <zhangjintao9020@gmail.com>
Co-authored-by: Waël Boukhobza <wawa_wael@live.fr>
|
2025-09-14 18:44:35 -07:00 |
|
Yineng Zhang
|
7ce6c10eb6
|
fix: enable cu124 and cu128 build on main push (#10431)
|
2025-09-14 16:19:35 -07:00 |
|
fzyzcjy
|
e3cf812f7d
|
Fix sgl-kernel + srt CI (#10419)
|
2025-09-14 01:44:47 -07:00 |
|
fzyzcjy
|
a0f844ed5a
|
Let sgl-kernel changes be tested on srt (#10313)
|
2025-09-14 01:09:17 -07:00 |
|
Even Zhou
|
16cd550c85
|
Support Qwen3-Next on Ascend NPU (#10379)
|
2025-09-12 16:31:37 -07:00 |
|
Yineng Zhang
|
9d775b1a2d
|
feat: add deepseek v3 fp4 ut (#10391)
|
2025-09-12 15:43:29 -07:00 |
|
Simo Lin
|
07bcad7fb7
|
[bug] fix router ci syntax error (#10390)
|
2025-09-12 14:39:15 -07:00 |
|
Simo Lin
|
8c86595c93
|
[router] enable sccache in ci and local build (#10099)
|
2025-09-12 09:43:48 -07:00 |
|
Yineng Zhang
|
b3839a7f99
|
fix: resolve transfer_kv_all_layer_direct_lf_pf import error (#10360)
|
2025-09-11 23:53:23 -07:00 |
|
Keyang Ru
|
7b141f816c
|
[router][ci] Add gpu utilization analyze with nvml (#10345)
|
2025-09-11 19:26:02 -07:00 |
|
Yineng Zhang
|
b0d25e72c4
|
chore: bump v0.5.2 (#10221)
|
2025-09-11 16:09:20 -07:00 |
|
Keyang Ru
|
1ee11df8ac
|
[router][ci] add gpu process check and free port before start server (#10338)
|
2025-09-11 14:24:16 -07:00 |
|
Keyang Ru
|
480d1b8b20
|
[router] add benchmark for regular router and pd router (#10280)
|
2025-09-11 12:04:11 -07:00 |
|
Yineng Zhang
|
bfe01a5eef
|
chore: upgrade v0.3.9.post2 sgl-kernel (#10297)
|
2025-09-11 04:10:29 -07:00 |
|
Hank Han
|
3dd6420a4d
|
[CI] add pyproject.toml to deepseek w4a8 ci (#10314)
|
2025-09-11 02:10:50 -07:00 |
|
BourneSun0527
|
4aa1e69bc7
|
[chore]Add sgl-router to npu images (#10229)
|
2025-09-10 23:51:16 -07:00 |
|
Even Zhou
|
5b64f006ec
|
[Feature] Support DeepEP normal & Redundant Experts on NPU (#9881)
|
2025-09-10 20:35:26 -07:00 |
|
Hubert Lu
|
91b3555d2d
|
Add tests to AMD CI for MI35x (#9662)
Co-authored-by: Sai Enduri <saimanas.enduri@amd.com>
|
2025-09-10 12:50:05 -07:00 |
|
Yineng Zhang
|
f3817cb0b2
|
chore: bump v0.3.9 sgl-kernel (#10208)
|
2025-09-09 01:40:05 -07:00 |
|
Yineng Zhang
|
cdc56ef6c1
|
feat: use sgl-kernel cu129 as default (#10188)
|
2025-09-08 22:01:17 -07:00 |
|
Even Zhou
|
b67c277f86
|
[Bugfix] Qwen3MoE aclrtMemcpy failed with NPUGraph (#10013)
|
2025-09-07 21:50:49 -07:00 |
|
Cao E
|
7577f0e40f
|
Add graph runner support with torch compile on CPU (#7843)
|
2025-09-07 21:33:58 -07:00 |
|
Keyang Ru
|
9eb50ecc9c
|
[router] Improve the router e2e tests (#10102)
|
2025-09-06 16:19:28 -07:00 |
|