485 Commits

Author SHA1 Message Date
ishandhanani
b06db198ba followup: clean up dockerfiles and release yamls (#10783) 2025-09-23 00:19:46 -07:00
ishandhanani
1c82d9db28 feat: unify dockerfiles (#10705)
Co-authored-by: Yineng Zhang <me@zhyncs.com>
2025-09-22 23:23:48 -07:00
Simo Lin
c3a1d7759f [router] remove pd router draining channel (#10767) 2025-09-22 20:49:33 -07:00
Simo Lin
7ca1bea63d [router] update ci so only execute benchmarks when labels are added (#10757) 2025-09-22 13:23:07 -07:00
sglang-bot
fc3e542009 Update release-docs.yml (#10706) 2025-09-21 00:22:21 -07:00
Yineng Zhang
ba94b82986 fix: update run_suite (#10685) 2025-09-20 01:22:06 -07:00
Shangming Cai
74cd6e3902 chore: upgrade mooncake 0.3.6.post1 to fix gb200 dockerfile (#10681)
Signed-off-by: Shangming Cai <csmthu@gmail.com>
2025-09-20 00:12:26 -07:00
Yineng Zhang
6f993e8b9e chore: cleanup docker image (#10671) 2025-09-19 16:56:49 -07:00
Shangming Cai
60fc5b51f6 chore: upgrade mooncake 0.3.6 (#10596)
Signed-off-by: Shangming Cai <csmthu@gmail.com>
2025-09-18 00:19:30 -07:00
kyleliang-nv
e1d45bc280 Fix decord dependency for aarch64 docker build (#10529) 2025-09-16 17:34:37 -07:00
fzyzcjy
ae4be601c2 Fix CI when sgl-kernel is changed but srt is not changed (#10515) 2025-09-16 02:49:54 -07:00
Yineng Zhang
5207424014 chore: bump v0.3.10 sgl-kernel (#10478) 2025-09-15 15:20:09 -07:00
Sahithi Chigurupati
79acec4fe7 [CI] Fix runner for sgl-kernel (#9887)
Signed-off-by: Sahithi Chigurupati <chigurupati.sahithi@gmail.com>
2025-09-15 10:55:48 -07:00
Yineng Zhang
5afd036533 feat: support pip install sglang (#10465) 2025-09-15 03:09:17 -07:00
Lianmin Zheng
50dc0c1e9c Run tests based on labels (#10456) 2025-09-15 00:29:20 -07:00
Lianmin Zheng
f73aae0bfc Update GITHUB_TOKEN secret for documentation push (#10458) 2025-09-14 21:59:13 -07:00
Lianmin Zheng
b354e3c90d [CI] Fix token key in label-pr.yml workflow (#10452) 2025-09-14 20:45:53 -07:00
Lianmin Zheng
65e6f48ce4 Update permissions in label-pr.yml (#10450) 2025-09-14 20:41:43 -07:00
Lianmin Zheng
0ec580a86c Fix label PR (#10445) 2025-09-14 20:33:09 -07:00
Lianmin Zheng
8f6a175803 Fix label pr for ci (#10441) 2025-09-14 19:48:06 -07:00
Lianmin Zheng
b7d385e812 automatically label pr for ci (#10435) 2025-09-14 19:13:11 -07:00
Jintao Zhang
f9ee6ae17a [router]: Add Embedding routing logic (#10129)
Signed-off-by: Jintao Zhang <zhangjintao9020@gmail.com>
Co-authored-by: Waël Boukhobza <wawa_wael@live.fr>
2025-09-14 18:44:35 -07:00
Yineng Zhang
7ce6c10eb6 fix: enable cu124 and cu128 build on main push (#10431) 2025-09-14 16:19:35 -07:00
fzyzcjy
e3cf812f7d Fix sgl-kernel + srt CI (#10419) 2025-09-14 01:44:47 -07:00
fzyzcjy
a0f844ed5a Let sgl-kernel changes be tested on srt (#10313) 2025-09-14 01:09:17 -07:00
Even Zhou
16cd550c85 Support Qwen3-Next on Ascend NPU (#10379) 2025-09-12 16:31:37 -07:00
Yineng Zhang
9d775b1a2d feat: add deepseek v3 fp4 ut (#10391) 2025-09-12 15:43:29 -07:00
Simo Lin
07bcad7fb7 [bug] fix router ci syntax error (#10390) 2025-09-12 14:39:15 -07:00
Simo Lin
8c86595c93 [router] enable sccache in ci and local build (#10099) 2025-09-12 09:43:48 -07:00
Yineng Zhang
b3839a7f99 fix: resolve transfer_kv_all_layer_direct_lf_pf import error (#10360) 2025-09-11 23:53:23 -07:00
Keyang Ru
7b141f816c [router][ci] Add gpu utilization analyze with nvml (#10345) 2025-09-11 19:26:02 -07:00
Yineng Zhang
b0d25e72c4 chore: bump v0.5.2 (#10221) 2025-09-11 16:09:20 -07:00
Keyang Ru
1ee11df8ac [router][ci] add gpu process check and free port before start server (#10338) 2025-09-11 14:24:16 -07:00
Keyang Ru
480d1b8b20 [router] add benchmark for regular router and pd router (#10280) 2025-09-11 12:04:11 -07:00
Yineng Zhang
bfe01a5eef chore: upgrade v0.3.9.post2 sgl-kernel (#10297) 2025-09-11 04:10:29 -07:00
Hank Han
3dd6420a4d [CI] add pyproject.toml to deepseek w4a8 ci (#10314) 2025-09-11 02:10:50 -07:00
BourneSun0527
4aa1e69bc7 [chore]Add sgl-router to npu images (#10229) 2025-09-10 23:51:16 -07:00
Even Zhou
5b64f006ec [Feature] Support DeepEP normal & Redundant Experts on NPU (#9881) 2025-09-10 20:35:26 -07:00
Hubert Lu
91b3555d2d Add tests to AMD CI for MI35x (#9662)
Co-authored-by: Sai Enduri <saimanas.enduri@amd.com>
2025-09-10 12:50:05 -07:00
Yineng Zhang
f3817cb0b2 chore: bump v0.3.9 sgl-kernel (#10208) 2025-09-09 01:40:05 -07:00
Yineng Zhang
cdc56ef6c1 feat: use sgl-kernel cu129 as default (#10188) 2025-09-08 22:01:17 -07:00
Even Zhou
b67c277f86 [Bugfix] Qwen3MoE aclrtMemcpy failed with NPUGraph (#10013) 2025-09-07 21:50:49 -07:00
Cao E
7577f0e40f Add graph runner support with torch compile on CPU (#7843) 2025-09-07 21:33:58 -07:00
Keyang Ru
9eb50ecc9c [router] Improve the router e2e tests (#10102) 2025-09-06 16:19:28 -07:00
Keyang Ru
b3e7a2cee4 increase the rust e2e timeout (#10116) 2025-09-06 16:17:34 -07:00
hzh0425
1a3d6f31da Modify ci workflow for auto-partitioning in 2-GPU backend tests (#10029) 2025-09-06 10:28:42 +08:00
Keyang Ru
21b9a4b435 [router] Introduce router integration tests (#10086) 2025-09-05 18:52:53 -07:00
Simo Lin
bde73ee43f [router] add rust cache in benchmark ci (#10080) 2025-09-05 09:59:36 -07:00
Keyang Ru
4f0e28d7fc [router] add rust cache for rust unit test (#10079) 2025-09-05 09:58:59 -07:00
Keyang Ru
045ab92dc0 [router] add py binding unit tests to coverage 80% (#10043) 2025-09-05 08:40:21 -07:00