Commit Graph

515 Commits

Author SHA1 Message Date
Shangming Cai
70fbb3adf6 [CI] Refactor PD disaggregation test suite (#11363)
Signed-off-by: Shangming Cai <csmthu@gmail.com>
2025-10-09 18:50:39 -07:00
Lianmin Zheng
9b8ebb2798 move more files under srt/utils (#11285) 2025-10-09 16:46:15 -07:00
sunxxuns
a57f0e3d56 reverse the amd ci test back to 1200s and split the 8-gpu deepseek job into two. (#11238)
Co-authored-by: root <root@smci350-zts-gtu-e17-15.zts-gtu.dcgpu>
2025-10-06 19:27:57 -04:00
ishandhanani
73ea484af1 docker: add manifest to versioned docker releases (#11268) 2025-10-06 14:53:40 -07:00
Lianmin Zheng
eb30b888db Remove env var warnings for release (#11262) 2025-10-06 10:09:17 -07:00
Kangyan-Zhou
8fd41eae93 Improve bot release workflow (#11240) 2025-10-05 21:28:27 -07:00
Lianmin Zheng
b6b4b56395 Update condition for sgl-kernel-benchmark-test (#11254) 2025-10-05 20:55:02 -07:00
Lianmin Zheng
366a603e95 Use cu128 for torch audio to fix some CI tests (#11251) 2025-10-05 19:52:32 -07:00
Kangyan-Zhou
a20fc7b7dc Create two new GH workflows to automatically bump SGLang and Kernel version (#10996) 2025-10-05 18:14:05 -07:00
Lianmin Zheng
d645ae90a3 Rename runner labels (#11228) 2025-10-05 18:05:41 -07:00
sunxxuns
5e142484e2 [Fix AMD CI] VRAM cleanup (#11174)
Co-authored-by: root <root@smci350-zts-gtu-e17-15.zts-gtu.dcgpu>
2025-10-05 19:03:53 -04:00
DiweiSun
307fc060e8 fix xeon ci check (#10838) 2025-10-04 00:17:36 +08:00
Vedant V Jhaveri
7e61737d3f [Generative Scores API] add performance tests to CICD (#10830) 2025-10-02 19:57:55 -07:00
Sai Enduri
195a59fe23 Refactor AMD CI. (#11128) 2025-10-01 01:12:28 -07:00
ishandhanani
47488cc353 docker: x86 dev builds for hopper and blackwell (#11075) 2025-10-01 00:06:38 -07:00
Lianmin Zheng
a17e70f5cc Use more general heuristics to set the default value of --mem-fraction-static (#10975)
Co-authored-by: sglang-bot <sglangbot@gmail.com>
2025-09-29 10:11:03 -07:00
Xiaoyu Zhang
6f16bf9d9d [Ci Monitor] Auto uploaded performance data to sglang_ci_data repo (#10976) 2025-09-29 16:17:27 +08:00
Xiaoyu Zhang
11965b0daf Fix sgl-kernel benchmark dead code (#11022) 2025-09-29 15:06:40 +08:00
Kangyan-Zhou
0c9174108a Unify SGL Kernel Releases (#10701) 2025-09-28 19:48:28 -07:00
Xiaoyu Zhang
2387c22b56 Ci monitor support performance (#10965) 2025-09-27 09:11:21 +08:00
Mick
777eb53897 ci: refactor nightly test (#10495) 2025-09-26 15:24:30 -07:00
Xiaoyu Zhang
05a3526654 Restruct gpu_memory_settings in a unify function and relax max_cuda_graph_bs (#10372)
Co-authored-by: Lianmin Zheng <lianminzheng@gmail.com>
Co-authored-by: sglang-bot <sglangbot@gmail.com>
2025-09-26 15:10:49 -07:00
Mick
fff7fbabe6 ci: fix rate-limit of huggingface with hf auth login (#10947) 2025-09-26 11:02:44 -07:00
Sahithi Chigurupati
c3d2ad4ee6 CI: Fix docker manifest build (#10936) 2025-09-25 23:22:55 -07:00
Lianmin Zheng
3e95aa1a09 Remove pull_request trigger from CI monitor workflow (#10932) 2025-09-25 19:40:38 -07:00
Xiaoyu Zhang
c4197e99bb [ci] add ci-monitor workflow (#10898) 2025-09-25 19:29:47 -07:00
ishandhanani
adba172fd1 ci: free space on workers for build (#10786)
Co-authored-by: zhyncs <me@zhyncs.com>
2025-09-24 02:58:22 -07:00
Lianmin Zheng
b1f0fc1c0b Add CI timeout guidelines (#10829) 2025-09-23 22:08:02 -07:00
Shangming Cai
23632d350c Fix latest main ci (#10799)
Signed-off-by: Shangming Cai <csmthu@gmail.com>
2025-09-23 12:46:13 -07:00
Simo Lin
ddab4fc7c7 [router] fix cache aware routing strategy and lock contention (#10773) 2025-09-23 08:53:49 -07:00
ishandhanani
b06db198ba followup: clean up dockerfiles and release yamls (#10783) 2025-09-23 00:19:46 -07:00
ishandhanani
1c82d9db28 feat: unify dockerfiles (#10705)
Co-authored-by: Yineng Zhang <me@zhyncs.com>
2025-09-22 23:23:48 -07:00
Simo Lin
c3a1d7759f [router] remove pd router draining channel (#10767) 2025-09-22 20:49:33 -07:00
Simo Lin
7ca1bea63d [router] update ci so only execute benchmarks when labels are added (#10757) 2025-09-22 13:23:07 -07:00
sglang-bot
fc3e542009 Update release-docs.yml (#10706) 2025-09-21 00:22:21 -07:00
Yineng Zhang
ba94b82986 fix: update run_suite (#10685) 2025-09-20 01:22:06 -07:00
Shangming Cai
74cd6e3902 chore: upgrade mooncake 0.3.6.post1 to fix gb200 dockerfile (#10681)
Signed-off-by: Shangming Cai <csmthu@gmail.com>
2025-09-20 00:12:26 -07:00
Yineng Zhang
6f993e8b9e chore: cleanup docker image (#10671) 2025-09-19 16:56:49 -07:00
Shangming Cai
60fc5b51f6 chore: upgrade mooncake 0.3.6 (#10596)
Signed-off-by: Shangming Cai <csmthu@gmail.com>
2025-09-18 00:19:30 -07:00
kyleliang-nv
e1d45bc280 Fix decord dependency for aarch64 docker build (#10529) 2025-09-16 17:34:37 -07:00
fzyzcjy
ae4be601c2 Fix CI when sgl-kernel is changed but srt is not changed (#10515) 2025-09-16 02:49:54 -07:00
Yineng Zhang
5207424014 chore: bump v0.3.10 sgl-kernel (#10478) 2025-09-15 15:20:09 -07:00
Sahithi Chigurupati
79acec4fe7 [CI] Fix runner for sgl-kernel (#9887)
Signed-off-by: Sahithi Chigurupati <chigurupati.sahithi@gmail.com>
2025-09-15 10:55:48 -07:00
Yineng Zhang
5afd036533 feat: support pip install sglang (#10465) 2025-09-15 03:09:17 -07:00
Lianmin Zheng
50dc0c1e9c Run tests based on labels (#10456) 2025-09-15 00:29:20 -07:00
Lianmin Zheng
f73aae0bfc Update GITHUB_TOKEN secret for documentation push (#10458) 2025-09-14 21:59:13 -07:00
Lianmin Zheng
b354e3c90d [CI] Fix token key in label-pr.yml workflow (#10452) 2025-09-14 20:45:53 -07:00
Lianmin Zheng
65e6f48ce4 Update permissions in label-pr.yml (#10450) 2025-09-14 20:41:43 -07:00
Lianmin Zheng
0ec580a86c Fix label PR (#10445) 2025-09-14 20:33:09 -07:00
Lianmin Zheng
8f6a175803 Fix label pr for ci (#10441) 2025-09-14 19:48:06 -07:00