Commit Graph

539 Commits

Author SHA1 Message Date
Lianmin Zheng
b9a54e0968 [minor] sync code on python/sglang/test/test_deterministic.py and improve ci tests (#11777)
Co-authored-by: Stefan He <hebiaobuaa@gmail.com>
Co-authored-by: Byron Hsu <byronhsu1230@gmail.com>
2025-10-17 14:25:22 -07:00
Chang Su
9e87b60f37 [router][CI] Clean up deprecated fields in pr-test-pd-router.yml (#11739) 2025-10-16 19:01:00 -07:00
Chang Su
dc01313da1 [router] Add rustfmt and set group imports by default (#11732) 2025-10-16 17:33:29 -07:00
Hank Han
0dd6cf16ba [ci]use H20 to run disaggregation test (#11543) 2025-10-16 11:42:42 -07:00
Even Zhou
3cceaa381a [Bugfix] Fix Qwen3/DSV3/DSV3.2 model support (#11510) 2025-10-16 15:14:09 +08:00
DiweiSun
4c03dbaaef [CI][XPU]enable sglang CI on Intel XPU (#9493)
Co-authored-by: huaiyuzh <huaiyu.zheng@intel.com>
Co-authored-by: Ma Mingfei <mingfei.ma@intel.com>
Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>
2025-10-15 17:13:19 -07:00
sglang-bot
9767a1e41b Update release-docker-dev.yml (#11603) 2025-10-14 03:06:48 -07:00
Sai Enduri
1d08653972 [AMD CI] Add image and weights caching. (#11593) 2025-10-14 02:51:35 -07:00
Johnny
cb8f3d90d3 [NVIDIA] update pyproject.toml to support cu130 option (#11521) 2025-10-13 13:03:31 -07:00
Baizhou Zhang
9f1f699a7a [CI] Add Basic Test for DeepSeek V3.2 (#11308) 2025-10-13 11:41:02 -07:00
Lianmin Zheng
5e3f7e7fa9 Minor: improve sampler & remove unused fields from model_config.py (#11531) 2025-10-13 11:04:44 -07:00
Simo Lin
13d596c93e [router][ci] Add Nightly Release Workflow for SGLang Router (#11527) 2025-10-13 09:28:55 -04:00
Mick
0c0779d667 ci: improve nightly-ci (#11385) 2025-10-12 21:19:34 -07:00
Cheng Wan
6cd296940a [lint] Fix the lint issue (#11516) 2025-10-12 16:22:46 -07:00
Yineng Zhang
0ecb42613d fix: revert temporarily remove b200 tests (#11515) 2025-10-12 15:02:37 -07:00
Lianmin Zheng
5a6ec8f999 Fix unit tests (#11503) 2025-10-12 07:45:57 -07:00
Lianmin Zheng
548a57b1f3 Fix port conflicts in CI (#11497) 2025-10-12 06:46:36 -07:00
Lianmin Zheng
88e73ed048 Temporarily remove b200 tests (#11501) 2025-10-12 06:41:37 -07:00
Sahithi Chigurupati
9fcf73069f [CI] Add nightly builds to dockerhub (#9804)
Signed-off-by: Sahithi Chigurupati <chigurupati.sahithi@gmail.com>
2025-10-11 18:27:46 -07:00
Zaili Wang
0a304870e8 fix Xeon CI (#11454) 2025-10-11 14:08:28 -07:00
Lianmin Zheng
61055cb309 Reorder PD disagg CI tests (#11438) 2025-10-10 17:56:49 -07:00
Keyang Ru
a0835c3a62 [router] Fix ci nvcc not found error (#11411) 2025-10-10 15:43:16 -07:00
Sahithi Chigurupati
af96ca1136 [CI] Merge build-dev into workflow matrix (#11345)
Signed-off-by: Sahithi Chigurupati <chigurupati.sahithi@gmail.com>
2025-10-10 11:13:42 -07:00
Zaili Wang
f19613e6c3 Dedicated toml files for CPU/XPU (#10734) 2025-10-10 00:44:55 -07:00
Shangming Cai
70fbb3adf6 [CI] Refactor PD disaggregation test suite (#11363)
Signed-off-by: Shangming Cai <csmthu@gmail.com>
2025-10-09 18:50:39 -07:00
Lianmin Zheng
9b8ebb2798 move more files under srt/utils (#11285) 2025-10-09 16:46:15 -07:00
sunxxuns
a57f0e3d56 reverse the amd ci test back to 1200s and split the 8-gpu deepseek job into two. (#11238)
Co-authored-by: root <root@smci350-zts-gtu-e17-15.zts-gtu.dcgpu>
2025-10-06 19:27:57 -04:00
ishandhanani
73ea484af1 docker: add manifest to versioned docker releases (#11268) 2025-10-06 14:53:40 -07:00
Lianmin Zheng
eb30b888db Remove env var warnings for release (#11262) 2025-10-06 10:09:17 -07:00
Kangyan-Zhou
8fd41eae93 Improve bot release workflow (#11240) 2025-10-05 21:28:27 -07:00
Lianmin Zheng
b6b4b56395 Update condition for sgl-kernel-benchmark-test (#11254) 2025-10-05 20:55:02 -07:00
Lianmin Zheng
366a603e95 Use cu128 for torch audio to fix some CI tests (#11251) 2025-10-05 19:52:32 -07:00
Kangyan-Zhou
a20fc7b7dc Create two new GH workflows to automatically bump SGLang and Kernel version (#10996) 2025-10-05 18:14:05 -07:00
Lianmin Zheng
d645ae90a3 Rename runner labels (#11228) 2025-10-05 18:05:41 -07:00
sunxxuns
5e142484e2 [Fix AMD CI] VRAM cleanup (#11174)
Co-authored-by: root <root@smci350-zts-gtu-e17-15.zts-gtu.dcgpu>
2025-10-05 19:03:53 -04:00
DiweiSun
307fc060e8 fix xeon ci check (#10838) 2025-10-04 00:17:36 +08:00
Vedant V Jhaveri
7e61737d3f [Generative Scores API] add performance tests to CICD (#10830) 2025-10-02 19:57:55 -07:00
Sai Enduri
195a59fe23 Refactor AMD CI. (#11128) 2025-10-01 01:12:28 -07:00
ishandhanani
47488cc353 docker: x86 dev builds for hopper and blackwell (#11075) 2025-10-01 00:06:38 -07:00
Lianmin Zheng
a17e70f5cc Use more general heuristics to set the default value of --mem-fraction-static (#10975)
Co-authored-by: sglang-bot <sglangbot@gmail.com>
2025-09-29 10:11:03 -07:00
Xiaoyu Zhang
6f16bf9d9d [Ci Monitor] Auto uploaded performance data to sglang_ci_data repo (#10976) 2025-09-29 16:17:27 +08:00
Xiaoyu Zhang
11965b0daf Fix sgl-kernel benchmark dead code (#11022) 2025-09-29 15:06:40 +08:00
Kangyan-Zhou
0c9174108a Unify SGL Kernel Releases (#10701) 2025-09-28 19:48:28 -07:00
Xiaoyu Zhang
2387c22b56 Ci monitor support performance (#10965) 2025-09-27 09:11:21 +08:00
Mick
777eb53897 ci: refactor nightly test (#10495) 2025-09-26 15:24:30 -07:00
Xiaoyu Zhang
05a3526654 Restruct gpu_memory_settings in a unify function and relax max_cuda_graph_bs (#10372)
Co-authored-by: Lianmin Zheng <lianminzheng@gmail.com>
Co-authored-by: sglang-bot <sglangbot@gmail.com>
2025-09-26 15:10:49 -07:00
Mick
fff7fbabe6 ci: fix rate-limit of huggingface with hf auth login (#10947) 2025-09-26 11:02:44 -07:00
Sahithi Chigurupati
c3d2ad4ee6 CI: Fix docker manifest build (#10936) 2025-09-25 23:22:55 -07:00
Lianmin Zheng
3e95aa1a09 Remove pull_request trigger from CI monitor workflow (#10932) 2025-09-25 19:40:38 -07:00
Xiaoyu Zhang
c4197e99bb [ci] add ci-monitor workflow (#10898) 2025-09-25 19:29:47 -07:00