Kangyan-Zhou
|
2c057fbfa8
|
Update Github action title for kernel build (#12029)
|
2025-10-23 13:39:40 -07:00 |
|
Johnny
|
e7aa4664b3
|
[NVIDIA] Build CUDA 13 (#11299)
Co-authored-by: ishandhanani <ishandhanani@gmail.com>
Co-authored-by: Baizhou Zhang <sobereddiezhang@gmail.com>
|
2025-10-22 20:03:12 -07:00 |
|
Lianmin Zheng
|
b9a54e0968
|
[minor] sync code on python/sglang/test/test_deterministic.py and improve ci tests (#11777)
Co-authored-by: Stefan He <hebiaobuaa@gmail.com>
Co-authored-by: Byron Hsu <byronhsu1230@gmail.com>
|
2025-10-17 14:25:22 -07:00 |
|
Hank Han
|
0dd6cf16ba
|
[ci]use H20 to run disaggregation test (#11543)
|
2025-10-16 11:42:42 -07:00 |
|
Baizhou Zhang
|
9f1f699a7a
|
[CI] Add Basic Test for DeepSeek V3.2 (#11308)
|
2025-10-13 11:41:02 -07:00 |
|
Cheng Wan
|
6cd296940a
|
[lint] Fix the lint issue (#11516)
|
2025-10-12 16:22:46 -07:00 |
|
Yineng Zhang
|
0ecb42613d
|
fix: revert temporarily remove b200 tests (#11515)
|
2025-10-12 15:02:37 -07:00 |
|
Lianmin Zheng
|
5a6ec8f999
|
Fix unit tests (#11503)
|
2025-10-12 07:45:57 -07:00 |
|
Lianmin Zheng
|
548a57b1f3
|
Fix port conflicts in CI (#11497)
|
2025-10-12 06:46:36 -07:00 |
|
Lianmin Zheng
|
88e73ed048
|
Temporarily remove b200 tests (#11501)
|
2025-10-12 06:41:37 -07:00 |
|
Lianmin Zheng
|
61055cb309
|
Reorder PD disagg CI tests (#11438)
|
2025-10-10 17:56:49 -07:00 |
|
Shangming Cai
|
70fbb3adf6
|
[CI] Refactor PD disaggregation test suite (#11363)
Signed-off-by: Shangming Cai <csmthu@gmail.com>
|
2025-10-09 18:50:39 -07:00 |
|
Lianmin Zheng
|
9b8ebb2798
|
move more files under srt/utils (#11285)
|
2025-10-09 16:46:15 -07:00 |
|
Lianmin Zheng
|
b6b4b56395
|
Update condition for sgl-kernel-benchmark-test (#11254)
|
2025-10-05 20:55:02 -07:00 |
|
Lianmin Zheng
|
d645ae90a3
|
Rename runner labels (#11228)
|
2025-10-05 18:05:41 -07:00 |
|
Vedant V Jhaveri
|
7e61737d3f
|
[Generative Scores API] add performance tests to CICD (#10830)
|
2025-10-02 19:57:55 -07:00 |
|
Lianmin Zheng
|
a17e70f5cc
|
Use more general heuristics to set the default value of --mem-fraction-static (#10975)
Co-authored-by: sglang-bot <sglangbot@gmail.com>
|
2025-09-29 10:11:03 -07:00 |
|
Xiaoyu Zhang
|
11965b0daf
|
Fix sgl-kernel benchmark dead code (#11022)
|
2025-09-29 15:06:40 +08:00 |
|
Kangyan-Zhou
|
0c9174108a
|
Unify SGL Kernel Releases (#10701)
|
2025-09-28 19:48:28 -07:00 |
|
Xiaoyu Zhang
|
05a3526654
|
Restruct gpu_memory_settings in a unify function and relax max_cuda_graph_bs (#10372)
Co-authored-by: Lianmin Zheng <lianminzheng@gmail.com>
Co-authored-by: sglang-bot <sglangbot@gmail.com>
|
2025-09-26 15:10:49 -07:00 |
|
Mick
|
fff7fbabe6
|
ci: fix rate-limit of huggingface with hf auth login (#10947)
|
2025-09-26 11:02:44 -07:00 |
|
Lianmin Zheng
|
b1f0fc1c0b
|
Add CI timeout guidelines (#10829)
|
2025-09-23 22:08:02 -07:00 |
|
Shangming Cai
|
23632d350c
|
Fix latest main ci (#10799)
Signed-off-by: Shangming Cai <csmthu@gmail.com>
|
2025-09-23 12:46:13 -07:00 |
|
Yineng Zhang
|
ba94b82986
|
fix: update run_suite (#10685)
|
2025-09-20 01:22:06 -07:00 |
|
fzyzcjy
|
ae4be601c2
|
Fix CI when sgl-kernel is changed but srt is not changed (#10515)
|
2025-09-16 02:49:54 -07:00 |
|
Lianmin Zheng
|
50dc0c1e9c
|
Run tests based on labels (#10456)
|
2025-09-15 00:29:20 -07:00 |
|
Yineng Zhang
|
7ce6c10eb6
|
fix: enable cu124 and cu128 build on main push (#10431)
|
2025-09-14 16:19:35 -07:00 |
|
fzyzcjy
|
e3cf812f7d
|
Fix sgl-kernel + srt CI (#10419)
|
2025-09-14 01:44:47 -07:00 |
|
fzyzcjy
|
a0f844ed5a
|
Let sgl-kernel changes be tested on srt (#10313)
|
2025-09-14 01:09:17 -07:00 |
|
Yineng Zhang
|
9d775b1a2d
|
feat: add deepseek v3 fp4 ut (#10391)
|
2025-09-12 15:43:29 -07:00 |
|
hzh0425
|
1a3d6f31da
|
Modify ci workflow for auto-partitioning in 2-GPU backend tests (#10029)
|
2025-09-06 10:28:42 +08:00 |
|
Lianmin Zheng
|
05e4787243
|
[CI] Fix the trigger condition for PR test workflows (#9761)
|
2025-08-30 15:47:10 -07:00 |
|
Lianmin Zheng
|
2c7f01bc89
|
Reorganize CI and test files (#9027)
|
2025-08-10 12:30:06 -07:00 |
|
Lianmin Zheng
|
ef48d5547e
|
Fix CI (#9013)
|
2025-08-09 16:00:10 -07:00 |
|
Lianmin Zheng
|
706bd69cc5
|
Clean up server_args.py to have a dedicated function for model specific adjustments (#8983)
|
2025-08-08 19:56:50 -07:00 |
|
fzyzcjy
|
b114a8105b
|
Support B200 in CI (#8861)
|
2025-08-06 21:42:44 +08:00 |
|
Lianmin Zheng
|
263c9236a0
|
Always trigger pr-test (#8527)
|
2025-07-29 04:05:19 -07:00 |
|
Lianmin Zheng
|
69712e6f55
|
Rename the last step in pr-test.yml as pr-test-finish (#8486)
|
2025-07-28 19:06:13 -07:00 |
|
Lifu Huang
|
5c705b1dce
|
Add perf tests for LoRA (#8314)
|
2025-07-26 14:55:22 -07:00 |
|
Lianmin Zheng
|
9c7a46180c
|
[Doc] Steps to add a new attention backend (#8155)
|
2025-07-18 16:38:26 -07:00 |
|
Cheng Wan
|
02404a1e35
|
[ci] recover 8-gpu deepep test (#8105)
|
2025-07-17 00:46:40 -07:00 |
|
Cheng Wan
|
475a249bb8
|
temporarily disable deepep-8-gpu and activate two small tests (#7961)
|
2025-07-11 14:22:05 -07:00 |
|
Cheng Wan
|
d487555f84
|
[CI] Add deepep tests to CI (#7872)
|
2025-07-09 01:49:47 -07:00 |
|
Lianmin Zheng
|
55e03b10c4
|
Fix a bug in BatchTokenIDOut & Misc style and dependency updates (#7457)
|
2025-06-23 06:20:39 -07:00 |
|
Junrong Lin
|
2103b80607
|
[CI] update verlengine ci to 4-gpu test (#6007)
|
2025-05-27 14:32:23 -07:00 |
|
Shenggui Li
|
3f23d8cdf1
|
added support for tied weights in qwen pipeline parallelism (#6546)
|
2025-05-25 00:00:56 -07:00 |
|
fzyzcjy
|
f11481b921
|
Add 4-GPU runner tests and split existing tests (#6383)
|
2025-05-18 11:56:51 -07:00 |
|
Ying Sheng
|
bad7c26fdc
|
[PP] Fix init_memory_pool desync & add PP for mixtral (#6223)
|
2025-05-12 12:38:09 -07:00 |
|
Lianmin Zheng
|
03227c5fa6
|
[CI] Reorganize the 8 gpu tests (#6192)
|
2025-05-11 10:55:06 -07:00 |
|
Lianmin Zheng
|
17c36c5511
|
[CI] Disabled deepep tests temporarily because it takes too much time. (#6186)
|
2025-05-10 23:40:50 -07:00 |
|