| Xiaoyu Zhang | 8e51049f56 | [CI Monitor] Ci monitor only deal with main branch in default (#11538) | 2025-10-13 13:50:04 -07:00 |
| Baizhou Zhang | 9f1f699a7a | [CI] Add Basic Test for DeepSeek V3.2 (#11308) | 2025-10-13 11:41:02 -07:00 |
| Xiaoyu Zhang | 6806c4e63e | [CI monitor] Improve CI analyzer: fix job failure tracking and add CUDA-focused filtering (#11505) | 2025-10-13 13:31:09 +08:00 |
| Yineng Zhang | 05f015f65f | chore: remove flashinfer cleanup cache (#11514) | 2025-10-12 14:56:33 -07:00 |
| Lianmin Zheng | 5a6ec8f999 | Fix unit tests (#11503) | 2025-10-12 07:45:57 -07:00 |
| Lianmin Zheng | 548a57b1f3 | Fix port conflicts in CI (#11497) | 2025-10-12 06:46:36 -07:00 |
| Liangsheng Yin | 20a6c0a63d | Beta spec-overlap for EAGLE (#11398); Co-authored-by: Lianmin Zheng <15100009+merrymercy@users.noreply.github.com>; Co-authored-by: Hanming Lu <69857889+hanming-lu@users.noreply.github.com> | 2025-10-12 11:02:22 +08:00 |
| Lianmin Zheng | 61055cb309 | Reorder PD disagg CI tests (#11438) | 2025-10-10 17:56:49 -07:00 |
| Yineng Zhang | 4299aebdbb | chore: update pyproject (#11420) | 2025-10-10 00:56:30 -07:00 |
| Yineng Zhang | d8467db727 | fix: reinstall torch in deps install (#11414) | 2025-10-09 22:58:18 -07:00 |
| Yineng Zhang | 44cb060785 | chore: upgrade flashinfer 0.4.0 (#11364) | 2025-10-09 14:17:54 -07:00 |
| Lianmin Zheng | 0e7b353009 | Fix code sync scripts (#11276) | 2025-10-06 15:35:01 -07:00 |
| Kangyan-Zhou | 8fd41eae93 | Improve bot release workflow (#11240) | 2025-10-05 21:28:27 -07:00 |
| Lianmin Zheng | 366a603e95 | Use cu128 for torch audio to fix some CI tests (#11251) | 2025-10-05 19:52:32 -07:00 |
| Kangyan-Zhou | a20fc7b7dc | Create two new GH workflows to automatically bump SGLang and Kernel version (#10996) | 2025-10-05 18:14:05 -07:00 |
| Lianmin Zheng | d645ae90a3 | Rename runner labels (#11228) | 2025-10-05 18:05:41 -07:00 |
| Cheng Wan | 41763ba079 | Remove gdrcopy check in ci_install_deepep.sh (#11237) | 2025-10-05 17:35:22 -07:00 |
| sunxxuns | 5e142484e2 | [Fix AMD CI] VRAM cleanup (#11174); Co-authored-by: root <root@smci350-zts-gtu-e17-15.zts-gtu.dcgpu> | 2025-10-05 19:03:53 -04:00 |
| fzyzcjy | fdc4e1e570 | Tiny move files to utils folder (#11166) | 2025-10-03 22:40:06 +08:00 |
| Xiaoyu Zhang | 6f16bf9d9d | [Ci Monitor] Auto uploaded performance data to sglang_ci_data repo (#10976) | 2025-09-29 16:17:27 +08:00 |
| Xiaoyu Zhang | 2387c22b56 | Ci monitor support performance (#10965) | 2025-09-27 09:11:21 +08:00 |
| Mick | 777eb53897 | ci: refactor nightly test (#10495) | 2025-09-26 15:24:30 -07:00 |
| Mick | fff7fbabe6 | ci: fix rate-limit of huggingface with hf auth login (#10947) | 2025-09-26 11:02:44 -07:00 |
| Xiaoyu Zhang | c1f39013b7 | [ci feature] add ci monitor (#10872) | 2025-09-24 23:16:29 -07:00 |
| Lianmin Zheng | 38c00ed7a1 | Fix multimodal registry and code sync scripts (#10759); Co-authored-by: cctry <shiyang@x.ai> | 2025-09-22 15:36:01 -07:00 |
| Shangming Cai | 74cd6e3902 | chore: upgrade mooncake 0.3.6.post1 to fix gb200 dockerfile (#10681); Signed-off-by: Shangming Cai <csmthu@gmail.com> | 2025-09-20 00:12:26 -07:00 |
| Yi Zhang | e07b21ceaf | update deepep version for qwen3-next deepep moe (#10624) | 2025-09-18 11:35:22 -07:00 |
| Teng Ma | 77098aea7b | [HiCache] Add tests for hicache storage mooncake backend (#10171); Signed-off-by: Shangming Cai <csmthu@gmail.com>; Co-authored-by: hzh0425 <hzh0425@apache.org>; Co-authored-by: Shangming Cai <csmthu@gmail.com> | 2025-09-18 01:07:16 +08:00 |
| Yineng Zhang | 5c08d7d21d | fix: resolve sgl-kernel ut (#10476) | 2025-09-15 11:42:48 -07:00 |
| Yineng Zhang | 5afd036533 | feat: support pip install sglang (#10465) | 2025-09-15 03:09:17 -07:00 |
| Jintao Zhang | f9ee6ae17a | [router]: Add Embedding routing logic (#10129); Signed-off-by: Jintao Zhang <zhangjintao9020@gmail.com>; Co-authored-by: Waël Boukhobza <wawa_wael@live.fr> | 2025-09-14 18:44:35 -07:00 |
| fzyzcjy | a0f844ed5a | Let sgl-kernel changes be tested on srt (#10313) | 2025-09-14 01:09:17 -07:00 |
| fzyzcjy | abea9250da | Auto determine sgl kernel version in blackwell CI (#10318) | 2025-09-14 01:06:30 -07:00 |
| Even Zhou | 16cd550c85 | Support Qwen3-Next on Ascend NPU (#10379) | 2025-09-12 16:31:37 -07:00 |
| Yineng Zhang | bfe01a5eef | chore: upgrade v0.3.9.post2 sgl-kernel (#10297) | 2025-09-11 04:10:29 -07:00 |
| Even Zhou | 5b64f006ec | [Feature] Support DeepEP normal & Redundant Experts on NPU (#9881) | 2025-09-10 20:35:26 -07:00 |
| Hubert Lu | 91b3555d2d | Add tests to AMD CI for MI35x (#9662); Co-authored-by: Sai Enduri <saimanas.enduri@amd.com> | 2025-09-10 12:50:05 -07:00 |
| Lzhang-hub | 4efe2c57c9 | support vlm model spec bench (#10173) | 2025-09-10 13:37:04 +08:00 |
| Lianmin Zheng | bcf1955f7e | Revert "chore: upgrade v0.3.9 sgl-kernel" (#10245) | 2025-09-09 19:05:20 -07:00 |
| Yineng Zhang | d3ee70985f | chore: upgrade v0.3.9 sgl-kernel (#10220) | 2025-09-09 03:16:25 -07:00 |
| Liangsheng Yin | 6e95f5e5bd | Simplify Router arguments passing and build it in docker image (#9964) | 2025-09-05 12:13:55 +08:00 |
| Yineng Zhang | de9217334b | feat: add gpt oss b200 ci (#9988) | 2025-09-03 17:26:38 -07:00 |
| Lianmin Zheng | 646076b71e | Update guidelines for syncing code between repos (#9831) | 2025-08-30 16:10:35 -07:00 |
| Lianmin Zheng | 0d04008936 | [CI] Code sync tools (#9830) | 2025-08-30 16:02:29 -07:00 |
| Chayenne | 9b08d975a0 | [docs] Refactor, remove compiled results and add gpt-oss (#9613); Co-authored-by: zhaochenyang20 <zhaochenyang20@gmail.com> | 2025-08-25 15:27:06 -07:00 |
| Chang Su | 7638f5e44e | [router] Implement gRPC SGLangSchedulerClient (#9364) | 2025-08-19 16:44:11 -07:00 |
| Lianmin Zheng | c480a3f6ea | Minor style fixes for sgl-kernel (#9289) | 2025-08-18 09:38:35 -07:00 |
| michael-amd | 0fc8bf2cd4 | [AMD] Update fallback images for AMD CI (#9159); Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> | 2025-08-13 20:15:10 -07:00 |
| li chaoran | 2ecbd8b8bf | [feat] add ascend readme and docker release (#8700); Signed-off-by: mywaaagh_admin <pkwarcraft@gmail.com>; Signed-off-by: lichaoran <pkwarcraft@gmail.com>; Co-authored-by: Even Zhou <even.y.zhou@outlook.com>; Co-authored-by: ronnie_zheng <zl19940307@163.com> | 2025-08-12 13:25:42 -07:00 |
| Yi Zhang | 89f1d4f536 | update deepep commit to support qwen3-coder (#9066) | 2025-08-11 10:42:33 -07:00 |