245 Commits

Author SHA1 Message Date
HandH1998
2ac189edc8 Amd test fp8 (#4261) 2025-03-10 10:12:09 -07:00
Lianmin Zheng
5a6400eec5 Test no vllm custom allreduce (#4256) 2025-03-10 10:08:25 -07:00
Lianmin Zheng
3d56585a97 increase the timeout of nightly-test.yml (#4262) 2025-03-10 05:07:03 -07:00
Lianmin Zheng
aa957102a9 Simplify tests & Fix trtllm custom allreduce registration (#4252) 2025-03-10 01:24:22 -07:00
Lianmin Zheng
e8a69e4d0c Clean up fp8 support (#4230) 2025-03-09 21:46:35 -07:00
Lianmin Zheng
fbd560028a Auto balance CI tests (#4238) 2025-03-09 21:05:55 -07:00
Lianmin Zheng
8abf74e3c9 Rename files in sgl kernel to avoid nested folder structure (#4213)
Co-authored-by: zhyncs <me@zhyncs.com>
2025-03-08 22:54:51 -08:00
Yineng Zhang
ee132a4515 use latest sgl-kernel for mla test (#4222) 2025-03-08 22:27:47 -08:00
Lianmin Zheng
48473684cc Split test_mla.py into two files (#4216) 2025-03-08 15:40:49 -08:00
Lianmin Zheng
2cadd51d11 Test no vllm custom allreduce (#4210) 2025-03-08 05:23:06 -08:00
Lianmin Zheng
8d323e95e4 Use clang format 18 in pr-test-sgl-kernel.yml (#4203) 2025-03-08 01:28:10 -08:00
saienduri
e1aaa79ac9 Update amd ci docker image to v0.4.3.post4-rocm630. (#4189) 2025-03-07 13:02:02 -08:00
Yineng Zhang
7e3bb52705 update release-pypi-kernel 2025-03-07 01:48:47 -08:00
Chayenne
9854a18a51 Hot fix small vocal eagle in docs (#4154)
Co-authored-by: ybyang <ybyang7@iflytek.com>
2025-03-06 15:13:26 -08:00
Lianmin Zheng
bc1534ff32 Fix a draft model accuracy bug in eagle; support step=1; return logprob in eagle (#4134)
Co-authored-by: Sehoon Kim <kssteven418@gmail.com>
Co-authored-by: SangBin Cho <rkooo567@gmail.com>
Co-authored-by: Sehoon Kim <sehoon@x.ai>
2025-03-06 06:13:59 -08:00
saienduri
55dc8e4d52 Add tag suffix to nightly docker builds. (#4129) 2025-03-05 23:22:36 -08:00
saienduri
44d7646371 remove testing on PR workflow change (#4110) 2025-03-05 16:03:18 -08:00
saienduri
cd85b78f94 Create release-docker-amd-nightly.yml (#4105) 2025-03-05 14:46:26 -08:00
Ke Bao
d3fe9bae56 Add accuracy test for TP torch compile (#3994) 2025-03-02 13:18:18 -08:00
fzyzcjy
e3e0bc50a9 [Feature] SPMD for SGLang + Verl (#3852) 2025-02-28 09:53:10 -08:00
Qing
0519269d20 [Docs] Disable notebook CI when merge to main (#3905) 2025-02-26 22:13:33 -08:00
Lianmin Zheng
d7934cde45 Fix CI and install docs (#3821) 2025-02-24 16:17:38 -08:00
Yineng Zhang
f983213a1f update pr-test (#3663) 2025-02-18 17:23:43 +08:00
Yineng Zhang
07ab4d4a2d fix #3654 2025-02-18 15:16:16 +08:00
saienduri
522e18eaeb Update amd docker image. (#3654) 2025-02-17 20:12:55 -08:00
Yineng Zhang
e319153be8 update unit test (#3636) 2025-02-17 21:06:10 +08:00
saienduri
7474bed883 Update to latest amd image. (#3597) 2025-02-17 00:29:47 +08:00
Shi Shuai
7443197a63 [CI] Improve Docs CI Efficiency (#3587)
Co-authored-by: zhaochenyang20 <zhaochen20@outlook.com>
2025-02-14 19:57:00 -08:00
Yineng Zhang
70f894b810 feat: support flashinfer mla attention for deepseek v3 (#3550) 2025-02-14 08:50:14 +08:00
Jackmin801
5f0e7de339 [Feat] Return hidden states (experimental) (#3364)
Co-authored-by: Chayenne <zhaochen20@outlook.com>
2025-02-10 15:54:37 -08:00
Yineng Zhang
4fe92bfca5 fix mla test (#3469) 2025-02-10 21:12:00 +08:00
Yineng Zhang
d87272750b fix ci (#3441) 2025-02-10 04:22:28 +08:00
Yineng Zhang
4cfd3add6d support version in sgl-kernel (#3439) 2025-02-10 03:49:52 +08:00
Yineng Zhang
5da3d21c8b update pr-test ci (#3376) 2025-02-07 21:08:35 +08:00
Yineng Zhang
2b1808cec4 update unit test in AMD CI (#3366) 2025-02-07 17:25:16 +08:00
Chayenne
76ca91dff2 Docs/CI: Enable Fake Finish for Docs Only PR (#3350) 2025-02-06 19:33:31 -08:00
saienduri
200d3b1608 Add sgl-kernel to MI300 CI paths tested. (#3335)
Co-authored-by: HAI <hixiao@gmail.com>
2025-02-06 00:45:38 -08:00
saienduri
2d9c319594 Docker switch (#3327)
Co-authored-by: HAI <hixiao@gmail.com>
2025-02-05 18:06:50 -08:00
saienduri
04d8cd2088 Initial Enablement of CI on MI300 (#3168) 2025-02-05 10:45:12 -08:00
Yineng Zhang
d39899e85c upgrade flashinfer v0.2.0.post2 (#3288)
Co-authored-by: pankajroark <pankajroark@users.noreply.github.com>
2025-02-04 21:41:40 +08:00
HAI
566d61d90f ROCm: bump 6.3.0 (#3259) 2025-02-03 04:13:40 +08:00
Yineng Zhang
b49d6d0fee support 12.5 CUDA runtime (#3231) 2025-01-31 20:31:38 +08:00
Jhin
7b9b4f4426 Docs fix about EAGLE and streaming output (#3166)
Co-authored-by: Chayenne <zhaochenyang@ucla.edu>
Co-authored-by: Chayenne <zhaochen20@outlook.com>
Co-authored-by: Jhin <jhinpan@umich.edu>
2025-01-27 18:10:45 -08:00
Yineng Zhang
827aa8730b cleanup sgl-kernel kernels (#3175) 2025-01-27 19:11:01 +08:00
Yineng Zhang
f265d15b96 use self-hosted to build sgl-kernel (#3154) 2025-01-26 23:02:57 +08:00
Lianmin Zheng
4a61253123 Do not load OPENAI_KEY from secrets (#3147) 2025-01-26 01:54:03 -08:00
Lianmin Zheng
4f118a39d7 Fix repetition penalty (#3139) 2025-01-25 21:48:58 -08:00
Yineng Zhang
822bae8c00 feat: cross python wheel for sgl-kernel (#3138) 2025-01-26 13:21:34 +08:00
Lianmin Zheng
da6f8081f6 Fix CI tests (#3132) 2025-01-25 17:43:39 -08:00
Yineng Zhang
896c07441e update installation doc for sgl-kernel (#3129) 2025-01-26 00:00:13 +08:00