sglang

Author	SHA1	Message	Date
aoshen524	588865f0e0	[Feature] Support Tensor Parallelism and Weight Slicing for Lora (#4274 ) Co-authored-by: ShenAo1111 <1377693092@qq.com> Co-authored-by: Baizhou Zhang <sobereddiezhang@gmail.com>	2025-03-18 20:33:07 -07:00
Yineng Zhang	c787298547	use sgl custom all reduce (#4441 )	2025-03-18 00:46:41 -07:00
Lianmin Zheng	82dec1f70b	Remove redundant type conversion (#4513 )	2025-03-17 05:57:35 -07:00
Lianmin Zheng	5493c3343e	Fix data parallel + tensor parallel (#4499 )	2025-03-17 05:13:16 -07:00
Lianmin Zheng	06d12b39d3	Remove filter for pr-tests (#4468 )	2025-03-16 00:57:26 -07:00
Lianmin Zheng	c30976fb41	Fix finish step for pr tests and notebook tests (#4467 )	2025-03-16 00:52:06 -07:00
Yineng Zhang	ad1ae7f7cd	use topk_softmax with sgl-kernel (#4439 )	2025-03-14 15:59:06 -07:00
Yineng Zhang	977d7cd26a	cleanup deps 1/n (#4400 ) Co-authored-by: sleepcoo <sleepcoo@gmail.com>	2025-03-14 00:00:33 -07:00
Lianmin Zheng	a5a892ffd3	Fix auto merge & add back get_flat_data_by_layer (#4393 )	2025-03-13 08:46:25 -07:00
HandH1998	2ac189edc8	Amd test fp8 (#4261 )	2025-03-10 10:12:09 -07:00
Lianmin Zheng	5a6400eec5	Test no vllm custom allreduce (#4256 )	2025-03-10 10:08:25 -07:00
Lianmin Zheng	3d56585a97	increase the timeout of nightly-test.yml (#4262 )	2025-03-10 05:07:03 -07:00
Lianmin Zheng	aa957102a9	Simplify tests & Fix trtllm custom allreduce registration (#4252 )	2025-03-10 01:24:22 -07:00
Lianmin Zheng	e8a69e4d0c	Clean up fp8 support (#4230 )	2025-03-09 21:46:35 -07:00
Lianmin Zheng	fbd560028a	Auto balance CI tests (#4238 )	2025-03-09 21:05:55 -07:00
Lianmin Zheng	8abf74e3c9	Rename files in sgl kernel to avoid nested folder structure (#4213 ) Co-authored-by: zhyncs <me@zhyncs.com>	2025-03-08 22:54:51 -08:00
Yineng Zhang	ee132a4515	use latest sgl-kernel for mla test (#4222 )	2025-03-08 22:27:47 -08:00
Lianmin Zheng	48473684cc	Split test_mla.py into two files (#4216 )	2025-03-08 15:40:49 -08:00
Lianmin Zheng	2cadd51d11	Test no vllm custom allreduce (#4210 )	2025-03-08 05:23:06 -08:00
Lianmin Zheng	8d323e95e4	Use clang format 18 in pr-test-sgl-kernel.yml (#4203 )	2025-03-08 01:28:10 -08:00
saienduri	e1aaa79ac9	Update amd ci docker image to v0.4.3.post4-rocm630. (#4189 )	2025-03-07 13:02:02 -08:00
Yineng Zhang	7e3bb52705	update release-pypi-kernel	2025-03-07 01:48:47 -08:00
Chayenne	9854a18a51	Hot fix small vocal eagle in docs (#4154 ) Co-authored-by: ybyang <ybyang7@iflytek.com>	2025-03-06 15:13:26 -08:00
Lianmin Zheng	bc1534ff32	Fix a draft model accuracy bug in eagle; support step=1; return logprob in eagle (#4134 ) Co-authored-by: Sehoon Kim <kssteven418@gmail.com> Co-authored-by: SangBin Cho <rkooo567@gmail.com> Co-authored-by: Sehoon Kim <sehoon@x.ai>	2025-03-06 06:13:59 -08:00
saienduri	55dc8e4d52	Add tag suffix to nightly docker builds. (#4129 )	2025-03-05 23:22:36 -08:00
saienduri	44d7646371	remove testing on PR workflow change (#4110 )	2025-03-05 16:03:18 -08:00
saienduri	cd85b78f94	Create release-docker-amd-nightly.yml (#4105 )	2025-03-05 14:46:26 -08:00
Ke Bao	d3fe9bae56	Add accuracy test for TP torch compile (#3994 )	2025-03-02 13:18:18 -08:00
fzyzcjy	e3e0bc50a9	[Feature] SPMD for SGLang + Verl (#3852 )	2025-02-28 09:53:10 -08:00
Qing	0519269d20	[Docs] Disable notebook CI when merge to main (#3905 )	2025-02-26 22:13:33 -08:00
Lianmin Zheng	d7934cde45	Fix CI and install docs (#3821 )	2025-02-24 16:17:38 -08:00
Yineng Zhang	f983213a1f	update pr-test (#3663 )	2025-02-18 17:23:43 +08:00
Yineng Zhang	07ab4d4a2d	fix #3654	2025-02-18 15:16:16 +08:00
saienduri	522e18eaeb	Update amd docker image. (#3654 )	2025-02-17 20:12:55 -08:00
Yineng Zhang	e319153be8	update unit test (#3636 )	2025-02-17 21:06:10 +08:00
saienduri	7474bed883	Update to latest amd image. (#3597 )	2025-02-17 00:29:47 +08:00
Shi Shuai	7443197a63	[CI] Improve Docs CI Efficiency (#3587 ) Co-authored-by: zhaochenyang20 <zhaochen20@outlook.com>	2025-02-14 19:57:00 -08:00
Yineng Zhang	70f894b810	feat: support flashinfer mla attention for deepseek v3 (#3550 )	2025-02-14 08:50:14 +08:00
Jackmin801	5f0e7de339	[Feat] Return hidden states (experimental) (#3364 ) Co-authored-by: Chayenne <zhaochen20@outlook.com>	2025-02-10 15:54:37 -08:00
Yineng Zhang	4fe92bfca5	fix mla test (#3469 )	2025-02-10 21:12:00 +08:00
Yineng Zhang	d87272750b	fix ci (#3441 )	2025-02-10 04:22:28 +08:00
Yineng Zhang	4cfd3add6d	support version in sgl-kernel (#3439 )	2025-02-10 03:49:52 +08:00
Yineng Zhang	5da3d21c8b	update pr-test ci (#3376 )	2025-02-07 21:08:35 +08:00
Yineng Zhang	2b1808cec4	update unit test in AMD CI (#3366 )	2025-02-07 17:25:16 +08:00
Chayenne	76ca91dff2	Docs/CI: Enable Fake Finish for Docs Only PR (#3350 )	2025-02-06 19:33:31 -08:00
saienduri	200d3b1608	Add sgl-kernel to MI300 CI paths tested. (#3335 ) Co-authored-by: HAI <hixiao@gmail.com>	2025-02-06 00:45:38 -08:00
saienduri	2d9c319594	Docker switch (#3327 ) Co-authored-by: HAI <hixiao@gmail.com>	2025-02-05 18:06:50 -08:00
saienduri	04d8cd2088	Initial Enablement of CI on MI300 (#3168 )	2025-02-05 10:45:12 -08:00
Yineng Zhang	d39899e85c	upgrade flashinfer v0.2.0.post2 (#3288 ) Co-authored-by: pankajroark <pankajroark@users.noreply.github.com>	2025-02-04 21:41:40 +08:00
HAI	566d61d90f	ROCm: bump 6.3.0 (#3259 )	2025-02-03 04:13:40 +08:00

254 Commits