sglang

Author	SHA1	Message	Date
Lianmin Zheng	9eefe2c0b7	Set CUDA_VISIBLE_DEVICES to achieve one GPU per process (#9170 ) Co-authored-by: SangBin Cho <rkooo567@gmail.com> Co-authored-by: Cheng Wan <cwan@x.ai> Co-authored-by: Cheng Wan <54331508+ch-wan@users.noreply.github.com>	2025-10-17 17:30:06 -07:00
Chang Su	627974405d	[Lint] Add `python/sglang` to ruff F401 checks and remove unused imports in files (#11685 )	2025-10-17 16:49:46 -07:00
Lianmin Zheng	fdd7c69d65	[Auto Sync] Update common.py (20251017) (#11782 ) Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: Cheng Wan <54331508+ch-wan@users.noreply.github.com>	2025-10-17 15:03:42 -07:00
Chunyuan WU	8fcc69e7c4	Turn on shm_allreduce and shm_allgather for fp16 (#10725 )	2025-10-17 12:35:20 -07:00
Baizhou Zhang	b0d1d717e1	Revert "make radix cache deterministic" (#11728 )	2025-10-16 14:36:15 -07:00
Alex Chi Z	dc965db0e0	make radix cache deterministic (#10721 ) Signed-off-by: Alex Chi Z <iskyzh@gmail.com>	2025-10-14 21:01:52 +08:00
Yongtong Wu	a20e7df8d0	Improve dp attention port assignment scheme (#5889 ) Co-authored-by: Cheng Wan <cwan@x.ai>	2025-10-12 17:55:59 -07:00
Vincent Zhong	a220536f40	[ perf ] Replace json-> orjson in hot path (#11221 ) Signed-off-by: vincentzed <207368749+vincentzed@users.noreply.github.com>	2025-10-12 20:30:58 +08:00
Kai-Hsun Chen	43190becfa	[chore][1/N] Avoid using default mutable parameters (#11478 ) Signed-off-by: Kai-Hsun Chen <khchen@x.ai>	2025-10-12 20:26:39 +08:00
Liu-congo	c80a96dae9	[BugFix] test_mla_fp8.py fails on Cublas 12.9 (#11360 ) Signed-off-by: Liu-congo <1502632128@qq.com>	2025-10-10 21:14:24 -07:00
Lianmin Zheng	61055cb309	Reorder PD disagg CI tests (#11438 )	2025-10-10 17:56:49 -07:00
Yingchun Lai	0fe87213bb	fix: fix gpu-proc affinity set incorrectly when pp_size > 1 (#11389 )	2025-10-09 18:40:05 -07:00
Lianmin Zheng	9b8ebb2798	move more files under srt/utils (#11285 )	2025-10-09 16:46:15 -07:00
Netanel Haber	d6837aea4d	model: Support Hybrid Mamba2 NemotronHForCausalLM (nvidia/NVIDIA-Nemotron-Nano-9B-v2) (#10909 ) Signed-off-by: Netanel Haber <nhaber@nvidia.com>	2025-10-09 00:37:38 +08:00
fzyzcjy	efbc687c28	Support DeepSeek V3.2 Exp (#11061 ) Co-authored-by: Stefan He <11166516+hebiao064@users.noreply.github.com> Co-authored-by: Liangsheng Yin <95566987+hnyls2002@users.noreply.github.com> Co-authored-by: Baizhou Zhang <56809903+fridge003@users.noreply.github.com> Co-authored-by: DarkSharpness <76582120+darksharpness@users.noreply.github.com> Co-authored-by: ZhengdQin <46387172+zhengdqin@users.noreply.github.com> Co-authored-by: DarkSharpness <2040703891@qq.com> Co-authored-by: hnyls2002 <lsyincs@gmail.com> Co-authored-by: Zhengda Qin <zhengdqin@gmail.com> Co-authored-by: Liangsheng Yin <hnyls2002@gmail.com> Co-authored-by: HAI <hixiao@gmail.com> Co-authored-by: Baizhou Zhang <sobereddiezhang@gmail.com>	2025-10-06 00:24:15 -07:00
fzyzcjy	fdc4e1e570	Tiny move files to utils folder (#11166 )	2025-10-03 22:40:06 +08:00

16 Commits