sglang

Author	SHA1	Message	Date
Lianmin Zheng	03464890e0	Separate two entry points: Engine and HTTP server (#2996) Co-authored-by: fzyzcjy <5236035+fzyzcjy@users.noreply.github.com>	2025-01-19 22:09:24 -08:00
Yineng Zhang	def5c31873	docs: update supported_models (#2987)	2025-01-20 00:44:30 +08:00
Mick	3d93f84a00	[Feature] Support minicpmv v2.6 (#2785) Co-authored-by: Chayenne <zhaochen20@outlook.com> Co-authored-by: yizhang2077 <1109276519@qq.com>	2025-01-18 14:14:19 -08:00
Lianmin Zheng	72c7776355	Fix linear.py and improve weight loading (#2851) Co-authored-by: SangBin Cho <rkooo567@gmail.com>	2025-01-13 01:39:14 -08:00
Fred Reiss	993956c6b1	Add support for IBM Granite 3.x models (#2437)	2024-12-11 06:30:23 -08:00
Lianmin Zheng	0e7409adb6	Fix the overlap for xgrammar (#2377)	2024-12-06 05:49:29 -08:00
vchzls	3cde5eb629	docs: Improve instructions for supporting new models (#2363) Co-authored-by: zhaohoulong <zhaohoulong@xiaomi.com>	2024-12-06 04:27:17 -08:00
Lianmin Zheng	dfec7fca06	Rename sglang.bench_latency to sglang.bench_one_batch (#2118)	2024-11-21 20:07:48 -08:00
Tanjiro	8c280cee55	add phi-3 small support (#2062) Co-authored-by: Tushar Goel <114812108+AI-Tushar@users.noreply.github.com>	2024-11-17 18:47:43 -08:00
RangiLyu	f18b9c7252	support internlm2-reward (#1994)	2024-11-11 15:09:58 -08:00
aqweteddy	f16eb15d0d	Gemma2 reward model support (#1954)	2024-11-07 22:42:27 -08:00
Lianmin Zheng	f5113e50ae	[Doc] improve relative links and structure (#1924)	2024-11-05 01:12:10 -08:00