sglang

Author	SHA1	Message	Date
Pan Lyu	c913ed4046	support clip embedding model (#4506 )	2025-03-27 00:18:15 -07:00
Didier Durand	44f47d3ee1	Update supported_models.md: adding open-r1 Olympic Code 32B by HuggingFace (#4628 )	2025-03-27 00:16:16 -07:00
Mick	1e86457c90	model: Minicpmo (#3023 )	2025-03-24 20:08:40 -07:00
Ximingwang-09	22c3702e1e	[Model] Support Qwen2ForSequenceClassification (#4609 ) Co-authored-by: ximing.wxm <ximing.wxm@antgroup.com>	2025-03-24 19:13:44 -07:00
Adarsh Shirawalmath	fb8886037c	[Docs] Update docs for gemma3 and VLM chat templates (#4674 )	2025-03-22 08:02:19 -07:00
萝卜菜	d6d21640d3	[Feature] Support Deepseek-VL2 (#2798 ) Co-authored-by: Edenzzzz <wtan45@wisc.edu> Co-authored-by: Chayenne <zhaochen20@outlook.com> Co-authored-by: Yi Zhang <1109276519@qq.com>	2025-03-16 23:07:59 -07:00
Mick	9d02bb3e2a	Urgent model support: support gemma-3-it (#4424 )	2025-03-16 17:37:32 -07:00
江家瑋	26c372c13c	docs: Add Llama 3.3 to supported models (#4453 ) Signed-off-by: JiangJiaWei1103 <waynechuang97@gmail.com>	2025-03-15 16:33:43 -07:00
Mick	01090e8ac3	model: Support Janus-pro (#3203 )	2025-03-12 11:02:11 -07:00
Pan Lyu	361971b859	Add Support for Qwen2-VL Multi-modal Embedding Models (#3694 )	2025-03-06 16:46:20 -08:00
Adarsh Shirawalmath	19fd57bcd7	[docs] fix HF reference script command (#4148 )	2025-03-06 13:21:54 -08:00
Mick	45205d88a0	bench: Add MMMU benchmark for vLM (#3562 )	2025-02-22 08:10:59 -08:00
simveit	20b765a26e	Model: Support Qwen 72B RM model. (#3772 )	2025-02-21 14:38:21 -08:00
Mick	bcc213df61	Model: Support Qwen 2.5 vl (#3258 )	2025-02-16 00:58:53 -08:00
Mick	ced680663c	doc: Support a new vLM (#3405 ) Co-authored-by: zhaochenyang20 <zhaochen20@outlook.com>	2025-02-12 00:43:14 -08:00
Didier Durand	9490d15772	fix supported_models Qwen typo (#3498 )	2025-02-12 02:59:18 +08:00
Ravi Theja	9829e77e3f	Docs: Update supported models with Mistral 3 (#3229 ) Co-authored-by: Ravi Theja Desetty <ravitheja@Ravis-MacBook-Pro.local>	2025-01-31 00:01:46 -08:00
Mick	9f635ea50d	[Fix] Address remaining issues of supporting MiniCPMV (#2977 )	2025-01-28 00:22:13 -08:00
Adarsh Shirawalmath	4505a43614	[Docs] minor update for phi-3 and phi-4 (#3096 )	2025-01-24 04:00:20 -08:00
Lianmin Zheng	03464890e0	Separate two entry points: Engine and HTTP server (#2996 ) Co-authored-by: fzyzcjy <5236035+fzyzcjy@users.noreply.github.com>	2025-01-19 22:09:24 -08:00
Yineng Zhang	def5c31873	docs: update supported_models (#2987 )	2025-01-20 00:44:30 +08:00
Mick	3d93f84a00	[Feature] Support minicpmv v2.6 (#2785 ) Co-authored-by: Chayenne <zhaochen20@outlook.com> Co-authored-by: yizhang2077 <1109276519@qq.com>	2025-01-18 14:14:19 -08:00
Lianmin Zheng	72c7776355	Fix linear.py and improve weight loading (#2851 ) Co-authored-by: SangBin Cho <rkooo567@gmail.com>	2025-01-13 01:39:14 -08:00
Fred Reiss	993956c6b1	Add support for IBM Granite 3.x models (#2437 )	2024-12-11 06:30:23 -08:00
Lianmin Zheng	0e7409adb6	Fix the overlap for xgrammar (#2377 )	2024-12-06 05:49:29 -08:00
vchzls	3cde5eb629	docs: Improve instructions for supporting new models (#2363 ) Co-authored-by: zhaohoulong <zhaohoulong@xiaomi.com>	2024-12-06 04:27:17 -08:00
Lianmin Zheng	dfec7fca06	Rename sglang.bench_latency to sglang.bench_one_batch (#2118 )	2024-11-21 20:07:48 -08:00
Tanjiro	8c280cee55	add phi-3 small support (#2062 ) Co-authored-by: Tushar Goel <114812108+AI-Tushar@users.noreply.github.com>	2024-11-17 18:47:43 -08:00
RangiLyu	f18b9c7252	support internlm2-reward (#1994 )	2024-11-11 15:09:58 -08:00
aqweteddy	f16eb15d0d	Gemma2 reward model support (#1954 )	2024-11-07 22:42:27 -08:00
Lianmin Zheng	f5113e50ae	[Doc] improve relative links and structure (#1924 )	2024-11-05 01:12:10 -08:00

31 Commits