sglang

Author	SHA1	Message	Date
Yi Zhang	760b788a58	add qwen3-next doc (#10327)	2025-09-11 14:29:11 -07:00
Glen Liu	ebd0e1c18b	[doc] add walkthrough for implementing and hosting a simple llama wrapper m… (#10093)	2025-09-10 12:05:06 +08:00
eigen	b0fcbb74d0	[DOC]: some minor updates (#10134)	2025-09-07 14:58:15 -07:00
Netanel Haber	4cd08dc592	model: Support nvidia/Llama-3_1-Nemotron-Ultra-253B-v1 (#9301)	2025-08-26 15:33:40 +08:00
Netanel Haber	845d12a979	model: support nvidia/Llama-3_3-Nemotron-Super-49B-v1 (#9067) Co-authored-by: Kyle Huang <kylhuang@nvidia.com>	2025-08-17 01:48:15 -07:00
Zhihao Liu	65736dc524	[Model] Support Qwen3ForSequenceClassification for Qwen3-Embed Model (#7957)	2025-08-13 11:14:54 -07:00
Hangzhi	03d114496f	Fix typos in supported models documentation (#9119)	2025-08-12 13:35:24 -07:00
Yichao Cheng	fcc11e5ed5	update support new models doc (#9096) Signed-off-by: Xinyuan Tong <xinyuantong.cs@gmail.com> Co-authored-by: Xinyuan Tong <xinyuantong.cs@gmail.com>	2025-08-12 01:21:02 -07:00
Lianmin Zheng	2e8e7e353b	Improve docs and developer guide (#9044)	2025-08-10 21:05:18 -07:00
Lianmin Zheng	2449a0afe2	Refactor the docs (#9031)	2025-08-10 19:49:45 -07:00
Binyao Jiang	f29aba8c6e	Support glm4.1v and glm4.5v (#8798) Signed-off-by: Xinyuan Tong <justinning0323@outlook.com> Signed-off-by: Xinyuan Tong <xinyuantong.cs@gmail.com> Co-authored-by: Xinyuan Tong <justinning0323@outlook.com> Co-authored-by: Xinyuan Tong <115166877+JustinTong0323@users.noreply.github.com> Co-authored-by: Xinyuan Tong <xinyuantong.cs@gmail.com> Co-authored-by: zRzRzRzRzRzRzR <2448370773@qq.com> Co-authored-by: Minglei Zhu <mingleizhu1122@gmail.com> Co-authored-by: Chang Su <csu272@usc.edu>	2025-08-09 00:59:13 -07:00
Wenbo Yang	1132547496	Add ernie4.py for ERNIE-4.5 (#7657)	2025-08-08 00:55:48 -07:00
Praneth Paruchuri	d26ca84f39	Support bailing moe (#8680)	2025-08-05 20:40:34 -07:00
Adarsh Shirawalmath	ec5f944271	[Model] Add support for Arcee Foundational Model (#8154)	2025-07-30 10:45:25 -07:00
Praneth Paruchuri	83c104b188	Feat: Support for Persimmon Model (#7983)	2025-07-19 23:07:47 -07:00
Binyao Jiang	b7e951a6db	Feat: Support audio in Phi4-mm model (#8048)	2025-07-18 21:03:53 -07:00
Minglei Zhu	8a32355704	Feat: Support Granite 3.0 MoE in SGLang (#7959)	2025-07-17 20:56:03 -07:00
Praneth Paruchuri	cb736df854	Support for Phi-1.5 & Phi-2 models (#7862)	2025-07-13 18:43:40 -07:00
Binyao Jiang	2d54d4bb64	Feat: Support Phi-3.5-MoE in SGLang (#7907)	2025-07-09 23:51:33 -07:00
Xinyuan Tong	43e20c0647	Support Mimo-VL (#7579) Signed-off-by: Xinyuan Tong <justinning0323@outlook.com>	2025-07-08 14:00:25 -07:00
woodx	97011abc8a	[Doc] add embedding rerank doc (#7364)	2025-06-19 21:53:54 -07:00
Lifu Huang	98538822d5	Add Phi-4-mm to supported VLM supported model list. (#7178)	2025-06-13 23:17:40 -07:00
Marc Sun	37f1547587	[FEAT] Add transformers backend support (#5929)	2025-06-03 21:05:29 -07:00
Brayden Zhong	1aa0fbf416	Add note to add supported model to documentation (#6640)	2025-05-27 13:18:46 +08:00
simveit	506e5de8fe	Improve supported models doc (#6430)	2025-05-20 01:43:35 +08:00
applesaucethebun	6dc6b30637	Add missing model to doc (#6396) Co-authored-by: Brayden Zhong <b8zhong@uwaterloo.ca>	2025-05-18 12:57:58 -07:00
Vincent Zhong	e9ef39d2e9	docs: Update the MD files (#6373) Co-authored-by: Brayden Zhong <b8zhong@uwaterloo.ca>	2025-05-17 09:23:16 -07:00
Kiv Chen	64825b8395	model(vlm): mistral 3.1 (#5099) Co-authored-by: KivenChen <sleigh-queue-0y@icloud.com>	2025-05-16 18:36:18 -07:00
Mick	cd7c8a8de6	doc: update developer guide regarding mllms (#6138) Signed-off-by: Xinyuan Tong <justinning0323@outlook.com> Co-authored-by: XinyuanTong <115166877+JustinTong0323@users.noreply.github.com> Co-authored-by: Xinyuan Tong <justinning0323@outlook.com>	2025-05-14 23:13:13 +08:00
Kiv Chen	5380cd7ea3	model(vlm): pixtral (#5084)	2025-05-13 00:16:10 -07:00
Adarsh Shirawalmath	94d42b6794	[Docs] minor Qwen3 and reasoning parser docs fix (#6032)	2025-05-11 08:22:46 -07:00
mlmz	69276f619a	doc: fix the erroneous documents and example codes about Alibaba-NLP/gme-Qwen2-VL-2B-Instruct (#6199)	2025-05-11 08:22:11 -07:00
XinyuanTong	9d8ec2e67e	Fix and Clean up chat-template requirement for VLM (#6114) Signed-off-by: Xinyuan Tong <justinning0323@outlook.com>	2025-05-11 00:14:09 +08:00
liwenju0	8fefdd32c7	[Feature] add support kimi vl model (#5383) Co-authored-by: wenju.li <wenju.li@deepctr.cn>	2025-04-29 21:31:19 -07:00
Adarsh Shirawalmath	5c08aa4958	[Docs] Update docs for Qwen3 and Qwen3MoE (#5836)	2025-04-29 13:48:30 -07:00
Lianmin Zheng	5641a09458	Revert "[Model] Support `ArcticForCausalLM` architecture (Snowflake/snowflake-arctic-instruct)" (#5754)	2025-04-25 15:50:28 -07:00
Brayden Zhong	43fb95c2fa	[Model] Support `ArcticForCausalLM` architecture (Snowflake/snowflake-arctic-instruct) (#5078) Co-authored-by: vincent-4 <vincentzhongy+githubvincent4@gmail.com>	2025-04-25 15:24:09 +08:00
Michael Yao	7c99103f4c	[Doc] Fix two 404 links caused by sglang typo (#5667) Signed-off-by: windsonsea <haifeng.yao@daocloud.io>	2025-04-23 23:21:55 +08:00
Adarsh Shirawalmath	4aa6bab0b0	[Docs] Supported Model Docs - Major restructuring (#5290) Co-authored-by: zhaochenyang20 <zhaochen20@outlook.com>	2025-04-11 09:17:47 -07:00

39 Commits