sglang

Author	SHA1	Message	Date
Xu Wenqing	b9683be653	Support DeepSeek-V3.1 tool call (#9446) Signed-off-by: 许文卿 <xwq391974@alibaba-inc.com> Co-authored-by: Xinyuan Tong <xinyuantong.cs@gmail.com>	2025-08-26 20:22:19 -07:00
Jonas	a0a77d937b	Fix Harmony reasoning parser for and auto-separation for gpt-oss models (#9190) Co-authored-by: Chang Su <chang.s.su@oracle.com> Co-authored-by: Chayenne <zhaochen20@outlook.com> Co-authored-by: zhaochenyang20 <zhaochenyang20@gmail.com> Co-authored-by: minleminzui <2969413251@qq.com> Co-authored-by: maocheng23 <maocheng@berkeley.edu> Co-authored-by: Xinyuan Tong <xinyuantong.cs@gmail.com>	2025-08-25 15:26:26 -07:00
Chang Su	c9dd70fbde	tool-call(dsv3): Improve deepseek-v3 chat template and tool_choice = `required` (#9525)	2025-08-23 01:46:56 -07:00
Chang Su	a218490136	(gpt-oss, oai, chat): Remove Harmony Integration and Implement Native GPT-OSS Tool Call Support (#9043)	2025-08-11 18:59:18 -07:00
Binyao Jiang	f29aba8c6e	Support glm4.1v and glm4.5v (#8798) Signed-off-by: Xinyuan Tong <justinning0323@outlook.com> Signed-off-by: Xinyuan Tong <xinyuantong.cs@gmail.com> Co-authored-by: Xinyuan Tong <justinning0323@outlook.com> Co-authored-by: Xinyuan Tong <115166877+JustinTong0323@users.noreply.github.com> Co-authored-by: Xinyuan Tong <xinyuantong.cs@gmail.com> Co-authored-by: zRzRzRzRzRzRzR <2448370773@qq.com> Co-authored-by: Minglei Zhu <mingleizhu1122@gmail.com> Co-authored-by: Chang Su <csu272@usc.edu>	2025-08-09 00:59:13 -07:00
maocheng23	b3359dc9bf	Update qwen3_coder_detector.py for streaming (#8371)	2025-08-08 14:51:03 -07:00
Lianmin Zheng	91e2f902db	Fix kimi k2 function call format (#8968)	2025-08-08 13:25:14 -07:00
Chang Su	92cc32d9fc	Support v1/responses and use harmony in serving_chat (#8837) Signed-off-by: Xinyuan Tong <justinning0323@outlook.com> Signed-off-by: Xinyuan Tong <xinyuantong.cs@gmail.com> Co-authored-by: Xinyuan Tong <justinning0323@outlook.com> Co-authored-by: Xinyuan Tong <xinyuantong.cs@gmail.com>	2025-08-06 16:20:34 -07:00
Chang Su	51c38163c1	model: support Step3V (#8583) Signed-off-by: Xinyuan Tong <justinning0323@outlook.com> Co-authored-by: nnnobody-code <nnnobody@foxmail.com> Co-authored-by: ispobock <ispobaoke@gmail.com> Co-authored-by: Qiaolin-Yu <qy254@cornell.edu> Co-authored-by: Qiaolin-Yu <liin1211@outlook.com> Co-authored-by: Xinyuan Tong <justinning0323@outlook.com> Co-authored-by: Xinyuan Tong <115166877+JustinTong0323@users.noreply.github.com>	2025-07-31 02:41:00 -07:00
Binyao Jiang	581e7dcb92	GLM-4.5 Model Support Follow-up (#8445)	2025-07-27 23:35:20 -07:00
Yuxuan Zhang	6d6a8bc278	GLM-4.5 Model Support (#8224) Co-authored-by: Lifu Huang <lifu.hlf@gmail.com> Co-authored-by: Binyao Jiang <byjiang1996@gmail.com> Co-authored-by: Stefan He <hebiaobuaa@gmail.com>	2025-07-27 22:54:07 -07:00
Chang Su	f8260f2539	[Bugfix][Feat] Add XML-ish grammar in EBNFComposer and fix misc bugs in Qwen3 detector (#8357)	2025-07-25 12:03:16 -07:00
xianzhiT	624a3b8d1f	Fix incomplete tool call capture issue in streaming response of DeepSeek-V3 when enable MTP (#7562)	2025-07-23 17:40:23 -07:00
Chang Su	01079e174f	feat(function call): complete utility method for KimiK2Detector and enhance documentation (#8043)	2025-07-23 17:37:31 -07:00
yhyang201	0dfe2491ac	Preliminary Support for Qwen3XMLDetector (#8260) Co-authored-by: Chayenne <zhaochen20@outlook.com>	2025-07-23 06:49:38 +08:00
Atream	615553079d	Support Kimi K2 (#7940)	2025-07-11 00:02:21 -07:00
Chang Su	34b6b8426f	feat(func_call): Add more check in `BaseFormatDetector.parse_streaming_increment` (#7479)	2025-06-23 11:15:47 -07:00
Chang Su	72676cd6c0	feat(oai refactor): Replace `openai_api` with `entrypoints/openai` (#7351) Co-authored-by: Jin Pan <jpan236@wisc.edu>	2025-06-21 13:21:06 -07:00
Chang Su	fdfd5224bf	fix: Fix CI test_function_call_parser.py (#7425)	2025-06-22 00:25:08 +08:00
ehuaa	bec5891083	[BugFix] fix qwen25 invoke function call streaming responses with curly braces as the starting indicator (#7394)	2025-06-20 22:50:08 -07:00
Chang Su	e726131523	bugfix(tool call ebnf): Fix EBNF generation for optional function parameters (#7283)	2025-06-17 13:36:07 -07:00
Chang Su	f18b068f15	feat(tool call): Enhance Llama32Detector for improved JSON parsing in non-stream (#6784)	2025-05-30 17:05:17 -07:00
Chao Yang	4fac524b14	update llama4 chat template and pythonic parser (#6679) Co-authored-by: Chang Su <chang.s.su@oracle.com>	2025-05-30 17:01:22 -07:00
Chang Su	c673727e0e	refactor(tool call): Fix BaseFormatDetector tool_index issue and refactor `parse_streaming_increment` (#6715)	2025-05-29 00:08:45 -07:00
Chang Su	461a730280	fix(deepseekv3): Fix DeepSeekV3Detector tool_index assignment and multi-tool call streaming support (#6655)	2025-05-28 00:22:53 -07:00
Chang Su	41ba767f0c	feat: Add warnings for invalid tool_choice and UTs (#6582)	2025-05-27 16:53:19 -07:00
Chang Su	bdb962d755	fix(tool call): Fix tool_index in PythonicDetector and issues with mixed output in non-streaming (#6678)	2025-05-27 16:18:42 -07:00
Shi Shuai	755a36614b	fix: added "\n" to qwen25 tool parser structural tags (#6631)	2025-05-26 19:25:45 -07:00
Chang Su	16f69b1f65	feat: Improve Mistral and Qwen25 function call parsing (#6597)	2025-05-25 23:07:23 -07:00
Chang Su	ed0c3035cd	feat(Tool Calling): Support `required` and specific function mode (#6550)	2025-05-23 21:00:37 -07:00

30 Commits