sglang

Author	SHA1	Message	Date
Vincent Zhong	36a6b8dbfc	Update `v1/responses` to be more OpenAI-compatible. (#9624 )	2025-10-05 18:47:46 +00:00
fzyzcjy	fdc4e1e570	Tiny move files to utils folder (#11166 )	2025-10-03 22:40:06 +08:00
Tejesh Anand	8cc27fdc46	Use jsonschema to constrain required or specific tool choice (#10550 )	2025-09-27 13:18:50 -04:00
eraser00	0ac6114694	Replace the Kimi-K2 generated tool call idx with history tool call count (#10612 ) Co-authored-by: eraser00 <eraser00@github.com>	2025-09-25 18:47:40 -07:00
Lianmin Zheng	35ec2a45a8	[minor] Remove deprecated function `get_ip` (#10883 )	2025-09-25 16:18:04 -07:00
Xinyuan Tong	71f24ef8f6	feat: add cache_salt support to request (#10718 ) Signed-off-by: Xinyuan Tong <xinyuantong.cs@gmail.com>	2025-09-23 23:30:25 -07:00
Xinyuan Tong	f3b5db6ee8	Feat: support disable tool parser (#10184 )	2025-09-10 14:03:55 -07:00
gongwei-130	3fd1431df2	support enable in the reasoning field to enable thingking for thinkin… (#9715 )	2025-08-29 10:57:32 -07:00
gongwei-130	9a7c8842ba	accomendate json schema in the "schema" field, not in "json_schema" field of response_format (#9786 )	2025-08-28 23:51:50 -07:00
cicirori	b6c14ec0b4	add `response_format` support for `completion` API (#9665 )	2025-08-26 15:01:29 -07:00
Xiaotong Jiang	0936c766ed	Fix kimi k2 function calling format (#9606 )	2025-08-26 00:50:59 -07:00
Lifu Huang	29a610b4d9	Fix broken CI TestRequestLengthValidation (#9095 )	2025-08-11 22:59:56 -07:00
Binyao Jiang	f29aba8c6e	Support glm4.1v and glm4.5v (#8798 ) Signed-off-by: Xinyuan Tong <justinning0323@outlook.com> Signed-off-by: Xinyuan Tong <xinyuantong.cs@gmail.com> Co-authored-by: Xinyuan Tong <justinning0323@outlook.com> Co-authored-by: Xinyuan Tong <115166877+JustinTong0323@users.noreply.github.com> Co-authored-by: Xinyuan Tong <xinyuantong.cs@gmail.com> Co-authored-by: zRzRzRzRzRzRzR <2448370773@qq.com> Co-authored-by: Minglei Zhu <mingleizhu1122@gmail.com> Co-authored-by: Chang Su <csu272@usc.edu>	2025-08-09 00:59:13 -07:00
Binyao Jiang	581e7dcb92	GLM-4.5 Model Support Follow-up (#8445 )	2025-07-27 23:35:20 -07:00
Yuxuan Zhang	6d6a8bc278	GLM-4.5 Model Support (#8224 ) Co-authored-by: Lifu Huang <lifu.hlf@gmail.com> Co-authored-by: Binyao Jiang <byjiang1996@gmail.com> Co-authored-by: Stefan He <hebiaobuaa@gmail.com>	2025-07-27 22:54:07 -07:00
Chang Su	58dd95fbc8	Fix test_openai_server (#8419 )	2025-07-27 13:36:01 -07:00
Chang Su	b47eda3316	bugfix: Fix multiple finish_reason chunks and tool_calls finish reason check (#8417 )	2025-07-27 13:31:06 -07:00
Binyao Jiang	e983d66680	Fix: Improve test_openai_function_calling unit test and fix reasoning_parser.py think_start_token logic (#8316 ) Co-authored-by: Chang Su <chang.s.su@oracle.com>	2025-07-27 13:12:59 -07:00
xianzhiT	c87d4fec99	Fix the issue of incorrect finish reason in final stream response chunk returned during tool call (#7708 ) Co-authored-by: Xinyuan Tong <115166877+JustinTong0323@users.noreply.github.com>	2025-07-23 13:28:53 -07:00
Chang Su	112b496a6c	misc: Improvement to serving_chat.py and add more ut (#7489 )	2025-06-24 17:19:51 -07:00
Chang Su	fa42e41962	ci: Revert openai_server related tests in AMD suites (#7449 )	2025-06-23 15:28:22 -07:00
Chang Su	34b6b8426f	feat(func_call): Add more check in `BaseFormatDetector.parse_streaming_increment` (#7479 )	2025-06-23 11:15:47 -07:00
Chang Su	b7a2df0a44	refactor(test): reorganize OpenAI test file structure (#7408 )	2025-06-21 19:37:48 -07:00

23 Commits