gongwei-130
|
3fd1431df2
|
support enable in the reasoning field to enable thingking for thinkin… (#9715)
|
2025-08-29 10:57:32 -07:00 |
|
gongwei-130
|
9a7c8842ba
|
accomendate json schema in the "schema" field, not in "json_schema" field of response_format (#9786)
|
2025-08-28 23:51:50 -07:00 |
|
cicirori
|
b6c14ec0b4
|
add response_format support for completion API (#9665)
|
2025-08-26 15:01:29 -07:00 |
|
Xiaotong Jiang
|
0936c766ed
|
Fix kimi k2 function calling format (#9606)
|
2025-08-26 00:50:59 -07:00 |
|
Lifu Huang
|
29a610b4d9
|
Fix broken CI TestRequestLengthValidation (#9095)
|
2025-08-11 22:59:56 -07:00 |
|
Binyao Jiang
|
f29aba8c6e
|
Support glm4.1v and glm4.5v (#8798)
Signed-off-by: Xinyuan Tong <justinning0323@outlook.com>
Signed-off-by: Xinyuan Tong <xinyuantong.cs@gmail.com>
Co-authored-by: Xinyuan Tong <justinning0323@outlook.com>
Co-authored-by: Xinyuan Tong <115166877+JustinTong0323@users.noreply.github.com>
Co-authored-by: Xinyuan Tong <xinyuantong.cs@gmail.com>
Co-authored-by: zRzRzRzRzRzRzR <2448370773@qq.com>
Co-authored-by: Minglei Zhu <mingleizhu1122@gmail.com>
Co-authored-by: Chang Su <csu272@usc.edu>
|
2025-08-09 00:59:13 -07:00 |
|
Binyao Jiang
|
581e7dcb92
|
GLM-4.5 Model Support Follow-up (#8445)
|
2025-07-27 23:35:20 -07:00 |
|
Yuxuan Zhang
|
6d6a8bc278
|
GLM-4.5 Model Support (#8224)
Co-authored-by: Lifu Huang <lifu.hlf@gmail.com>
Co-authored-by: Binyao Jiang <byjiang1996@gmail.com>
Co-authored-by: Stefan He <hebiaobuaa@gmail.com>
|
2025-07-27 22:54:07 -07:00 |
|
Chang Su
|
58dd95fbc8
|
Fix test_openai_server (#8419)
|
2025-07-27 13:36:01 -07:00 |
|
Chang Su
|
b47eda3316
|
bugfix: Fix multiple finish_reason chunks and tool_calls finish reason check (#8417)
|
2025-07-27 13:31:06 -07:00 |
|
Binyao Jiang
|
e983d66680
|
Fix: Improve test_openai_function_calling unit test and fix reasoning_parser.py think_start_token logic (#8316)
Co-authored-by: Chang Su <chang.s.su@oracle.com>
|
2025-07-27 13:12:59 -07:00 |
|
xianzhiT
|
c87d4fec99
|
Fix the issue of incorrect finish reason in final stream response chunk returned during tool call (#7708)
Co-authored-by: Xinyuan Tong <115166877+JustinTong0323@users.noreply.github.com>
|
2025-07-23 13:28:53 -07:00 |
|
Chang Su
|
112b496a6c
|
misc: Improvement to serving_chat.py and add more ut (#7489)
|
2025-06-24 17:19:51 -07:00 |
|
Chang Su
|
fa42e41962
|
ci: Revert openai_server related tests in AMD suites (#7449)
|
2025-06-23 15:28:22 -07:00 |
|
Chang Su
|
34b6b8426f
|
feat(func_call): Add more check in BaseFormatDetector.parse_streaming_increment (#7479)
|
2025-06-23 11:15:47 -07:00 |
|
Chang Su
|
b7a2df0a44
|
refactor(test): reorganize OpenAI test file structure (#7408)
|
2025-06-21 19:37:48 -07:00 |
|