ybyang
|
dbb16bedd5
|
Support Thinking Budget (via custom_logit_processor for OpenAI API) [Fix #6572] (#11416)
Signed-off-by: ybyang <ybyang7@iflytek.com>
Co-authored-by: YorkSu <york_su@qq.com>
|
2025-10-21 16:27:56 +08:00 |
|
Chang Su
|
7ba3de0e92
|
[oai serving chat] Add argument --sampling-defaults and fix ChatCompletionRequest defaults (#11304)
|
2025-10-08 00:36:05 +00:00 |
|
Vincent Zhong
|
36a6b8dbfc
|
Update v1/responses to be more OpenAI-compatible. (#9624)
|
2025-10-05 18:47:46 +00:00 |
|
fzyzcjy
|
fdc4e1e570
|
Tiny move files to utils folder (#11166)
|
2025-10-03 22:40:06 +08:00 |
|
Tejesh Anand
|
8cc27fdc46
|
Use jsonschema to constrain required or specific tool choice (#10550)
|
2025-09-27 13:18:50 -04:00 |
|
eraser00
|
0ac6114694
|
Replace the Kimi-K2 generated tool call idx with history tool call count (#10612)
Co-authored-by: eraser00 <eraser00@github.com>
|
2025-09-25 18:47:40 -07:00 |
|
Xinyuan Tong
|
f3b5db6ee8
|
Feat: support disable tool parser (#10184)
|
2025-09-10 14:03:55 -07:00 |
|
gongwei-130
|
3fd1431df2
|
support enable in the reasoning field to enable thinking for thinkin… (#9715)
|
2025-08-29 10:57:32 -07:00 |
|
gongwei-130
|
9a7c8842ba
|
accommodate json schema in the "schema" field, not in "json_schema" field of response_format (#9786)
|
2025-08-28 23:51:50 -07:00 |
|
cicirori
|
b6c14ec0b4
|
add response_format support for completion API (#9665)
|
2025-08-26 15:01:29 -07:00 |
|
Xiaotong Jiang
|
0936c766ed
|
Fix kimi k2 function calling format (#9606)
|
2025-08-26 00:50:59 -07:00 |
|
Chang Su
|
b47eda3316
|
bugfix: Fix multiple finish_reason chunks and tool_calls finish reason check (#8417)
|
2025-07-27 13:31:06 -07:00 |
|
Chang Su
|
112b496a6c
|
misc: Improvement to serving_chat.py and add more ut (#7489)
|
2025-06-24 17:19:51 -07:00 |
|
Chang Su
|
b7a2df0a44
|
refactor(test): reorganize OpenAI test file structure (#7408)
|
2025-06-21 19:37:48 -07:00 |
|