Keyang Ru
|
eb7d9261c0
|
[router] conversation item API: create, retrieve and delete (#11369)
|
2025-10-09 17:43:16 -04:00 |
|
Simo Lin
|
88bb627d0d
|
[router] change grpc client from mutable to clone (#11394)
|
2025-10-09 11:00:24 -07:00 |
|
Chang Su
|
ab926dd697
|
[router][grpc] Fix streaming bugs: empty tool names, state pollution, and panics (#11373)
|
2025-10-09 06:53:23 -04:00 |
|
Chang Su
|
a0557642ea
|
[router][lint] Add unused_qualifications to cargo lint warnings (#11366)
|
2025-10-08 22:17:11 -07:00 |
|
Keyang Ru
|
84768d1017
|
[router] Refactor OpenAI router: split monolithic file and move location (#11359)
|
2025-10-09 00:46:39 -04:00 |
|
Simo Lin
|
368fd20622
|
[router][grpc] disable health check generation and increase timeout (#11353)
|
2025-10-08 19:23:08 -07:00 |
|
Chang Su
|
fccac7d126
|
[router][grpc] Add dependencies in Cargo.toml to support chat template rendering (#11342)
|
2025-10-08 15:38:37 -07:00 |
|
Keyang Ru
|
7ac6b900f4
|
[router] Support history management using conversation (#11339)
|
2025-10-08 15:24:02 -07:00 |
|
Chang Su
|
a1080b72a0
|
[router] Fix all unused_qualifications (#11341)
|
2025-10-08 13:55:27 -07:00 |
|
Chang Su
|
a65ca73911
|
[router][grpc] Cleanup debug logs in grpc_server and grpc_router (#11340)
|
2025-10-08 13:26:19 -07:00 |
|
Simo Lin
|
677aa0e25f
|
[router] improve reasoning parser lock and reduce req cloning (#11336)
|
2025-10-08 11:18:15 -07:00 |
|
Simo Lin
|
01c9ee1ab4
|
[router] refactor generate to use new pipeline arch (#11323)
|
2025-10-08 09:38:50 -07:00 |
|
Chang Su
|
edd86b8853
|
[router][grpc] Refactor chat handler in grpc/ to use centralized orchestrator (#11314)
Co-authored-by: Simo Lin <linsimo.mark@gmail.com>
|
2025-10-07 20:50:20 -07:00 |
|
Simo Lin
|
fde9b96392
|
[router] cleanup worker health check to return early (#11310)
|
2025-10-07 16:53:10 -07:00 |
|
Keyang Ru
|
4ed67c27e3
|
[router] support OpenAI router conversation API CRUD (#11297)
|
2025-10-07 15:31:35 -07:00 |
|
Chang Su
|
420c99acfe
|
[router][grpc] Fix error message format in grpc chat handler (#11307)
|
2025-10-07 13:54:02 -07:00 |
|
Simo Lin
|
f4affd4df5
|
[router] fix grpc connection conversion and add optimization (#11305)
|
2025-10-07 10:39:33 -07:00 |
|
Chang Su
|
64582caa84
|
[router][grpc] Refactor chat template content format detection (#11288)
|
2025-10-07 08:38:51 -07:00 |
|
Simo Lin
|
2fcd56eaf6
|
[router] add get server info and get model info in grpc server (#11303)
|
2025-10-07 08:36:52 -07:00 |
|
Simo Lin
|
79d3495177
|
[router] add reasoning and tool parser argument in router (#11290)
|
2025-10-07 09:08:32 -04:00 |
|
Chang Su
|
a578d300ba
|
[router][grpc] Fix proto3 default value mismatches and cleanup unused fields (#11283)
|
2025-10-06 18:54:51 -07:00 |
|
Chang Su
|
b07c9c76c5
|
[router][grpc] Refine streaming processes (#11277)
|
2025-10-06 15:15:01 -07:00 |
|
Chang Su
|
466992b2d0
|
[router][tool call] Clean up redundant detect_format and has_tool_markers (#11270)
|
2025-10-06 14:04:02 -07:00 |
|
Simo Lin
|
5ee777c98f
|
[router] add ipv6 support across all components (#11219)
|
2025-10-06 08:16:59 -07:00 |
|
Simo Lin
|
d736e0b65e
|
[router] add grpc router pd mode for chat and generate (#11140)
|
2025-10-04 06:58:28 -07:00 |
|
Simo Lin
|
ffd03a9bd3
|
[router] fix get load response parsing (#11213)
|
2025-10-04 06:58:02 -07:00 |
|
Keyang Ru
|
34151f173b
|
[router] Streaming support for MCP Tool Calls in OpenAI Router (#11173)
|
2025-10-03 00:19:43 -07:00 |
|
Chang Su
|
963175d5c0
|
[router][grpc] Support streaming for v1/chat/completions (#11179)
|
2025-10-02 14:35:16 -07:00 |
|
Liangsheng Yin
|
7ff740a6ce
|
Remove dp balance metadata and minimum token balance. (#11170)
|
2025-10-03 01:48:15 +08:00 |
|
Chang Su
|
b658be6f6a
|
[router][grpc] Support tool call parser in streaming (#11160)
|
2025-10-02 03:18:50 -07:00 |
|
Keyang Ru
|
a28b394fba
|
[router] Add multi-turn tool calling loop support for MCP integration (#11143)
|
2025-10-01 12:50:21 -07:00 |
|
Keyang Ru
|
7fb551a75d
|
[router] add mcp list and mcp call in output array (#11112)
|
2025-09-30 21:41:54 -04:00 |
|
Chang Su
|
8ce830a8b0
|
[router][bugfix] Fix input_logprobs handling with None value and logprob_start_len = -1 (#11113)
|
2025-09-30 16:09:40 -07:00 |
|
Chang Su
|
d1676cd483
|
[router][tool call] Full support for ToolChoice (#11085)
Co-authored-by: Simo Lin <linsimo.mark@gmail.com>
|
2025-09-29 22:36:03 -07:00 |
|
Simo Lin
|
33b3c0f85f
|
[router] grpc router generate endpoint support (#11070)
Co-authored-by: Chang Su <chang.s.su@oracle.com>
|
2025-09-29 22:07:53 -07:00 |
|
Chang Su
|
5937a56d47
|
[router][grpc] Add logprobs support to router (#11082)
|
2025-09-29 15:55:06 -07:00 |
|
Chang Su
|
f065e5bea5
|
[router] Use get_pooled in process_single_choice (#11079)
|
2025-09-29 15:48:00 -07:00 |
|
Chang Su
|
4eeaff74a0
|
[router][tool call] Separate JsonParser and LlamaParser (#11073)
|
2025-09-29 10:26:37 -07:00 |
|
Simo Lin
|
816b3a433a
|
[router] add n to generate sampling params (#11069)
|
2025-09-29 07:37:43 -07:00 |
|
Chang Su
|
af4ab65606
|
[router][tool call] Improve normal content extraction and error handling (non-stream) (#11050)
|
2025-09-29 00:19:30 -07:00 |
|
Simo Lin
|
2572886367
|
[router] add harmony tool parser base structure and interface (#11036)
|
2025-09-28 19:46:38 -07:00 |
|
Chang Su
|
dba751a896
|
[router][tool call] Support normal content extraction before tool call (streaming) (#11038)
|
2025-09-28 19:46:06 -07:00 |
|
Simo Lin
|
336e9a6058
|
[router] migrate to rust python module for pythonic parser (#11033)
|
2025-09-28 14:48:59 -04:00 |
|
Yuxuan Zhang
|
abb6781573
|
Update GLM-4.5 Model Doc (#11017)
|
2025-09-28 11:21:27 -07:00 |
|
Simo Lin
|
5519766a4d
|
[router] fix chat template loading and tokenizer path (#10999)
|
2025-09-27 23:54:12 -04:00 |
|
Keyang Ru
|
72392f2908
|
[router] basic mcp support for openai router response api (#10978)
|
2025-09-27 21:49:33 -04:00 |
|
Chang Su
|
c1c8dd1dd0
|
[router][tool parser] Modify tool parser to return both normal text and tool calls (non-stream) (#10995)
|
2025-09-27 18:10:17 -04:00 |
|
Chang Su
|
37f3325b06
|
[router][grpc] Support E2E non-stream chat completions (#10980)
|
2025-09-26 22:02:06 -07:00 |
|
Chang Su
|
0c3db88978
|
[router][grpc] Add helper functions for decoder in router.rs and fix specs (#10971)
|
2025-09-26 20:10:45 -04:00 |
|
Simo Lin
|
aae7ead2d0
|
[router] remove old/outdated/useless comments across code base (#10968)
|
2025-09-26 10:48:50 -07:00 |
|