Commit Graph

102 Commits

Author SHA1 Message Date
Keyang Ru
d2478cd4ff [router] Fix response api related spec (#11621) 2025-10-15 09:59:38 -07:00
Simo Lin
3962e39d7c [router] cleanup app context and move to startup (#11617) 2025-10-14 10:19:28 -07:00
Simo Lin
0b9915c132 [router] update generate spec to align with sgl io struct (#11591) 2025-10-14 02:51:33 -04:00
Chang Su
27ef1459e6 [router][protocols] Add Axum validate extractor and use it for /v1/chat/completions endpoint (#11588) 2025-10-13 22:51:15 -07:00
Simo Lin
728af88781 [router] allow user to specify chat template path (#11549) 2025-10-13 10:47:57 -07:00
Keyang Ru
63e84352b7 [router] openai router: support grok model (#11511) 2025-10-12 22:44:43 -04:00
Keyang Ru
eb7d9261c0 [router] conversation item API: create, retrieve and delete (#11369) 2025-10-09 17:43:16 -04:00
Chang Su
ab926dd697 [router][grpc] Fix streaming bugs: empty tool names, state pollution, and panics (#11373) 2025-10-09 06:53:23 -04:00
Chang Su
a0557642ea [router][lint] Add unused_qualifications to cargo lint warnings (#11366) 2025-10-08 22:17:11 -07:00
Keyang Ru
84768d1017 [router] Refactor OpenAI router: split monolithic file and move location (#11359) 2025-10-09 00:46:39 -04:00
Chang Su
fccac7d126 [router][grpc] Add dependencies in Cargo.toml to support chat template rendering (#11342) 2025-10-08 15:38:37 -07:00
Keyang Ru
7ac6b900f4 [router] Support history management using conversation (#11339) 2025-10-08 15:24:02 -07:00
Keyang Ru
4ed67c27e3 [router] support Openai router conversation API CRUD (#11297) 2025-10-07 15:31:35 -07:00
Chang Su
64582caa84 [router][grpc] Refactor chat template content format detection (#11288) 2025-10-07 08:38:51 -07:00
Simo Lin
79d3495177 [router] add reasoning and tool parser argument in router (#11290) 2025-10-07 09:08:32 -04:00
Chang Su
466992b2d0 [router][tool call] Clean up redundant detect_format and has_tool_markers (#11270) 2025-10-06 14:04:02 -07:00
Simo Lin
5ee777c98f [router] add ipv6 support across all components (#11219) 2025-10-06 08:16:59 -07:00
Keyang Ru
34151f173b [router] Steaming support for MCP Tool Calls in OpenAI Router (#11173) 2025-10-03 00:19:43 -07:00
Chang Su
963175d5c0 [router][grpc] Support streaming for v1/chat/completions (#11179) 2025-10-02 14:35:16 -07:00
Chang Su
b658be6f6a [router][grpc] Support tool call parser in streaming (#11160) 2025-10-02 03:18:50 -07:00
Keyang Ru
a28b394fba [router] Add multi-turn tool calling loop support for MCP integration (#11143) 2025-10-01 12:50:21 -07:00
Keyang Ru
7fb551a75d [router] add mcp list and mcp call in output array (#11112) 2025-09-30 21:41:54 -04:00
Chang Su
4eeaff74a0 [router][tool call] Separate JsonParser and LlamaParser (#11073) 2025-09-29 10:26:37 -07:00
Chang Su
af4ab65606 [router][tool call] Improve normal content extraction and error handling (non-stream) (#11050) 2025-09-29 00:19:30 -07:00
Simo Lin
5519766a4d [router] fix chat template loading and tokenizer path (#10999) 2025-09-27 23:54:12 -04:00
Keyang Ru
72392f2908 [router] basic mcp support for openai router response api (#10978) 2025-09-27 21:49:33 -04:00
Chang Su
c1c8dd1dd0 [router][tool parser] Modify tool parser to return both normal text and tool calls (non-stream) (#10995) 2025-09-27 18:10:17 -04:00
Simo Lin
aae7ead2d0 [router] remove old/oudated/useless comments across code base (#10968) 2025-09-26 10:48:50 -07:00
Keyang Ru
a73eb8cd20 [router] Support Oracle DB(ATP) Data Connector (#10845) 2025-09-24 23:59:32 -04:00
Simo Lin
7a06ef984d [router] consolidate health endpoints and flush cache (#10876) 2025-09-24 15:23:21 -07:00
Chang Su
4a87ba217f router-grpc: Add tools processing and other paramters for apply_chat_template (#10877) 2025-09-24 15:23:06 -07:00
Chang Su
9209b209be router-grpc: Support jinja chat template content format detection (#10832) 2025-09-24 11:45:01 -07:00
Chang Su
ee704e6265 [router] add auth middleware for api key auth (#10826) 2025-09-23 16:07:34 -07:00
Keyang Ru
f4e3ebeb05 [router] Support streaming for Openai Router Response api (#10822) 2025-09-23 14:56:28 -07:00
Simo Lin
98c3b04ff2 [router] responses api POST and GET with local storage (#10581)
Co-authored-by: key4ng <rukeyang@gmail.com>
2025-09-23 09:12:02 -07:00
Simo Lin
97c3823931 [router] refactor router and worker management 3/n (#10727) 2025-09-22 12:17:50 -07:00
Qiaolin Yu
e2ac7888b8 [2/2] Support deterministic inference for temperature > 0 (#10678)
Co-authored-by: Baizhou Zhang <sobereddiezhang@gmail.com>
Co-authored-by: hebiao064 <hebiaobuaa@gmail.com>
2025-09-21 19:36:08 -07:00
Jimmy
56321e9fc2 [Router]fix: fix get_load missing api_key (#10385) 2025-09-21 15:28:38 -04:00
Simo Lin
1d1ce62495 [router] refactor router and worker management 2.5/n (#10677) 2025-09-19 20:54:40 -07:00
Simo Lin
00eb5eb721 [router] refactor router and worker management 2/n (#10666) 2025-09-19 12:37:57 -07:00
Simo Lin
873d858b28 [router] refactor worker to builder pattern 5/n (#10653) 2025-09-19 05:43:23 -04:00
Simo Lin
ac2a723bb3 [router] refactor worker to builder pattern 3/n (#10647) 2025-09-18 22:52:57 -07:00
Chang Su
5fe39e85a2 [router] fix router manager and router init in server (#10499) 2025-09-15 22:23:26 -07:00
Simo Lin
16e9335998 [router] add router db connector for responses api (#10487) 2025-09-15 22:04:56 -07:00
Simo Lin
7eccbe992d [router] fix service discovery and mcp ut (#10449) 2025-09-14 21:07:23 -07:00
Jintao Zhang
f9ee6ae17a [router]: Add Embedding routing logic (#10129)
Signed-off-by: Jintao Zhang <zhangjintao9020@gmail.com>
Co-authored-by: Waël Boukhobza <wawa_wael@live.fr>
2025-09-14 18:44:35 -07:00
Keyang Ru
366043db8e [router] Add get and cancel method for response api (#10387) 2025-09-12 16:19:38 -07:00
Simo Lin
2f173ea074 [router] allow one router to support different model families and serving mode (#10244) 2025-09-12 16:18:27 -07:00
Frank Fang
4634fd5953 [router] Add Rerank Routing Logic in Regular Router (#10219) 2025-09-12 09:10:18 -07:00
Keyang Ru
a23bdeaf04 [router] Basic OAI Response api (#10346) 2025-09-11 20:56:17 -07:00