Commit Graph

10 Commits

Author SHA1 Message Date
Jintao Zhang
f9ee6ae17a [router]: Add Embedding routing logic (#10129)
Signed-off-by: Jintao Zhang <zhangjintao9020@gmail.com>
Co-authored-by: Waël Boukhobza <wawa_wael@live.fr>
2025-09-14 18:44:35 -07:00
Keyang Ru
366043db8e [router] Add get and cancel method for response api (#10387) 2025-09-12 16:19:38 -07:00
Simo Lin
2f173ea074 [router] allow one router to support different model families and serving mode (#10244) 2025-09-12 16:18:27 -07:00
Frank Fang
4634fd5953 [router] Add Rerank Routing Logic in Regular Router (#10219) 2025-09-12 09:10:18 -07:00
Keyang Ru
a23bdeaf04 [router] Basic OAI Response api (#10346) 2025-09-11 20:56:17 -07:00
Simo Lin
4f8a982d52 [router] clean up dependency injector to use ctx (#10000) 2025-09-03 21:35:51 -07:00
Simo Lin
d966b902af [router] move tokenizer, reasoning, tool initialization to server (#9996) 2025-09-03 19:35:13 -07:00
Chang Su
11dcabc545 Grpc client (#9939) 2025-09-02 11:47:35 -07:00
Chang Su
9a0cac1be0 [router] add grpc pd and regular router init (#9893) 2025-09-01 20:06:15 -07:00
Simo Lin
5343058875 [router] grpc router bootstraps (#9759) 2025-08-28 12:07:06 -07:00