Jintao Zhang
|
f9ee6ae17a
|
[router]: Add Embedding routing logic (#10129)
Signed-off-by: Jintao Zhang <zhangjintao9020@gmail.com>
Co-authored-by: Waël Boukhobza <wawa_wael@live.fr>
|
2025-09-14 18:44:35 -07:00 |
|
Keyang Ru
|
366043db8e
|
[router] Add get and cancel method for response api (#10387)
|
2025-09-12 16:19:38 -07:00 |
|
Simo Lin
|
2f173ea074
|
[router] allow one router to support different model families and serving mode (#10244)
|
2025-09-12 16:18:27 -07:00 |
|
Frank Fang
|
4634fd5953
|
[router] Add Rerank Routing Logic in Regular Router (#10219)
|
2025-09-12 09:10:18 -07:00 |
|
Keyang Ru
|
a23bdeaf04
|
[router] Basic OAI Response api (#10346)
|
2025-09-11 20:56:17 -07:00 |
|
Simo Lin
|
4f8a982d52
|
[router] clean up dependency injector to use ctx (#10000)
|
2025-09-03 21:35:51 -07:00 |
|
LukasBluebaum
|
9d9fa9a537
|
[router] Fix short timeout for the prefill client (#9803)
|
2025-09-01 19:57:04 -07:00 |
|
Simo Lin
|
5343058875
|
[router] grpc router bootstraps (#9759)
|
2025-08-28 12:07:06 -07:00 |
|