Commit Graph

11 Commits

Author SHA1 Message Date
Keyang Ru
7fb551a75d [router] add mcp list and mcp call in output array (#11112) 2025-09-30 21:41:54 -04:00
Simo Lin
f7bab41a29 [router] change log level to warning (#10926) 2025-09-25 17:32:59 -07:00
Chang Su
ee704e6265 [router] add auth middleware for api key auth (#10826) 2025-09-23 16:07:34 -07:00
Simo Lin
97c3823931 [router] refactor router and worker management 3/n (#10727) 2025-09-22 12:17:50 -07:00
Jimmy
56321e9fc2 [Router]fix: fix get_load missing api_key (#10385) 2025-09-21 15:28:38 -04:00
Jintao Zhang
f9ee6ae17a [router]: Add Embedding routing logic (#10129)
Signed-off-by: Jintao Zhang <zhangjintao9020@gmail.com>
Co-authored-by: Waël Boukhobza <wawa_wael@live.fr>
2025-09-14 18:44:35 -07:00
Simo Lin
2f173ea074 [router] allow one router to support different model families and serving mode (#10244) 2025-09-12 16:18:27 -07:00
Keyang Ru
7b141f816c [router][ci] Add gpu utilization analyze with nvml (#10345) 2025-09-11 19:26:02 -07:00
Keyang Ru
480d1b8b20 [router] add benchmark for regular router and pd router (#10280) 2025-09-11 12:04:11 -07:00
Keyang Ru
cda7e47ce7 [router] Add PD router mmlu test (#10256) 2025-09-10 08:47:24 -07:00
Keyang Ru
9eb50ecc9c [router] Improve the router e2e tests (#10102) 2025-09-06 16:19:28 -07:00