| Author | Commit | Message | Date |
|---|---|---|---|
| Simo Lin | 97c3823931 | [router] refactor router and worker management 3/n (#10727) | 2025-09-22 12:17:50 -07:00 |
| Jimmy | 56321e9fc2 | [Router]fix: fix get_load missing api_key (#10385) | 2025-09-21 15:28:38 -04:00 |
| Jintao Zhang | f9ee6ae17a | [router]: Add Embedding routing logic (#10129) (Signed-off-by: Jintao Zhang <zhangjintao9020@gmail.com>; Co-authored-by: Waël Boukhobza <wawa_wael@live.fr>) | 2025-09-14 18:44:35 -07:00 |
| Simo Lin | 2f173ea074 | [router] allow one router to support different model families and serving mode (#10244) | 2025-09-12 16:18:27 -07:00 |
| Keyang Ru | 7b141f816c | [router][ci] Add gpu utilization analyze with nvml (#10345) | 2025-09-11 19:26:02 -07:00 |
| Keyang Ru | 480d1b8b20 | [router] add benchmark for regular router and pd router (#10280) | 2025-09-11 12:04:11 -07:00 |
| Keyang Ru | cda7e47ce7 | [router] Add PD router mmlu test (#10256) | 2025-09-10 08:47:24 -07:00 |
| Keyang Ru | 9eb50ecc9c | [router] Improve the router e2e tests (#10102) | 2025-09-06 16:19:28 -07:00 |