Commit Graph

23 Commits

Author SHA1 Message Date
Chang Su
35ef3f2902 [router] fix worker registration in multi model mode (#10486) 2025-09-15 21:05:00 -04:00
Simo Lin
7eccbe992d [router] fix service discovery and mcp ut (#10449) 2025-09-14 21:07:23 -07:00
Simo Lin
2f173ea074 [router] allow one router to support different model families and serving mode (#10244) 2025-09-12 16:18:27 -07:00
Simo Lin
8c86595c93 [router] enable sccache in ci and local build (#10099) 2025-09-12 09:43:48 -07:00
Simo Lin
bbf261ae4a [router] fix grpc connection mode detection (#9999) 2025-09-03 21:36:16 -07:00
Simo Lin
d966b902af [router] move tokenizer, reasoning, tool initialization to server (#9996) 2025-09-03 19:35:13 -07:00
Chang Su
11dcabc545 Grpc client (#9939) 2025-09-02 11:47:35 -07:00
Simo Lin
5343058875 [router] grpc router bootstraps (#9759) 2025-08-28 12:07:06 -07:00
Chang Su
90313fb09a [router] add token bucket rate limiter (#9656) 2025-08-26 10:36:26 -07:00
Simo Lin
3578eb1e9b [router] address worker load tracking consistency (#9523)
Co-authored-by: fzyzcjy <5236035+fzyzcjy@users.noreply.github.com>
2025-08-26 06:40:51 -07:00
Jeff Nettleton
ce3ca9b02f [router] add cargo clippy in CI and fix-up linting errors (#9242) 2025-08-17 11:03:56 -07:00
Jeff Nettleton
d7e38b2f6d [router] clean up lint warnings with clippy execution (#9201) 2025-08-15 11:01:21 -07:00
Simo Lin
21b8846066 [router] allow more health check configuration (#9198) 2025-08-15 08:07:45 -07:00
Simo Lin
6f81a710f7 [pd-router] add retry and circuit breakfor for pd router (#9051) 2025-08-11 05:53:26 -07:00
Simo Lin
067068f271 [router] regular router circuit breaker (#8997) 2025-08-10 21:19:30 -07:00
Tony Lu
36bfddecb9 [router] add metrics for worker and policy (#8971)
Signed-off-by: Tony Lu <tonyluj@gmail.com>
2025-08-08 13:41:40 -07:00
Simo Lin
61a4680494 [router] router circuit breaker core (#8941) 2025-08-08 09:20:22 -07:00
Tien Nguyen
fd05b56750 refactor(sgl-router): Replace once_cell with LazyLock in worker.rs and remove once_cell dependency from Cargo.toml (#8698) 2025-08-07 06:14:03 -07:00
Simo Lin
2fa0462c39 [router] introduce dp worker abstraction (#8639) 2025-08-04 06:42:20 -07:00
Rui Chen
a730ce8162 [feature] [sgl-router] Add a dp-aware routing strategy (#6869) 2025-07-30 05:58:48 -07:00
Simo Lin
fe6a445d1e [router] improve router logs and request id header (#8415) 2025-07-27 19:30:19 -07:00
Simo Lin
41d33e4736 [router] add ut for worker and errors (#8170) 2025-07-19 14:38:33 -07:00
Simo Lin
f2d5c4920e [router] add worker abstraction (#7960) 2025-07-11 20:17:48 -07:00