Commit Graph

19 Commits

Author SHA1 Message Date
Simo Lin
2f173ea074 [router] allow one router to support different model families and serving mode (#10244) 2025-09-12 16:18:27 -07:00
Simo Lin
4f8a982d52 [router] clean up dependency injector to use ctx (#10000) 2025-09-03 21:35:51 -07:00
Simo Lin
5343058875 [router] grpc router bootstraps (#9759) 2025-08-28 12:07:06 -07:00
Bruce-x-1997
9e169ea8b5 [router] add right rustls dependency in sgl-router cargo.toml (#9498)
Co-authored-by: bruce.xu <bruce.xu@gmicloud.ai>
2025-08-24 09:03:15 -07:00
Jeff Nettleton
d7e38b2f6d [router] clean up lint warnings with clippy execution (#9201) 2025-08-15 11:01:21 -07:00
Simo Lin
21b8846066 [router] allow more health check configuration (#9198) 2025-08-15 08:07:45 -07:00
Simo Lin
9d68bdb240 [router] Add Rust Binary Entrypoint for SGLang Router (#9089) 2025-08-11 21:37:36 -07:00
Simo Lin
473400e452 [router] upgrade kube version to latest (#9018) 2025-08-09 22:49:45 -07:00
Simo Lin
61a4680494 [router] router circuit breaker core (#8941) 2025-08-08 09:20:22 -07:00
Simo Lin
354ac43555 [pd-router] Add Configurable Retry Logic for reduce backend pressure (#8744) 2025-08-04 20:42:07 -07:00
Simo Lin
828a4fe944 [router] Implement HTTP Dependency Injection Pattern for Router System (#8714) 2025-08-02 19:16:47 -07:00
Rui Chen
a730ce8162 [feature] [sgl-router] Add a dp-aware routing strategy (#6869) 2025-07-30 05:58:48 -07:00
Simo Lin
fe6a445d1e [router] improve router logs and request id header (#8415) 2025-07-27 19:30:19 -07:00
Simo Lin
c8f31042a8 [router] Refactor router and policy traits with dependency injection (#7987)
Co-authored-by: Jin Pan <jpan236@wisc.edu>
Co-authored-by: Keru Yang <rukeyang@gmail.com>
Co-authored-by: Yingyi Huang <yingyihuang2000@outlook.com>
Co-authored-by: Philip Zhu <phlipzhux@gmail.com>
2025-07-18 14:24:24 -07:00
Simo Lin
f2d5c4920e [router] add worker abstraction (#7960) 2025-07-11 20:17:48 -07:00
Simo Lin
30f2a44a96 [misc] Add PD service discovery support in router (#7361) 2025-06-22 17:54:14 -07:00
Arthur Cheng
ff91474825 [Router] Fix k8s Service Discovery (#6766)
Co-authored-by: Simo Lin <linsimo.mark@gmail.com>
2025-06-02 16:57:23 -07:00
Simo Lin
771669cbe0 [fix]: PyO3 macOS linking and consolidate on tracing for logging 2025-04-29 11:26:38 -07:00
Simo Lin
1468769bde [Misc] add service discovery for sgl router 2025-04-29 10:21:19 -07:00