Commit Graph

35 Commits

Author SHA1 Message Date
Simo Lin
ddcba74b4d [router] Worker Management Workflow Engine (#11868) 2025-10-20 17:00:22 -07:00
Chang Su
dc01313da1 [router] Add rustfmt and set group imports by default (#11732) 2025-10-16 17:33:29 -07:00
Simo Lin
4b62af92ef [router] change worker api to async instead of sync (#11566) 2025-10-14 00:32:21 -07:00
Simo Lin
2eeb27515a [router] disable rate limiter by default (#11435) 2025-10-10 20:43:07 -04:00
Keyang Ru
7ac6b900f4 [router] Support history management using conversation (#11339) 2025-10-08 15:24:02 -07:00
Keyang Ru
4ed67c27e3 [router] support Openai router conversation API CRUD (#11297) 2025-10-07 15:31:35 -07:00
Simo Lin
79d3495177 [router] add reasoning and tool parser argument in router (#11290) 2025-10-07 09:08:32 -04:00
Chang Su
b658be6f6a [router][grpc] Support tool call parser in streaming (#11160) 2025-10-02 03:18:50 -07:00
Simo Lin
aae7ead2d0 [router] remove old/oudated/useless comments across code base (#10968) 2025-09-26 10:48:50 -07:00
Simo Lin
a7fe6e10a1 [router] remove old/oudated/useless comments (#10967) 2025-09-26 09:45:15 -07:00
Simo Lin
d511b2d905 [router] consolidate worker load monitoring (#10894) 2025-09-25 09:59:30 -04:00
Simo Lin
97c3823931 [router] refactor router and worker management 3/n (#10727) 2025-09-22 12:17:50 -07:00
Jimmy
56321e9fc2 [Router]fix: fix get_load missing api_key (#10385) 2025-09-21 15:28:38 -04:00
Simo Lin
00eb5eb721 [router] refactor router and worker management 2/n (#10666) 2025-09-19 12:37:57 -07:00
Simo Lin
16e9335998 [router] add router db connector for responses api (#10487) 2025-09-15 22:04:56 -07:00
Simo Lin
7eccbe992d [router] fix service discovery and mcp ut (#10449) 2025-09-14 21:07:23 -07:00
Simo Lin
2f173ea074 [router] allow one router to support different model families and serving mode (#10244) 2025-09-12 16:18:27 -07:00
Simo Lin
4f8a982d52 [router] clean up dependency injector to use ctx (#10000) 2025-09-03 21:35:51 -07:00
Simo Lin
5343058875 [router] grpc router bootstraps (#9759) 2025-08-28 12:07:06 -07:00
Bruce-x-1997
9e169ea8b5 [router] add right rustls dependency in sgl-router cargo.toml (#9498)
Co-authored-by: bruce.xu <bruce.xu@gmicloud.ai>
2025-08-24 09:03:15 -07:00
Jeff Nettleton
d7e38b2f6d [router] clean up lint warnings with clippy execution (#9201) 2025-08-15 11:01:21 -07:00
Simo Lin
21b8846066 [router] allow more health check configuration (#9198) 2025-08-15 08:07:45 -07:00
Simo Lin
9d68bdb240 [router] Add Rust Binary Entrypoint for SGLang Router (#9089) 2025-08-11 21:37:36 -07:00
Simo Lin
473400e452 [router] upgrade kube version to latest (#9018) 2025-08-09 22:49:45 -07:00
Simo Lin
61a4680494 [router] router circuit breaker core (#8941) 2025-08-08 09:20:22 -07:00
Simo Lin
354ac43555 [pd-router] Add Configurable Retry Logic for reduce backend pressure (#8744) 2025-08-04 20:42:07 -07:00
Simo Lin
828a4fe944 [router] Implement HTTP Dependency Injection Pattern for Router System (#8714) 2025-08-02 19:16:47 -07:00
Rui Chen
a730ce8162 [feature] [sgl-router] Add a dp-aware routing strategy (#6869) 2025-07-30 05:58:48 -07:00
Simo Lin
fe6a445d1e [router] improve router logs and request id header (#8415) 2025-07-27 19:30:19 -07:00
Simo Lin
c8f31042a8 [router] Refactor router and policy traits with dependency injection (#7987)
Co-authored-by: Jin Pan <jpan236@wisc.edu>
Co-authored-by: Keru Yang <rukeyang@gmail.com>
Co-authored-by: Yingyi Huang <yingyihuang2000@outlook.com>
Co-authored-by: Philip Zhu <phlipzhux@gmail.com>
2025-07-18 14:24:24 -07:00
Simo Lin
f2d5c4920e [router] add worker abstraction (#7960) 2025-07-11 20:17:48 -07:00
Simo Lin
30f2a44a96 [misc] Add PD service discovery support in router (#7361) 2025-06-22 17:54:14 -07:00
Arthur Cheng
ff91474825 [Router] Fix k8s Service Discovery (#6766)
Co-authored-by: Simo Lin <linsimo.mark@gmail.com>
2025-06-02 16:57:23 -07:00
Simo Lin
771669cbe0 [fix]: PyO3 macOS linking and consolidate on tracing for logging 2025-04-29 11:26:38 -07:00
Simo Lin
1468769bde [Misc] add service discovery for sgl router 2025-04-29 10:21:19 -07:00