27 Commits

Author SHA1 Message Date
Simo Lin
2f173ea074 [router] allow one router to support different model families and serving mode (#10244) 2025-09-12 16:18:27 -07:00
Frank Fang
4634fd5953 [router] Add Rerank Routing Logic in Regular Router (#10219) 2025-09-12 09:10:18 -07:00
Keyang Ru
a23bdeaf04 [router] Basic OAI Response api (#10346) 2025-09-11 20:56:17 -07:00
Simo Lin
d966b902af [router] move tokenizer, reasoning, tool initialization to server (#9996) 2025-09-03 19:35:13 -07:00
Chang Su
90313fb09a [router] add token bucket rate limiter (#9656) 2025-08-26 10:36:26 -07:00
Keyang Ru
5ef545e678 [router] Move all protocols to spec.rs file (#9519) 2025-08-22 14:18:47 -07:00
Keyang Ru
ce67b2d586 [router]restructure protocol modules for better organization (#9321) 2025-08-19 01:07:58 +00:00
Simo Lin
9d68bdb240 [router] Add Rust Binary Entrypoint for SGLang Router (#9089) 2025-08-11 21:37:36 -07:00
Simo Lin
a69b637014 [router] fix req handling order, improve serialization, remove retry (#8888) 2025-08-06 23:24:39 -07:00
Simo Lin
828a4fe944 [router] Implement HTTP Dependency Injection Pattern for Router System (#8714) 2025-08-02 19:16:47 -07:00
Simo Lin
66a398f49d [router] migrate router from actix to axum (#8479) 2025-07-30 17:47:19 -07:00
Simo Lin
fe6a445d1e [router] improve router logs and request id header (#8415) 2025-07-27 19:30:19 -07:00
Simo Lin
8fcc55cfa1 [router] router metrics cleanup (#8158) 2025-07-18 22:09:17 -07:00
Simo Lin
c8f31042a8 [router] Refactor router and policy traits with dependency injection (#7987) 2025-07-18 14:24:24 -07:00
Co-authored-by: Jin Pan <jpan236@wisc.edu>
Co-authored-by: Keru Yang <rukeyang@gmail.com>
Co-authored-by: Yingyi Huang <yingyihuang2000@outlook.com>
Co-authored-by: Philip Zhu <phlipzhux@gmail.com>
Simo Lin
f2d5c4920e [router] add worker abstraction (#7960) 2025-07-11 20:17:48 -07:00
Zilin Zhu
82f021e22e [router] add --log-level to sgl-router (#6512) 2025-07-02 19:33:04 -07:00
Simo Lin
09ae5b20f3 Merge PDLB (Prefill-Decode Load Balancer) into SGLang Router (#7096) 2025-06-19 02:28:15 +08:00
Arthur Cheng
ff91474825 [Router] Fix k8s Service Discovery (#6766) 2025-06-02 16:57:23 -07:00
Co-authored-by: Simo Lin <linsimo.mark@gmail.com>
Chao Yang
1a39979993 Sgl-router Prometheus metrics endpoint and usage track metrics (#6537) 2025-05-24 22:28:15 -07:00
Zilin Zhu
669caa0a3f [router] support http2 in router (#6487) 2025-05-21 01:42:45 -07:00
Zilin Zhu
e3bed74afb [router] Add /list_workers endpoint to router (#6366) 2025-05-17 09:49:02 -07:00
Simo Lin
1468769bde [Misc] add service discovery for sgl router 2025-04-29 10:21:19 -07:00
Simo Lin
f0365820e8 [Misc] add structure logging, write to file and log tracing for SGL Router 2025-04-27 16:54:10 -07:00
Yinghai Lu
f4d7ab7a63 [sgl-router] improvement to avoid hang (#4482) 2025-03-17 10:37:50 -07:00
Co-authored-by: Yineng Zhang <me@zhyncs.com>
Co-authored-by: Byron Hsu <byronhsu1230@gmail.com>
Byron Hsu
9a0cc2e90e [router] Forward all request headers from router to workers (#3070) 2025-01-23 20:30:31 -08:00
Byron Hsu
0311ce8e1c [router] Expose worker startup secs & Return error instead of panic for router init (#3016) 2025-01-20 12:45:13 -08:00
Ata Fatahi
e3b3acfa6f Rename rust folder to sgl-router (#2464) 2024-12-12 09:40:41 -08:00
Signed-off-by: Ata Fatahi <immrata@gmail.com>