Simo Lin | 2f173ea074 | [router] allow one router to support different model families and serving mode (#10244) | 2025-09-12 16:18:27 -07:00
Frank Fang | 4634fd5953 | [router] Add Rerank Routing Logic in Regular Router (#10219) | 2025-09-12 09:10:18 -07:00
Keyang Ru | a23bdeaf04 | [router] Basic OAI Response api (#10346) | 2025-09-11 20:56:17 -07:00
Simo Lin | d966b902af | [router] move tokenizer, reasoning, tool initialization to server (#9996) | 2025-09-03 19:35:13 -07:00
Chang Su | 90313fb09a | [router] add token bucket rate limiter (#9656) | 2025-08-26 10:36:26 -07:00
Keyang Ru | 5ef545e678 | [router] Move all protocols to spec.rs file (#9519) | 2025-08-22 14:18:47 -07:00
Keyang Ru | ce67b2d586 | [router]restructure protocol modules for better organization (#9321) | 2025-08-19 01:07:58 +00:00
Simo Lin | 9d68bdb240 | [router] Add Rust Binary Entrypoint for SGLang Router (#9089) | 2025-08-11 21:37:36 -07:00
Simo Lin | a69b637014 | [router] fix req handling order, improve serialization, remove retry (#8888) | 2025-08-06 23:24:39 -07:00
Simo Lin | 828a4fe944 | [router] Implement HTTP Dependency Injection Pattern for Router System (#8714) | 2025-08-02 19:16:47 -07:00
Simo Lin | 66a398f49d | [router] migrate router from actix to axum (#8479) | 2025-07-30 17:47:19 -07:00
Simo Lin | fe6a445d1e | [router] improve router logs and request id header (#8415) | 2025-07-27 19:30:19 -07:00
Simo Lin | 8fcc55cfa1 | [router] router metrics cleanup (#8158) | 2025-07-18 22:09:17 -07:00
Simo Lin | c8f31042a8 | [router] Refactor router and policy traits with dependency injection (#7987); Co-authored-by: Jin Pan <jpan236@wisc.edu>; Co-authored-by: Keru Yang <rukeyang@gmail.com>; Co-authored-by: Yingyi Huang <yingyihuang2000@outlook.com>; Co-authored-by: Philip Zhu <phlipzhux@gmail.com> | 2025-07-18 14:24:24 -07:00
Simo Lin | f2d5c4920e | [router] add worker abstraction (#7960) | 2025-07-11 20:17:48 -07:00
Zilin Zhu | 82f021e22e | [router] add --log-level to sgl-router (#6512) | 2025-07-02 19:33:04 -07:00
Simo Lin | 09ae5b20f3 | Merge PDLB (Prefill-Decode Load Balancer) into SGLang Router (#7096) | 2025-06-19 02:28:15 +08:00
Arthur Cheng | ff91474825 | [Router] Fix k8s Service Discovery (#6766); Co-authored-by: Simo Lin <linsimo.mark@gmail.com> | 2025-06-02 16:57:23 -07:00
Chao Yang | 1a39979993 | Sgl-router Prometheus metrics endpoint and usage track metrics (#6537) | 2025-05-24 22:28:15 -07:00
Zilin Zhu | 669caa0a3f | [router] support http2 in router (#6487) | 2025-05-21 01:42:45 -07:00
Zilin Zhu | e3bed74afb | [router] Add /list_workers endpoint to router (#6366) | 2025-05-17 09:49:02 -07:00
Simo Lin | 1468769bde | [Misc] add service discovery for sgl router | 2025-04-29 10:21:19 -07:00
Simo Lin | f0365820e8 | [Misc] add structure logging, write to file and log tracing for SGL Router | 2025-04-27 16:54:10 -07:00
Yinghai Lu | f4d7ab7a63 | [sgl-router] improvement to avoid hang (#4482); Co-authored-by: Yineng Zhang <me@zhyncs.com>; Co-authored-by: Byron Hsu <byronhsu1230@gmail.com> | 2025-03-17 10:37:50 -07:00
Byron Hsu | 9a0cc2e90e | [router] Forward all request headers from router to workers (#3070) | 2025-01-23 20:30:31 -08:00
Byron Hsu | 0311ce8e1c | [router] Expose worker startup secs & Return error instead of panic for router init (#3016) | 2025-01-20 12:45:13 -08:00
Ata Fatahi | e3b3acfa6f | Rename rust folder to sgl-router (#2464); Signed-off-by: Ata Fatahi <immrata@gmail.com> | 2024-12-12 09:40:41 -08:00