Commit Graph

26 Commits

Author SHA1 Message Date
Chang Su
90313fb09a [router] add token bucket rate limiter (#9656) 2025-08-26 10:36:26 -07:00
Simo Lin
21b8846066 [router] allow more health check configuration (#9198) 2025-08-15 08:07:45 -07:00
Simo Lin
067068f271 [router] regular router circuit breaker (#8997) 2025-08-10 21:19:30 -07:00
Simo Lin
500b15c960 [router] upgrade router version to 0.1.9 (#8844) 2025-08-07 09:29:12 -07:00
Simo Lin
39decec10b [router] upgrade router version to 0.1.8 (#8645) 2025-07-31 19:00:23 -07:00
Simo Lin
2886e23dbd [bugfix] fix router python parser for pd urls (#8644) 2025-07-31 18:09:31 -07:00
Simo Lin
66a398f49d [router] migrate router from actix to axum (#8479) 2025-07-30 17:47:19 -07:00
Rui Chen
a730ce8162 [feature] [sgl-router] Add a dp-aware routing strategy (#6869) 2025-07-30 05:58:48 -07:00
Simo Lin
fe6a445d1e [router] improve router logs and request id header (#8415) 2025-07-27 19:30:19 -07:00
Simo Lin
2ab97023e3 [router] add different policies for p node and d node (#8395) 2025-07-27 00:39:20 -07:00
Simo Lin
b763cf7e8e [router] allow router to have empty workers (#8160) 2025-07-18 22:09:54 -07:00
Zilin Zhu
82f021e22e [router] add --log-level to sgl-router (#6512) 2025-07-02 19:33:04 -07:00
Simo Lin
30f2a44a96 [misc] Add PD service discovery support in router (#7361) 2025-06-22 17:54:14 -07:00
Simo Lin
09ae5b20f3 Merge PDLB (Prefill-Decode Load Balancer) into SGLang Router (#7096) 2025-06-19 02:28:15 +08:00
Chao Yang
1a39979993 Sgl-router Prometheus metrics endpoint and usage track metrics (#6537) 2025-05-24 22:28:15 -07:00
Lifu Huang
3cf1473a09 Use monotonic clock for interval measurement (#6211)
Signed-off-by: Lifu Huang <lifu.hlf@gmail.com>
2025-05-17 16:49:18 -07:00
Simo Lin
1468769bde [Misc] add service discovery for sgl router 2025-04-29 10:21:19 -07:00
Simo Lin
f0365820e8 [Misc] add structure logging, write to file and log tracing for SGL Router 2025-04-27 16:54:10 -07:00
Byron Hsu
8d8ef8497e bump router to 0.1.4 (#3094) 2025-01-23 20:32:43 -08:00
Byron Hsu
3ad4cd4915 bump router to 0.1.3 (#3020) 2025-01-20 14:38:06 -08:00
Byron Hsu
3a8428ecaa [router] Expose worker startup interval (#3019) 2025-01-20 14:36:54 -08:00
Byron Hsu
0311ce8e1c [router] Expose worker startup secs & Return error instead of panic for router init (#3016) 2025-01-20 12:45:13 -08:00
Lianmin Zheng
03464890e0 Separate two entry points: Engine and HTTP server (#2996)
Co-authored-by: fzyzcjy <5236035+fzyzcjy@users.noreply.github.com>
2025-01-19 22:09:24 -08:00
Lianmin Zheng
cd493b5afc Improve metrics, logging, and importing orders (#2992) 2025-01-19 18:36:59 -08:00
Byron Hsu
ef18b0eda2 [router] Allow empty worker list for sglang.launch_router (#2979) 2025-01-19 01:05:23 -08:00
Ata Fatahi
e3b3acfa6f Rename rust folder to sgl-router (#2464)
Signed-off-by: Ata Fatahi <immrata@gmail.com>
2024-12-12 09:40:41 -08:00