20 Commits

Author SHA1 Message Date
Keyang Ru
4093d460ce [CI] migrate router to BM.A10.4 runner (#8992)
Co-authored-by: key4ng <rukeyang@gamil.com>
2025-08-11 22:41:18 -07:00
Simo Lin
067068f271 [router] regular router circuit breaker (#8997) 2025-08-10 21:19:30 -07:00
Michael Yao
061c8959ff Fix typos in py_test/test_launch_server.py (#6227)
Signed-off-by: windsonsea <haifeng.yao@daocloud.io>
2025-07-31 12:48:47 -07:00
Simo Lin
66a398f49d [router] migrate router from actix to axum (#8479) 2025-07-30 17:47:19 -07:00
Rui Chen
a730ce8162 [feature] [sgl-router] Add a dp-aware routing strategy (#6869) 2025-07-30 05:58:48 -07:00
Simo Lin
b763cf7e8e [router] allow router to have empty workers (#8160) 2025-07-18 22:09:54 -07:00
Simo Lin
c8f31042a8 [router] Refactor router and policy traits with dependency injection (#7987)
Co-authored-by: Jin Pan <jpan236@wisc.edu>
Co-authored-by: Keru Yang <rukeyang@gmail.com>
Co-authored-by: Yingyi Huang <yingyihuang2000@outlook.com>
Co-authored-by: Philip Zhu <phlipzhux@gmail.com>
2025-07-18 14:24:24 -07:00
Zilin Zhu
82f021e22e [router] add --log-level to sgl-router (#6512) 2025-07-02 19:33:04 -07:00
Simo Lin
30f2a44a96 [misc] Add PD service discovery support in router (#7361) 2025-06-22 17:54:14 -07:00
Simo Lin
09ae5b20f3 Merge PDLB (Prefill-Decode Load Balancer) into SGLang Router (#7096) 2025-06-19 02:28:15 +08:00
Chao Yang
1a39979993 Sgl-router Prometheus metrics endpoint and usage track metrics (#6537) 2025-05-24 22:28:15 -07:00
Lifu Huang
6e2da51561 Replace time.time() to time.perf_counter() for benchmarking. (#6178)
Signed-off-by: Lifu Huang <lifu.hlf@gmail.com>
2025-05-11 14:32:49 -07:00
Simo Lin
1468769bde [Misc] add service discovery for sgl router 2025-04-29 10:21:19 -07:00
Simo Lin
f0365820e8 [Misc] add structure logging, write to file and log tracing for SGL Router 2025-04-27 16:54:10 -07:00
Byron Hsu
8cc300f536 Fix router test (#4483) 2025-03-16 22:49:47 -07:00
Byron Hsu
9a0cc2e90e [router] Forward all request headers from router to workers (#3070) 2025-01-23 20:30:31 -08:00
Byron Hsu
3a8428ecaa [router] Expose worker startup interval (#3019) 2025-01-20 14:36:54 -08:00
Byron Hsu
0311ce8e1c [router] Expose worker startup secs & Return error instead of panic for router init (#3016) 2025-01-20 12:45:13 -08:00
Byron Hsu
ef18b0eda2 [router] Allow empty worker list for sglang.launch_router (#2979) 2025-01-19 01:05:23 -08:00
Ata Fatahi
e3b3acfa6f Rename rust folder to sgl-router (#2464)
Signed-off-by: Ata Fatahi <immrata@gmail.com>
2024-12-12 09:40:41 -08:00