49 Commits

Author SHA1 Message Date
Chang Su
ca240eefb4 [router][grpc] Support parallel queue puts in grpc_request_manager and remove mutex for grpc_client (#11798) 2025-10-17 20:49:43 -07:00
Chang Su
dc01313da1 [router] Add rustfmt and set group imports by default (#11732) 2025-10-16 17:33:29 -07:00
Simo Lin
64affab495 [router] fix p and d worker filtering and bootstrap port handling (#11729) 2025-10-16 14:19:39 -07:00
Chang Su
2479b89405 [router][grpc] Simplify model_id determination (#11684) 2025-10-15 15:56:58 -07:00
Simo Lin
40e0082d8d [router] add worker self discovery for metadata (#11638) 2025-10-14 22:07:25 -04:00
Simo Lin
4b62af92ef [router] change worker api to async instead of sync (#11566) 2025-10-14 00:32:21 -07:00
Simo Lin
88bb627d0d [router] change grpc client from mutable to clone (#11394) 2025-10-09 11:00:24 -07:00
Chang Su
a1080b72a0 [router] Fix all unused_qualifications (#11341) 2025-10-08 13:55:27 -07:00
Simo Lin
fde9b96392 [router] cleanup worker health check to return early (#11310) 2025-10-07 16:53:10 -07:00
Simo Lin
5ee777c98f [router] add ipv6 support across all components (#11219) 2025-10-06 08:16:59 -07:00
Simo Lin
d736e0b65e [router] add grpc router pd mode for chat and generate (#11140) 2025-10-04 06:58:28 -07:00
Simo Lin
ffd03a9bd3 [router] fix get load response parsing (#11213) 2025-10-04 06:58:02 -07:00
Simo Lin
aae7ead2d0 [router] remove old/oudated/useless comments across code base (#10968) 2025-09-26 10:48:50 -07:00
Simo Lin
5d4fe1ceee [router] add move grpc worker management from router to worker manager (#10960) 2025-09-26 03:57:57 -07:00
Simo Lin
1b011e68dc [router] move grpc client from router to worker and builder (#10958) 2025-09-26 03:13:47 -07:00
Simo Lin
1e57b9472d [router] add grpc client get and set (#10955) 2025-09-26 03:07:05 -07:00
Chang Su
5e21d6aec0 refactor: Move grpc/client.rs to grpc_client/sglang_scheduler.rs (#10924) 2025-09-25 17:21:22 -04:00
Simo Lin
d511b2d905 [router] consolidate worker load monitoring (#10894) 2025-09-25 09:59:30 -04:00
Simo Lin
e738703547 [router] consolidate worker get loads (#10880) 2025-09-24 22:13:31 -04:00
Simo Lin
7a06ef984d [router] consolidate health endpoints and flush cache (#10876) 2025-09-24 15:23:21 -07:00
Simo Lin
97c3823931 [router] refactor router and worker management 3/n (#10727) 2025-09-22 12:17:50 -07:00
Jimmy
56321e9fc2 [Router]fix: fix get_load missing api_key (#10385) 2025-09-21 15:28:38 -04:00
Simo Lin
873d858b28 [router] refactor worker to builder pattern 5/n (#10653) 2025-09-19 05:43:23 -04:00
Simo Lin
4f2055ad56 [router] refactor worker to builder pattern 4/n (#10650) 2025-09-18 23:49:10 -07:00
Simo Lin
780d6a22cd [router] refactor worker to builder pattern 2/n (#10633) 2025-09-18 21:47:56 -07:00
Simo Lin
5291f32d75 [router] refactor worker to builder pattern 1/n (#10628) 2025-09-18 13:25:40 -07:00
Chang Su
35ef3f2902 [router] fix worker registration in multi model mode (#10486) 2025-09-15 21:05:00 -04:00
Simo Lin
7eccbe992d [router] fix service discovery and mcp ut (#10449) 2025-09-14 21:07:23 -07:00
Simo Lin
2f173ea074 [router] allow one router to support different model families and serving mode (#10244) 2025-09-12 16:18:27 -07:00
Simo Lin
8c86595c93 [router] enable sccache in ci and local build (#10099) 2025-09-12 09:43:48 -07:00
Simo Lin
bbf261ae4a [router] fix grpc connection mode detection (#9999) 2025-09-03 21:36:16 -07:00
Simo Lin
d966b902af [router] move tokenizer, reasoning, tool initialization to server (#9996) 2025-09-03 19:35:13 -07:00
Chang Su
11dcabc545 Grpc client (#9939) 2025-09-02 11:47:35 -07:00
Simo Lin
5343058875 [router] grpc router bootstraps (#9759) 2025-08-28 12:07:06 -07:00
Chang Su
90313fb09a [router] add token bucket rate limiter (#9656) 2025-08-26 10:36:26 -07:00
Simo Lin
3578eb1e9b [router] address worker load tracking consistency (#9523) Co-authored-by: fzyzcjy <5236035+fzyzcjy@users.noreply.github.com> 2025-08-26 06:40:51 -07:00
Jeff Nettleton
ce3ca9b02f [router] add cargo clippy in CI and fix-up linting errors (#9242) 2025-08-17 11:03:56 -07:00
Jeff Nettleton
d7e38b2f6d [router] clean up lint warnings with clippy execution (#9201) 2025-08-15 11:01:21 -07:00
Simo Lin
21b8846066 [router] allow more health check configuration (#9198) 2025-08-15 08:07:45 -07:00
Simo Lin
6f81a710f7 [pd-router] add retry and circuit breakfor for pd router (#9051) 2025-08-11 05:53:26 -07:00
Simo Lin
067068f271 [router] regular router circuit breaker (#8997) 2025-08-10 21:19:30 -07:00
Tony Lu
36bfddecb9 [router] add metrics for worker and policy (#8971) Signed-off-by: Tony Lu <tonyluj@gmail.com> 2025-08-08 13:41:40 -07:00
Simo Lin
61a4680494 [router] router circuit breaker core (#8941) 2025-08-08 09:20:22 -07:00
Tien Nguyen
fd05b56750 refactor(sgl-router): Replace once_cell with LazyLock in worker.rs and remove once_cell dependency from Cargo.toml (#8698) 2025-08-07 06:14:03 -07:00
Simo Lin
2fa0462c39 [router] introduce dp worker abstraction (#8639) 2025-08-04 06:42:20 -07:00
Rui Chen
a730ce8162 [feature] [sgl-router] Add a dp-aware routing strategy (#6869) 2025-07-30 05:58:48 -07:00
Simo Lin
fe6a445d1e [router] improve router logs and request id header (#8415) 2025-07-27 19:30:19 -07:00
Simo Lin
41d33e4736 [router] add ut for worker and errors (#8170) 2025-07-19 14:38:33 -07:00
Simo Lin
f2d5c4920e [router] add worker abstraction (#7960) 2025-07-11 20:17:48 -07:00