30 Commits

Author SHA1 Message Date
Byron Hsu
92d1253e58 Bump sglang-router to 0.0.10 for env name change (#2226) 2024-11-27 11:23:32 -08:00
Lianmin Zheng
2a02185c5f Rename DP_RANK to SGLANG_DP_RANK (#2218) 2024-11-27 09:36:36 -08:00
Byron Hsu
0b46b951ae Fix rust warning (#2208) 2024-11-26 15:00:41 -08:00
Byron Hsu
2763c0a73a Bump router to 0.0.9 with better logging (#2207) 2024-11-26 13:30:28 -08:00
Byron Hsu
1f76fc6e3f [router] Rust e2e test (#2184) 2024-11-25 16:02:03 -08:00
Byron Hsu
4d62bca542 [router] Replace print with logger (#2183) 2024-11-25 13:36:02 -08:00
dependabot[bot]
5ada33ffa0 Bump rustls from 0.23.16 to 0.23.18 in /rust (#2182)
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-11-26 03:22:33 +08:00
Byron Hsu
a866b65e1d Bump rust router to 0.0.8 2024-11-24 23:17:38 -08:00
Byron Hsu
4b0a1c9365 Replace prob based with threshold based load balancing (#2170) 2024-11-24 23:17:11 -08:00
Henry Hyeonmok Ko
dbe1729395 Merged three native APIs into one: get_server_info (#2152) 2024-11-24 01:37:58 -08:00
Byron Hsu
32293a299c Improve sglang router (#2148) 2024-11-23 17:34:24 -08:00
Byron Hsu
7921690893 add prefix match for certain tenant (#2147) 2024-11-23 15:35:02 -08:00
Byron Hsu
bbb81c2457 Add more api routes (completion, health, etc) to the router (#2146) 2024-11-23 15:10:26 -08:00
Byron Hsu
145c0ddc2d update router doc (#2143) 2024-11-23 11:01:04 -08:00
Byron Hsu
505d7f71a6 Bump sglang-router to 0.0.5 (#2142) 2024-11-23 08:35:46 -08:00
Byron Hsu
cbedd1db1d [router] cache-aware load-balancing router v1 (#2114) 2024-11-23 08:34:48 -08:00
Tzu Gwo
df246e699d chore: open lto and optimization in release profile (#2028) 2024-11-14 15:02:39 +08:00
Byron Hsu
602ebc661d remove sglang folder in rust (#2010) 2024-11-11 20:45:52 -08:00
Byron Hsu
86c37d010a fix sglang_router not found (#2005) 2024-11-11 15:20:14 -08:00
Byron Hsu
3e33574374 run rust test on ubuntu instead of 1-gpu-runner (#2003) 2024-11-11 14:46:08 -08:00
Byron Hsu
0d94f1dd03 Bump router to 0.0.3 (#2004) 2024-11-11 14:42:22 -08:00
Byron Hsu
239eafbd2e Fix rust unit test and pypi token (#2001) 2024-11-11 14:18:21 -08:00
Byron Hsu
00ffde206f setup router python binding ci (#1999) 2024-11-11 12:19:32 -08:00
Byron Hsu
f9633fa9b9 [rust] cache-aware DP - approx tree (#1934) 2024-11-10 21:57:32 -08:00
Chayenne
c77c1e05ba fix black in pre-commit (#1940) 2024-11-08 07:42:47 +08:00
Austin Liu
4b1d7a2583 Add Rust Router Python Binding (#1891)
Signed-off-by: Austin Liu <austin362667@gmail.com>
Co-authored-by: ByronHsu <byronhsu1230@gmail.com>
2024-11-06 18:08:30 -08:00
Xuehai Pan
a5e0defb5a minor: Add basic editorconfig and pre-commit hooks to enforce style for whitespaces (#1926) 2024-11-06 13:46:04 +00:00
Byron Hsu
96766101b4 [rust] refactor server and router (#1922) 2024-11-06 00:02:02 -08:00
Byron Hsu
530ff541cf [router] Impl radix tree and set up CI (#1893)
Co-authored-by: Lianmin Zheng <lianminzheng@gmail.com>
2024-11-04 10:56:52 -08:00
Byron Hsu
3839be2913 [Router] Add a rust-based router (#1790) 2024-10-28 09:49:48 -07:00