Byron Hsu
|
92d1253e58
|
Bump sglang-router to 0.0.10 for env name change (#2226)
|
2024-11-27 11:23:32 -08:00 |
|
Lianmin Zheng
|
2a02185c5f
|
Rename DP_RANK to SGLANG_DP_RANK (#2218)
|
2024-11-27 09:36:36 -08:00 |
|
Byron Hsu
|
0b46b951ae
|
Fix rust warning (#2208)
|
2024-11-26 15:00:41 -08:00 |
|
Byron Hsu
|
2763c0a73a
|
Bump router to 0.0.9 with better logging (#2207)
|
2024-11-26 13:30:28 -08:00 |
|
Byron Hsu
|
1f76fc6e3f
|
[router] Rust e2e test (#2184)
|
2024-11-25 16:02:03 -08:00 |
|
Byron Hsu
|
4d62bca542
|
[router] Replace print with logger (#2183)
|
2024-11-25 13:36:02 -08:00 |
|
dependabot[bot]
|
5ada33ffa0
|
Bump rustls from 0.23.16 to 0.23.18 in /rust (#2182)
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
|
2024-11-26 03:22:33 +08:00 |
|
Byron Hsu
|
a866b65e1d
|
Bump rust router to 0.0.8
|
2024-11-24 23:17:38 -08:00 |
|
Byron Hsu
|
4b0a1c9365
|
Replace prob based with threshold based load balancing (#2170)
|
2024-11-24 23:17:11 -08:00 |
|
Henry Hyeonmok Ko
|
dbe1729395
|
Merged three native APIs into one: get_server_info (#2152)
|
2024-11-24 01:37:58 -08:00 |
|
Byron Hsu
|
32293a299c
|
Improve sglang router (#2148)
|
2024-11-23 17:34:24 -08:00 |
|
Byron Hsu
|
7921690893
|
add prefix match for certain tenant (#2147)
|
2024-11-23 15:35:02 -08:00 |
|
Byron Hsu
|
bbb81c2457
|
Add more api routes (completion, health, etc) to the router (#2146)
|
2024-11-23 15:10:26 -08:00 |
|
Byron Hsu
|
145c0ddc2d
|
update router doc (#2143)
|
2024-11-23 11:01:04 -08:00 |
|
Byron Hsu
|
505d7f71a6
|
Bump sglang-router to 0.0.5 (#2142)
|
2024-11-23 08:35:46 -08:00 |
|
Byron Hsu
|
cbedd1db1d
|
[router] cache-aware load-balancing router v1 (#2114)
|
2024-11-23 08:34:48 -08:00 |
|
Tzu Gwo
|
df246e699d
|
chore: open lto and optimization in release profile (#2028)
|
2024-11-14 15:02:39 +08:00 |
|
Byron Hsu
|
602ebc661d
|
remove sglang folder in rust (#2010)
|
2024-11-11 20:45:52 -08:00 |
|
Byron Hsu
|
86c37d010a
|
fix sglang_router not found (#2005)
|
2024-11-11 15:20:14 -08:00 |
|
Byron Hsu
|
3e33574374
|
run rust test on ubuntu instead of 1-gpu-runner (#2003)
|
2024-11-11 14:46:08 -08:00 |
|
Byron Hsu
|
0d94f1dd03
|
Bump router to 0.0.3 (#2004)
|
2024-11-11 14:42:22 -08:00 |
|
Byron Hsu
|
239eafbd2e
|
Fix rust unit test and pypi token (#2001)
|
2024-11-11 14:18:21 -08:00 |
|
Byron Hsu
|
00ffde206f
|
setup router python binding ci (#1999)
|
2024-11-11 12:19:32 -08:00 |
|
Byron Hsu
|
f9633fa9b9
|
[rust] cache-aware DP - approx tree (#1934)
|
2024-11-10 21:57:32 -08:00 |
|
Chayenne
|
c77c1e05ba
|
fix black in pre-commit (#1940)
|
2024-11-08 07:42:47 +08:00 |
|
Austin Liu
|
4b1d7a2583
|
Add Rust Router Python Binding (#1891)
Signed-off-by: Austin Liu <austin362667@gmail.com>
Co-authored-by: ByronHsu <byronhsu1230@gmail.com>
|
2024-11-06 18:08:30 -08:00 |
|
Xuehai Pan
|
a5e0defb5a
|
minor: Add basic editorconfig and pre-commit hooks to enforce style for whitespaces (#1926)
|
2024-11-06 13:46:04 +00:00 |
|
Byron Hsu
|
96766101b4
|
[rust] refactor server and router (#1922)
|
2024-11-06 00:02:02 -08:00 |
|
Byron Hsu
|
530ff541cf
|
[router] Impl radix tree and set up CI (#1893)
Co-authored-by: Lianmin Zheng <lianminzheng@gmail.com>
|
2024-11-04 10:56:52 -08:00 |
|
Byron Hsu
|
3839be2913
|
[Router] Add a rust-based router (#1790)
|
2024-10-28 09:49:48 -07:00 |
|