| Lianmin Zheng | 2a02185c5f | Rename DP_RANK to SGLANG_DP_RANK (#2218) | 2024-11-27 09:36:36 -08:00 |
| Byron Hsu | 4d62bca542 | [router] Replace print with logger (#2183) | 2024-11-25 13:36:02 -08:00 |
| Byron Hsu | 4b0a1c9365 | Replace prob based with threshold based load balancing (#2170) | 2024-11-24 23:17:11 -08:00 |
| Byron Hsu | 32293a299c | Improve sglang router (#2148) | 2024-11-23 17:34:24 -08:00 |
| Byron Hsu | cbedd1db1d | [router] cache-aware load-balancing router v1 (#2114) | 2024-11-23 08:34:48 -08:00 |
| Byron Hsu | 86c37d010a | fix sglang_router not found (#2005) | 2024-11-11 15:20:14 -08:00 |
| Byron Hsu | 00ffde206f | setup router python binding ci (#1999) | 2024-11-11 12:19:32 -08:00 |
| Byron Hsu | f9633fa9b9 | [rust] cache-aware DP - approx tree (#1934) | 2024-11-10 21:57:32 -08:00 |