Commit Graph

8 Commits

Author SHA1 Message Date
Lianmin Zheng
2a02185c5f Rename DP_RANK to SGLANG_DP_RANK (#2218) 2024-11-27 09:36:36 -08:00
Byron Hsu
4d62bca542 [router] Replace print with logger (#2183) 2024-11-25 13:36:02 -08:00
Byron Hsu
4b0a1c9365 Replace prob based with threshold based load balancing (#2170) 2024-11-24 23:17:11 -08:00
Byron Hsu
32293a299c Improve sglang router (#2148) 2024-11-23 17:34:24 -08:00
Byron Hsu
cbedd1db1d [router] cache-aware load-balancing router v1 (#2114) 2024-11-23 08:34:48 -08:00
Byron Hsu
86c37d010a fix sglang_router not found (#2005) 2024-11-11 15:20:14 -08:00
Byron Hsu
00ffde206f setup router python binding ci (#1999) 2024-11-11 12:19:32 -08:00
Byron Hsu
f9633fa9b9 [rust] cache-aware DP - approx tree (#1934) 2024-11-10 21:57:32 -08:00