Commit Graph

6 Commits

Author SHA1 Message Date
Lianmin Zheng
2a02185c5f Rename DP_RANK to SGLANG_DP_RANK (#2218) 2024-11-27 09:36:36 -08:00
Byron Hsu
4d62bca542 [router] Replace print with logger (#2183) 2024-11-25 13:36:02 -08:00
Byron Hsu
4b0a1c9365 Replace prob based with threshold based load balancing (#2170) 2024-11-24 23:17:11 -08:00
Byron Hsu
32293a299c Improve sglang router (#2148) 2024-11-23 17:34:24 -08:00
Byron Hsu
cbedd1db1d [router] cache-aware load-balancing router v1 (#2114) 2024-11-23 08:34:48 -08:00
Byron Hsu
86c37d010a fix sglang_router not found (#2005) 2024-11-11 15:20:14 -08:00