Commit Graph

10 Commits

Author SHA1 Message Date
Byron Hsu
2a717c5078 [Router] fix interrupt from terminal (#2413) 2024-12-08 16:58:41 -08:00
Byron Hsu
a1e697b25b [router] Improve cleanup logic (#2411) 2024-12-08 15:24:02 -08:00
Lianmin Zheng
2a02185c5f Rename DP_RANK to SGLANG_DP_RANK (#2218) 2024-11-27 09:36:36 -08:00
Byron Hsu
4d62bca542 [router] Replace print with logger (#2183) 2024-11-25 13:36:02 -08:00
Byron Hsu
4b0a1c9365 Replace prob based with threshold based load balancing (#2170) 2024-11-24 23:17:11 -08:00
Byron Hsu
32293a299c Improve sglang router (#2148) 2024-11-23 17:34:24 -08:00
Byron Hsu
cbedd1db1d [router] cache-aware load-balancing router v1 (#2114) 2024-11-23 08:34:48 -08:00
Byron Hsu
86c37d010a fix sglang_router not found (#2005) 2024-11-11 15:20:14 -08:00
Byron Hsu
00ffde206f setup router python binding ci (#1999) 2024-11-11 12:19:32 -08:00
Byron Hsu
f9633fa9b9 [rust] cache-aware DP - approx tree (#1934) 2024-11-10 21:57:32 -08:00