Keyang Ru | 3828db4309 | [router] Add IGW (Inference Gateway) Feature Flag (#9371); Co-authored-by: Yineng Zhang <me@zhyncs.com> | 2025-08-20 17:38:57 -07:00
Jeff Nettleton | ce3ca9b02f | [router] add cargo clippy in CI and fix-up linting errors (#9242) | 2025-08-17 11:03:56 -07:00
Keyang Ru | 4093d460ce | [CI] migrate router to BM.A10.4 runner (#8992); Co-authored-by: key4ng <rukeyang@gamil.com> | 2025-08-11 22:41:18 -07:00
Lianmin Zheng | 2c7f01bc89 | Reorganize CI and test files (#9027) | 2025-08-10 12:30:06 -07:00
Simo Lin | a9fd80336d | [router] allow longer time out for router e2e (#8560) | 2025-07-29 23:43:37 -07:00
Simo Lin | 3abc30364d | [ci] add router benchmark script and CI (#7498) | 2025-06-25 01:28:25 -07:00
Yineng Zhang | ad1ae7f7cd | use topk_softmax with sgl-kernel (#4439) | 2025-03-14 15:59:06 -07:00
Byron Hsu | ef18b0eda2 | [router] Allow empty worker list for sglang.launch_router (#2979) | 2025-01-19 01:05:23 -08:00
Ata Fatahi | ce094a5d79 | Clean up GPU memory after killing sglang processes (#2457); Signed-off-by: Ata Fatahi <immrata@gmail.com> | 2024-12-17 03:42:40 -08:00
Ata Fatahi | e3b3acfa6f | Rename rust folder to sgl-router (#2464); Signed-off-by: Ata Fatahi <immrata@gmail.com> | 2024-12-12 09:40:41 -08:00
Ata Fatahi | 2ac36b9a7b | Make request payload size configurable (#2444); Signed-off-by: Ata Fatahi <immrata@gmail.com> | 2024-12-11 16:55:21 -08:00
Byron Hsu | c36736c841 | [router] Add remove worker api (#2380) | 2024-12-06 17:16:03 -08:00
Byron Hsu | 64fceab8af | [router] use 2-gpu-runner (#2368) | 2024-12-06 14:13:57 +08:00
Byron Hsu | 1f76fc6e3f | [router] Rust e2e test (#2184) | 2024-11-25 16:02:03 -08:00
Byron Hsu | 3e33574374 | run rust test on ubuntu instead of 1-gpu-runner (#2003) | 2024-11-11 14:46:08 -08:00
Lianmin Zheng | a1f32867ca | Update pr-test-rust.yml to add a "finish" step (#1975) | 2024-11-09 13:53:35 -08:00
Xuehai Pan | a5e0defb5a | minor: Add basic editorconfig and pre-commit hooks to enforce style for whitespaces (#1926) | 2024-11-06 13:46:04 +00:00
Byron Hsu | 96766101b4 | [rust] refactor server and router (#1922) | 2024-11-06 00:02:02 -08:00
Byron Hsu | 530ff541cf | [router] Impl radix tree and set up CI (#1893); Co-authored-by: Lianmin Zheng <lianminzheng@gmail.com> | 2024-11-04 10:56:52 -08:00