Lifu Huang
|
4474eaf552
|
Support LoRA in TestOpenAIVisionServer and fix fused kv_proj loading bug. (#6861)
|
2025-06-04 22:08:30 -07:00 |
|
Lianmin Zheng
|
2d72fc47cf
|
Improve profiler and integrate profiler in bench_one_batch_server (#6787)
|
2025-05-31 15:53:55 -07:00 |
|
Chang Su
|
4685fbb888
|
[VLM] Support chunk prefill for VLM (#6355)
Co-authored-by: yizhang2077 <1109276519@qq.com>
|
2025-05-22 20:32:41 -07:00 |
|
fzyzcjy
|
f11481b921
|
Add 4-GPU runner tests and split existing tests (#6383)
|
2025-05-18 11:56:51 -07:00 |
|