- Fix world size bug in model_runner to make sure ep>16 runs with MC2
- enable e2e test for vl
Co-Authored-By: whx-sjtu <2952154980@qq.com>
Co-Authored-By: Icey <1790571317@qq.com>
- vLLM version: v0.10.2
- vLLM main:
3e903b6cb4
Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com>