Files
xc-llm-ascend/vllm_ascend/worker/v2
shiyuan680 3b6b3c4214 [MODELRUNNERV2]fix penality ops (#7013)
### What this PR does / why we need it?
fix penality ops for new version, and achieved a 10% performance
improvement

### How was this patch tested?
pytest
‎tests/e2e/nightly/single_node/ops/singlecard_ops/triton/test_penality.py
- vLLM version: v0.16.0
- vLLM main:
15d76f74e2

Signed-off-by: shiyuan680 <917935075@qq.com>
2026-03-11 17:13:34 +08:00
..

[Experimental] Model Runner V2

This directory contains the new model runner which is under active development.

please see Model Runner V2 to get specific plans.

supported vllm version: main@1339784