Re-patch TritonPlaceholder on main to make CI happy (#753)

### What this PR does / why we need it?
Re-patch TritonPlaceholder on main to make CI happy
- Add triton patch back until
https://github.com/vllm-project/vllm/pull/17446 resolved
- Move patch_main before patch_common to resolve minicpm triton import
issue
- Add `0.8.5` and `0.8.5.post1` to make patch work on 0.8.5 all versions

Related:
- https://github.com/vllm-project/vllm-ascend/pull/704
- https://github.com/vllm-project/vllm-ascend/pull/690

### Does this PR introduce _any_ user-facing change?
No

### How was this patch tested?
All CI passed include main

Signed-off-by: Yikun Jiang <yikunkero@gmail.com>
This commit is contained in:
Yikun Jiang
2025-05-05 23:22:24 +08:00
committed by GitHub
parent d2ead057ae
commit d7e1110c8e
6 changed files with 92 additions and 5 deletions

View File

@@ -23,5 +23,9 @@ def register():
def register_model():
# TODO: fixme when TritonPlaceholder fixed
from vllm_ascend.utils import vllm_version_is
if not (vllm_version_is("0.8.5") or vllm_version_is("0.8.5.post1")):
import vllm_ascend.patch.worker.patch_main.patch_tritonplaceholder # noqa
from .models import register_model
register_model()