[CI] Patch torch.library.infer_schema for fused moe ops to fix CI (#854)

make sure pytorch infer_schema check is patched before some case which
using fused moe ops:
1. model register
2. quantization loading
3. fused moe ut

Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com>
This commit is contained in:
wangxiyuan
2025-05-14 19:49:09 +08:00
committed by GitHub
parent 508242425c
commit 68fb63428b
3 changed files with 11 additions and 0 deletions

View File

@@ -23,5 +23,9 @@ def register():
def register_model():
# fix pytorch schema check error, remove this line after pytorch
# is upgraded to 2.7.0
import vllm_ascend.patch.worker.patch_common.patch_utils # noqa: F401
from .models import register_model
register_model()