EngineX/xc-llm-ascend
6c65dd891fb61017c077f4e55c19fe0eb03b662d
xc-llm-ascend/vllm_ascend/patch/worker/patch_common
Wang Kunpeng 4b3bd4f397 [main][bugfix] bugfix for minicpm models (#3527)
### What this PR does / why we need it?
Fixes a bug affecting the minicpm-2b and minicpm3-4b models.

- vLLM version: v0.11.0rc3
- vLLM main: https://github.com/vllm-project/vllm/commit/v0.11.0

Signed-off-by: Wang Kunpeng <1289706727@qq.com>
2025-10-19 11:00:55 +08:00
| File | Last commit | Date |
| --- | --- | --- |
| __init__.py | [main][bugfix] bugfix for minicpm models (#3527) | 2025-10-19 11:00:55 +08:00 |
| patch_attention_selector.py | [Feat] Supports Aclgraph for bge-m3 (#3171) | 2025-10-14 23:07:45 +08:00 |
| patch_distributed.py | [feat] support customized and separated hccl_buffer_size for process group initialization (#3073) | 2025-10-11 15:55:22 +08:00 |
| patch_logits.py | [Bugfix][LoRA][Patch] Fix the LoRA inference bug after upstream vLLM codebase changed (#2560) | 2025-08-28 10:40:51 +08:00 |
| patch_minicpm.py | [Model][MiniCPM] support MiniCPM (#645) | 2025-04-27 11:27:24 +08:00 |
| patch_multimodal_merge.py | [Bugfix]modify the enable range of _merge_multimodal_embeddings patch (#3360) | 2025-10-11 08:37:07 +08:00 |
| patch_roberta.py | [Feat] Supports Aclgraph for bge-m3 (#3171) | 2025-10-14 23:07:45 +08:00 |
| patch_triton.py | [2/N][Refactor][Qwen3-Next] remove redundant methods and patch methods in Qwen3NextGatedDeltaNet (#3082) | 2025-09-24 11:25:42 +08:00 |
| patch_weight_loader.py | Drop 0.10.2 (#3284) | 2025-10-09 10:28:38 +08:00 |