Logo
Explore Help
Register Sign In
EngineX/xc-llm-ascend
3
0
Fork 0
You've already forked xc-llm-ascend
Code Issues Pull Requests Actions Projects Releases Wiki Activity
Files
6b290acfe109bdfd9225a6c06a89f2dcba7a4156
xc-llm-ascend/vllm_ascend/patch/worker/patch_common
History
Wang Kunpeng 4b3bd4f397 [main][bugfix] bugfix for minicpm models (#3527)
### What this PR does / why we need it?
bugfix for minicpm-2b and minicpm3-4b

- vLLM version: v0.11.0rc3
- vLLM main: https://github.com/vllm-project/vllm/commit/v0.11.0

Signed-off-by: Wang Kunpeng <1289706727@qq.com>
2025-10-19 11:00:55 +08:00
..
__init__.py
[main][bugfix] bugfix for minicpm models (#3527)
2025-10-19 11:00:55 +08:00
patch_attention_selector.py
[Feat] Supports Aclgraph for bge-m3 (#3171)
2025-10-14 23:07:45 +08:00
patch_distributed.py
[feat] support customized and separated hccl_buffer_size for process group initialization (#3073)
2025-10-11 15:55:22 +08:00
patch_logits.py
[Bugfix][LoRA][Patch] Fix the LoRA inference bug after upstream vLLM codebase changed (#2560)
2025-08-28 10:40:51 +08:00
patch_minicpm.py
[Model][MiniCPM] support MiniCPM (#645)
2025-04-27 11:27:24 +08:00
patch_multimodal_merge.py
[Bugfix]modify the enable range of _merge_multimodal_embeddings patch (#3360)
2025-10-11 08:37:07 +08:00
patch_roberta.py
[Feat] Supports Aclgraph for bge-m3 (#3171)
2025-10-14 23:07:45 +08:00
patch_triton.py
[2/N][Refactor][Qwen3-Next] remove redundant methods and patch methods in Qwen3NextGatedDeltaNet (#3082)
2025-09-24 11:25:42 +08:00
patch_weight_loader.py
Drop 0.10.2 (#3284)
2025-10-09 10:28:38 +08:00
Powered by Gitea Version: 1.24.3 Page: 110ms Template: 7ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API