xc-llm-ascend

Files

Wang Kunpeng 4b3bd4f397 [main][bugfix] bugfix for minicpm models (#3527 )

### What this PR does / why we need it?
bugfix for minicpm-2b and minicpm3-4b

- vLLM version: v0.11.0rc3
- vLLM main: https://github.com/vllm-project/vllm/commit/v0.11.0

Signed-off-by: Wang Kunpeng <1289706727@qq.com>

2025-10-19 11:00:55 +08:00

__init__.py

[main][bugfix] bugfix for minicpm models (#3527 )

2025-10-19 11:00:55 +08:00

patch_attention_selector.py

[Feat] Supports Aclgraph for bge-m3 (#3171 )

2025-10-14 23:07:45 +08:00

patch_distributed.py

[feat] support customized and separated hccl_buffer_size for process group initialization (#3073 )

2025-10-11 15:55:22 +08:00

patch_logits.py

[Bugfix][LoRA][Patch] Fix the LoRA inference bug after upstream vLLM codebase changed (#2560 )

2025-08-28 10:40:51 +08:00

patch_minicpm.py

[Model][MiniCPM] support MiniCPM (#645 )

2025-04-27 11:27:24 +08:00

patch_multimodal_merge.py

[Bugfix]modify the enable range of _merge_multimodal_embeddings patch (#3360 )

2025-10-11 08:37:07 +08:00

patch_roberta.py

[Feat] Supports Aclgraph for bge-m3 (#3171 )

2025-10-14 23:07:45 +08:00

patch_triton.py

[2/N][Refactor][Qwen3-Next] remove redundant methods and patch methods in Qwen3NextGatedDeltaNet (#3082 )

2025-09-24 11:25:42 +08:00

patch_weight_loader.py

Drop 0.10.2 (#3284 )

2025-10-09 10:28:38 +08:00