Logo
Explore Help
Register Sign In
EngineX/xc-llm-ascend
3
0
Fork 0
You've already forked xc-llm-ascend
Code Issues Pull Requests Projects Releases Wiki Activity
Files
a813eadd2d2fb5d4f6179fbed860aaebfe2b3db6
xc-llm-ascend/vllm_ascend/attention
History
wangxiyuan 16c3b0b822 Revert "[Refactor][EAGLE] 8/N delete mtp_proposer" (#7030)
Reverts vllm-project/vllm-ascend#7016
It breaks E2E test
- vLLM version: v0.16.0
- vLLM main:
4034c3d32e
2026-03-06 11:24:05 +08:00
..
context_parallel
[BugFix] [dcp] Fix GQA Model Error when Enable both DP and DCP (#7012)
2026-03-05 16:51:08 +08:00
__init__.py
[Core] Make V1 work and enable V1 engine test (#389)
2025-03-28 19:34:23 +08:00
attention_mask.py
[Lint]Style: Convert vllm-ascend/ to ruff format(Batch #2) (#5977)
2026-01-19 08:59:46 +08:00
attention_v1.py
Revert "[Refactor][EAGLE] 8/N delete mtp_proposer" (#7030)
2026-03-06 11:24:05 +08:00
mla_v1.py
[Refact]Refact MLA/SFA weight prefetch to consist with moe weight prefetch (#6629)
2026-02-10 14:14:37 +08:00
sfa_v1.py
[perf][refactor] Refactor and optimize sfa_v1.py for dsv3.2/glm5 (#6874)
2026-03-05 14:27:11 +08:00
utils.py
[Feat] support basic pcp&dcp for qwen3next (#6091)
2026-02-28 21:44:08 +08:00
Powered by Gitea Version: 1.24.3 Page: 210ms Template: 66ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API