Logo
Explore Help
Register Sign In
EngineX/xc-llm-ascend
3
0
Fork 0
You've already forked xc-llm-ascend
Code Issues Pull Requests Projects Releases Wiki Activity
Files
16c3b0b822b5894f749ad79d90cd693ed39588e8
xc-llm-ascend/vllm_ascend/attention
History
wangxiyuan 16c3b0b822 Revert "[Refactor][EAGLE] 8/N delete mtp_proposer" (#7030)
Reverts vllm-project/vllm-ascend#7016
It breaks E2E test
- vLLM version: v0.16.0
- vLLM main:
4034c3d32e
2026-03-06 11:24:05 +08:00
..
context_parallel
[BugFix] [dcp] Fix GQA Model Error when Enable both DP and DCP (#7012)
2026-03-05 16:51:08 +08:00
__init__.py
[Core] Make V1 work and enable V1 engine test (#389)
2025-03-28 19:34:23 +08:00
attention_mask.py
[Lint]Style: Convert vllm-ascend/ to ruff format(Batch #2) (#5977)
2026-01-19 08:59:46 +08:00
attention_v1.py
Revert "[Refactor][EAGLE] 8/N delete mtp_proposer" (#7030)
2026-03-06 11:24:05 +08:00
mla_v1.py
[Refact]Refact MLA/SFA weight prefetch to consist with moe weight prefetch (#6629)
2026-02-10 14:14:37 +08:00
sfa_v1.py
[perf][refactor] Refactor and optimize sfa_v1.py for dsv3.2/glm5 (#6874)
2026-03-05 14:27:11 +08:00
utils.py
[Feat] support basic pcp&dcp for qwen3next (#6091)
2026-02-28 21:44:08 +08:00
Powered by Gitea Version: 1.24.3 Page: 170ms Template: 7ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API