EngineX/xc-llm-ascend
v0.18.0
xc-llm-ascend/vllm_ascend/spec_decode
wangbj127 9cc41c9457 [v0.18.0][Bugfix][EAGLE] Fix FIA pad bug under max concurrency (#7754)
Cherry-picked from https://github.com/vllm-project/vllm-ascend/pull/7740
Fixes padding problems of the FIA op under max concurrency.

- vLLM version: v0.18.0
- vLLM main: 35141a7eed

Signed-off-by: Wangbingjie <wangbj1207@126.com>
2026-03-29 12:23:44 +08:00
File | Last commit | Date
__init__.py | [feat][spec decode]Unified draft parallel (#6766) | 2026-03-13 14:07:35 +08:00
draft_proposer.py | [CI] Add pre-commit check for patch logger (#7446) | 2026-03-19 16:53:20 +08:00
eagle_proposer.py | [v0.18.0][Bugfix][EAGLE] Fix FIA pad bug under max concurrency (#7754) | 2026-03-29 12:23:44 +08:00
medusa_proposer.py | [CI] Add pre-commit check for patch logger (#7446) | 2026-03-19 16:53:20 +08:00
ngram_proposer.py | [Spec Decode]clean up spec decode interface (#6947) | 2026-03-05 14:30:10 +08:00
suffix_proposer.py | [Spec Decode]clean up spec decode interface (#6947) | 2026-03-05 14:30:10 +08:00