Logo
Explore Help
Register Sign In
EngineX/xc-llm-ascend
3
0
Fork 0
You've already forked xc-llm-ascend
Code Issues Pull Requests Projects Releases Wiki Activity
Files
e3eefdecbd4aa8c2f621eadc51c23121e3b04509
xc-llm-ascend/vllm_ascend/worker
History
wangxiyuan 4e3919e965 Reapply "[Refactor] Unify full-graph parameter update logic (#6041)" (#6227) (#6231)
This reverts commit 95649344aa.

The CI failure doesn't related to this change. Let's reapply it.

- vLLM version: v0.14.0
- vLLM main:
d68209402d
2026-01-26 09:04:54 +08:00
..
v2
model runner v2 support triton of penalty (#5854)
2026-01-20 12:26:05 +00:00
__init__.py
[Misc][V0 Deprecation] Remove Cache Engine Used for V0 Worker (#1878)
2025-07-19 09:42:32 +08:00
block_table.py
[feature] support pcp + mtp in full graph (#4572)
2025-12-22 16:13:39 +08:00
model_runner_v1.py
Reapply "[Refactor] Unify full-graph parameter update logic (#6041)" (#6227) (#6231)
2026-01-26 09:04:54 +08:00
npu_input_batch.py
Drop 0.12.0 support (#5146)
2025-12-20 09:38:53 +08:00
pcp_utils.py
[bugfix] fix the complex and potentially problematic generate_kv_idx. (#5957)
2026-01-21 14:21:02 +08:00
worker.py
[Worker] Implement update max_model_len interface for NPUWorker (#6193)
2026-01-26 09:03:33 +08:00
Powered by Gitea Version: 1.24.3 Page: 158ms Template: 7ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API