xc-llm-ascend/e2e at 2a9d02e08039749cf811c5fb190d4c8a950d792d - xc-llm-ascend - Gitea: Git with a cup of tea

EngineX/xc-llm-ascend

Files

History

Icey 2a9d02e080 [Bugfix] eagle and eagle3 spec decode failures and enable e2e test (#2979 )

### What this PR does / why we need it?
- Fix the bug https://github.com/vllm-project/vllm-ascend/issues/2978
- Enable e2e test,
- Adapt to scenarios where Speculative tokens are greater than 2,
- Fix the bug that causes Eagle3 inference failures under high
concurrency and improve the acceptance rate of draft models, by
https://github.com/vllm-project/vllm-ascend/pull/2794

### Does this PR introduce _any_ user-facing change?

### How was this patch tested?
CI passed with new added/existing test.

Co-authored-by: hukongyi
[hukongyi@cmbchina.com](mailto:hukongyi@cmbchina.com)
Co-authored-by: guanyuzhu
[zhuguanyu@huawei.com](mailto:zhuguanyu@huawei.com)
Co-authored-by: liumail680
[liumail680@163.com](mailto:liumail680@163.com)


- vLLM version: v0.10.2
- vLLM main:
f225ea7dd9

---------

Signed-off-by: Icey <1790571317@qq.com>

2025-09-25 14:39:12 +08:00

..

Refactor e2e CI (#2276 )

2025-09-02 09:02:22 +08:00

Increase doctest timeout to 300s and time print (#3041 )

2025-09-19 20:26:00 +08:00

[Test] Update the format of the accuracy report (#3081 )

2025-09-22 14:10:03 +08:00

[Feature] Support moe multi-stream for aclgraph. (#2946 )

2025-09-19 11:06:45 +08:00

Fix VLLM_ASCEND_LLMDD_RPC_PORT renaming (#3108 )

2025-09-23 10:33:04 +08:00

[CI/UT] Add test for chunk prefill and prefix cache on v1/AscendScheduler (#1505 )

2025-07-02 16:57:03 +08:00

[Bugfix] eagle and eagle3 spec decode failures and enable e2e test (#2979 )

2025-09-25 14:39:12 +08:00

Add OOT platform E2E test case to be run in the vllm buildkite pipeline (#3154 )

2025-09-24 17:55:58 +08:00

__init__.py

[Test] Clean up duplicate test for ascend scheduler (#1819 )

2025-07-16 17:57:48 +08:00

common.sh

Increase doctest timeout to 300s and time print (#3041 )

2025-09-19 20:26:00 +08:00

conftest.py

[CI] Upgrade vLLM to 20250920 (c60e613) and address config break (#3067 )

2025-09-21 09:49:17 +08:00

model_utils.py

[CI] Update vllm version to 20250922(5aeb925) (#3091 )

2025-09-22 22:18:13 +08:00

run_disagg_pd.sh

[CI] Fix PD job (#1129 )

2025-06-09 16:34:41 +08:00

run_doctests.sh

Increase doctest timeout to 300s and time print (#3041 )

2025-09-19 20:26:00 +08:00

utils.py

[Test] Remove VLLM_USE_V1 in example and tests (#1733 )

2025-07-15 12:49:57 +08:00