xc-llm-ascend

Files

Pleaplusone 0e43813120 [ModelRunner] Use shared CachedRequestData cross request to fix ci (#1546 )

### What this PR does / why we need it?

This PR (adapted from
2863befce3)
updates the CachedRequestData definition to use a single instance shared
across all requests in a batch, instead of creating a new instance per
request.

Found ci boken by the vllm's model_runner change: `ERROR 07-01 09:53:53
[core.py:521] TypeError: 'CachedRequestData' object is not iterable`,
Modify the model_runner to fix it.


### Does this PR introduce _any_ user-facing change?
No

### How was this patch tested?
pass ci will verify this.

---------

Signed-off-by: ganyi <pleaplusone.gy@gmail.com>
Signed-off-by: Yikun Jiang <yikunkero@gmail.com>
Co-authored-by: Yikun Jiang <yikunkero@gmail.com>

2025-07-02 06:05:21 +08:00

__init__.py

[CI] Add unit test framework (#1201 )

2025-06-16 18:32:28 +08:00

test_ascend_scheduler_e2e.py

[CI] Add unit test framework (#1201 )

2025-06-16 18:32:28 +08:00

test_ascend_scheduler.py

[ModelRunner] Use shared CachedRequestData cross request to fix ci (#1546 )

2025-07-02 06:05:21 +08:00