xc-llm-ascend

Files

starmountain1997 bfcc372f75 [CI] Add long and short prompt tests for DeepSeek-V3.2 (#6499 )

### What this PR does / why we need it?

This PR enhances the test_deepseek3_2_w8a8_pruning_mtp_tp2_ep E2E test
by adding both short and long prompt test cases:
- Short test: Validates basic functionality with minimal input ("Hello
")
- Long test: Validates the model can handle prompts near its maximum
context length (~163K tokens, approaching the max_position_embeddings
limit of 163,840)
Additionally, explicitly sets max_model_len=163840 to ensure the test
properly exercises the model's full context window capability.
### Does this PR introduce _any_ user-facing change?

No. This change only affects internal E2E testing infrastructure.  

### How was this patch tested?

The modified test case will be executed as part of the E2E test suite
and has been validated
[here](https://github.com/vllm-project/vllm-ascend/actions/runs/21620195055/job/62308026205?pr=6499).



- vLLM version: v0.15.0
- vLLM main: https://github.com/vllm-project/vllm/commit/v0.15.0

Signed-off-by: guozr <guozr1997@hotmail.com>
Co-authored-by: guozr <guozr1997@hotmail.com>

2026-02-04 09:10:50 +08:00

2-cards

[CI] Add long and short prompt tests for DeepSeek-V3.2 (#6499 )

2026-02-04 09:10:50 +08:00

4-cards

[E2E] add E2E for Prefix Caching cp & Chunked Prefill cp (#5149 )

2026-02-03 15:04:14 +08:00