xc-llm-ascend

Files

yupeng 8d44ddacb0 [Test][LoRA] Add e2e test for base model inference (#6624)

### What this PR does / why we need it?

This PR adds an end-to-end test case that verifies base model inference
remains correct when LoRA is enabled. It guards against regressions now
that an issue with LoRA base model requests has been fixed. The new test
case calls `do_sample` with `lora_id=0` to target the base model and
asserts that the output matches the expected SQL queries.
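
The idea of targeting the base model with `lora_id=0` can be sketched
roughly as below. This is only an illustration, not the code in the test
file: `AdapterRequest`, `build_adapter_request`, `check_base_model_output`,
and the `generate` callable are hypothetical names, and the rule that id 0
means "no adapter" is an assumption based on the description above.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass(frozen=True)
class AdapterRequest:
    """Hypothetical stand-in for a LoRA request object."""
    name: str
    adapter_id: int
    path: str


def build_adapter_request(lora_id: int, lora_path: str) -> Optional[AdapterRequest]:
    # Assumption: id 0 selects the base model, so no adapter request is built.
    if lora_id == 0:
        return None
    return AdapterRequest(name=str(lora_id), adapter_id=lora_id, path=lora_path)


def check_base_model_output(
    generate: Callable[[str, Optional[AdapterRequest]], str],
    prompts: List[str],
    expected: List[str],
    lora_path: str,
) -> List[str]:
    # Run every prompt without an adapter and compare against the expected text.
    request = build_adapter_request(0, lora_path)
    outputs = [generate(prompt, request) for prompt in prompts]
    assert outputs == expected, f"base model output changed: {outputs!r}"
    return outputs
```

In this sketch `generate` would wrap whatever engine call the test uses, so
the same check could be reused with a non-zero id to exercise an adapter.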

### Does this PR introduce _any_ user-facing change?

No.

### How was this patch tested?

CI passed with the new test case. The test can be run with:
```bash
pytest -sv tests/e2e/singlecard/test_llama32_lora.py
```

Signed-off-by: paulyu12 <507435917@qq.com>

2026-02-09 21:06:49 +08:00

e2e

[Test][LoRA] Add e2e test for base model inference (#6624)

2026-02-09 21:06:49 +08:00

[Feat](sfa,dcp) support dcp for sfa (#6563)

2026-02-09 18:52:25 +08:00

__init__.py

[SpecDecode] Add spec decode support (#500)

2025-04-17 20:16:32 +08:00