xc-llm-ascend

Files

ZYang6263 34386c8896 [v0.18.0][CI] Fix and simplify the CI for Qwen3 32B (#8093 )

### What this PR does / why we need it?
This PR fixes and simplifies the CI configuration for Qwen3 32B.

The main changes are:
- Remove the redundant `Qwen3-32B-Int8-A3-Feature-Stack3.yaml` config
and consolidate the CI setup into `Qwen3-32B-Int8.yaml`.
- Improve runtime stability by adding
`PYTORCH_NPU_ALLOC_CONF=expandable_segments:True` and setting
`--max-num-seqs 80`.
- Update the accuracy benchmark from `aime2024` to `gsm8k-lite`, and
adjust the related dataset config, output length, baseline, and
threshold accordingly.

These changes make the Qwen3 32B CI easier to maintain and more stable
in nightly validation.

---------

Signed-off-by: ZYang6263 <zy626375@gmail.com>

2026-04-10 14:22:24 +08:00

ISSUE_TEMPLATE

[CI][lint] Add rule codespell back (#6236 )

2026-01-26 14:12:33 +08:00

workflows

[v0.18.0][CI] Fix and simplify the CI for Qwen3 32B (#8093 )

2026-04-10 14:22:24 +08:00

actionlint.yaml

[CI][Misc] Some improvement for github action (#6587 )

2026-02-06 14:06:27 +08:00

CODEOWNERS

[Community] Nominate whx-sjtu as maintainer (#6268 )