xc-llm-ascend/config at 14bd55f30c3a7aab092b1cde2ad589f6d6b16f3e

EngineX/xc-llm-ascend

dsxsteven 325cb16e3f [BugFix][CI]Fix DeepSeek-R1-W8A8-longseq nightly CI (#6297)

### What this PR does / why we need it?
The precision issue arose because the KV cache on the P-node was not
fetched for an extended period (more than 6 minutes) and was therefore
forcibly freed. To avoid this, the PR reduces the batch size and
extends the timeout period.
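
The failure mode described above can be illustrated with a toy model. This is only a sketch under stated assumptions: `PrefillKVStore`, `timeout_s`, and `fetch_succeeds` are hypothetical names invented for illustration and are not vLLM or vllm-ascend APIs or config keys. Only the 6-minute (360 s) figure comes from the description; the other numbers are made up.

```python
from dataclasses import dataclass, field


@dataclass
class PrefillKVStore:
    """Toy model of a P-node that frees KV blocks nobody fetched in time."""

    timeout_s: float  # hypothetical knob, not a real vLLM option
    # request id -> time (seconds) at which the KV block was stored
    _blocks: dict = field(default_factory=dict)

    def put(self, request_id: str, now_s: float) -> None:
        self._blocks[request_id] = now_s

    def evict_expired(self, now_s: float) -> None:
        # Forcibly free every block that has waited longer than the timeout.
        expired = [r for r, t in self._blocks.items() if now_s - t > self.timeout_s]
        for r in expired:
            del self._blocks[r]

    def fetch(self, request_id: str, now_s: float) -> bool:
        """Return True if the block was still there when the fetch arrived."""
        self.evict_expired(now_s)
        return self._blocks.pop(request_id, None) is not None


def fetch_succeeds(timeout_s: float, fetch_delay_s: float) -> bool:
    """Store one block at t=0 and try to fetch it after fetch_delay_s seconds."""
    store = PrefillKVStore(timeout_s=timeout_s)
    store.put("req-0", now_s=0.0)
    return store.fetch("req-0", now_s=fetch_delay_s)
```

In this model, a fetch that arrives 400 s after the block was stored fails with a 360 s timeout and succeeds with a 600 s one. Reducing the batch size corresponds to shrinking `fetch_delay_s`, and extending the timeout corresponds to raising `timeout_s`.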
### Does this PR introduce _any_ user-facing change?

### How was this patch tested?

- vLLM version: v0.14.1
- vLLM main: dc917cceb8

Signed-off-by: dsxsteven <dsxsteven@sina.com>

2026-01-28 16:36:24 +08:00

| File | Last commit | Date |
| --- | --- | --- |
| DeepSeek-R1-W8A8-A2.yaml | Default enable MLAPO (#5952) | 2026-01-22 09:26:39 +08:00 |
| DeepSeek-R1-W8A8-EPLB.yaml | [EPLB]Eplb Config Renaming (#5533) | 2026-01-15 10:26:44 +08:00 |
| DeepSeek-R1-W8A8-longseq.yaml | [BugFix][CI]Fix DeepSeek-R1-W8A8-longseq nightly CI (#6297) | 2026-01-28 16:36:24 +08:00 |
| DeepSeek-R1-W8A8.yaml | [Refactor]Refactor of vllm_ascend/distributed module (#5719) | 2026-01-15 08:57:40 +08:00 |
| DeepSeek-V3_2-W8A8-A3-dual-nodes.yaml | [CI] Enable FLASHCOMM1 with layer_sharding and FULL_DECODE_ONLY in ds32 testing (#6115) | 2026-01-23 19:48:37 +08:00 |
| DeepSeek-V3.1-BF16.yaml | [CI] Add nightly ci test for deepseek v3.1 (#5386) | 2026-01-23 14:36:49 +08:00 |
| DeepSeek-V3.yaml | [Refactor]Refactor of vllm_ascend/distributed module (#5719) | 2026-01-15 08:57:40 +08:00 |
| Kimi-K2-Instruct-W8A8.yaml | [CI]Add Kimi k2 nightly test (#5682) | 2026-01-12 15:56:07 +08:00 |
| Qwen3-235B-A22B-A2.yaml | [CI] Align multi-node nightly test paramter with corresponding tutorials document (#5756) | 2026-01-12 09:00:31 +08:00 |
| Qwen3-235B-A22B.yaml | [CI] Align multi-node nightly test paramter with corresponding tutorials document (#5756) | 2026-01-12 09:00:31 +08:00 |
| Qwen3-235B-disagg-pd.yaml | [Refactor]Refactor of vllm_ascend/distributed module (#5719) | 2026-01-15 08:57:40 +08:00 |
| Qwen3-235B-W8A8-EPLB.yaml | [EPLB]Eplb Config Renaming (#5533) | 2026-01-15 10:26:44 +08:00 |
| Qwen3-235B-W8A8-longseq.yaml | [Refactor]Refactor of vllm_ascend/distributed module (#5719) | 2026-01-15 08:57:40 +08:00 |
| Qwen3-235B-W8A8.yaml | [Refactor]Refactor of vllm_ascend/distributed module (#5719) | 2026-01-15 08:57:40 +08:00 |
| Qwen3-VL-235B-disagg-pd.yaml | [Refactor]Refactor of vllm_ascend/distributed module (#5719) | 2026-01-15 08:57:40 +08:00 |