xc-llm-ascend/workflows at dceef080b140305841a98d71cd8cfdeccbd390af

EngineX/xc-llm-ascend

lbk-sys c611291661 [main] SP For Qwen3 MoE (#2209)

### What this PR does / why we need it?
This PR adds sequence parallelism (SP) support for Qwen3 MoE. In the
AlltoAll, AlltoAllv, and MC2 scenarios, AllReduce is replaced with
Reduce-Scatter and AllGather, which reduces the computation in norm
operations and saves one AllGather communication. The feature is enabled
during the prefill (P) phase and delivers notable gains in long-sequence
scenarios (e.g., 16k–25k), with performance improvements of 5%–10%.
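
The equivalence behind this change can be illustrated with a small single-process simulation. This is only a sketch under stated assumptions, not the code in this PR: ranks are modeled as plain Python lists, the helper names (`all_reduce`, `reduce_scatter`, `all_gather`, `rms_norm`) are hypothetical stand-ins for the real collectives, and the norm is a weightless RMSNorm. Because the norm acts on each token independently, normalizing each rank's sequence shard after Reduce-Scatter and then gathering gives the same result as normalizing the full sequence after AllReduce, while each rank normalizes only its own share of the tokens.

```python
import math


def all_reduce(per_rank):
    """Every rank receives the element-wise sum of all ranks' [tokens][hidden] tensors."""
    total = [[sum(vals) for vals in zip(*rows)] for rows in zip(*per_rank)]
    return [total for _ in per_rank]


def reduce_scatter(per_rank):
    """Rank r receives only its contiguous slice of tokens from the summed tensor.

    Assumes the token count is divisible by the number of ranks.
    """
    world = len(per_rank)
    total = all_reduce(per_rank)[0]
    chunk = len(total) // world
    return [total[r * chunk:(r + 1) * chunk] for r in range(world)]


def all_gather(shards):
    """Every rank receives the concatenation of all ranks' token shards."""
    full = [row for shard in shards for row in shard]
    return [full for _ in shards]


def rms_norm(rows, eps=1e-6):
    """Weightless RMSNorm applied to each token (row) independently."""
    out = []
    for row in rows:
        scale = 1.0 / math.sqrt(sum(x * x for x in row) / len(row) + eps)
        out.append([x * scale for x in row])
    return out


def norm_after_all_reduce(per_rank):
    """Baseline: AllReduce, then every rank normalizes the full sequence."""
    return [rms_norm(full) for full in all_reduce(per_rank)]


def norm_with_sequence_parallel(per_rank):
    """SP variant: Reduce-Scatter, normalize the local shard, then AllGather."""
    shards = [rms_norm(shard) for shard in reduce_scatter(per_rank)]
    return all_gather(shards)


if __name__ == "__main__":
    # Two ranks, four tokens, hidden size two: each rank holds a partial sum.
    partials = [
        [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0], [7.0, 8.0]],
        [[0.5, 1.5], [2.5, 3.5], [4.5, 5.5], [6.5, 7.5]],
    ]
    assert norm_after_all_reduce(partials) == norm_with_sequence_parallel(partials)
```

In the baseline every rank normalizes all four tokens, whereas in the SP variant each of the two ranks normalizes two tokens. The sketch does not model communication cost or the MoE dispatch itself.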
### Does this PR introduce _any_ user-facing change?

### How was this patch tested?
```
# Settings used for testing: the SP pass enabled together with expert parallelism
compilation_config={
    "pass_config": {
        "enable_sequence_parallelism": True
    }
},
enable_expert_parallel=True,
```

- vLLM version: v0.10.0
- vLLM main: 9edd1db02b

---------

Signed-off-by: libaokui <libaokui@huawei.com>
Co-authored-by: libaokui <libaokui@huawei.com>

2025-08-07 09:15:49 +08:00

| Name | Last commit | Last updated |
| --- | --- | --- |
| .. | [Core] Init vllm-ascend (#3) | 2025-02-05 10:53:12 +08:00 |
| accuracy_test.yaml | Enable pytest and yaml style accuracy test (#2073) | 2025-07-31 21:39:13 +08:00 |
| format_pr_body.yaml | Use ci_vllm_version when recording vLLM commit (#1689) | 2025-07-10 11:07:27 +08:00 |
| image_310p_openeuler.yml | Enable image push CI for build file and csrc has changes (#1977) | 2025-07-24 21:19:41 +08:00 |
| image_310p_ubuntu.yml | Enable image push CI for build file and csrc has changes (#1977) | 2025-07-24 21:19:41 +08:00 |
| image_a3_openeuler.yml | Enable image push CI for build file and csrc has changes (#1977) | 2025-07-24 21:19:41 +08:00 |
| image_a3_ubuntu.yml | Enable image push CI for build file and csrc has changes (#1977) | 2025-07-24 21:19:41 +08:00 |
| image_openeuler.yml | Enable image push CI for build file and csrc has changes (#1977) | 2025-07-24 21:19:41 +08:00 |
| image_ubuntu.yml | Enable image push CI for build file and csrc has changes (#1977) | 2025-07-24 21:19:41 +08:00 |
| label_merge_conflict.yml | [CI] Add merge conflict label job (#1050) | 2025-06-03 17:32:31 +08:00 |
| labeler.yml | [CI] Add dependabot support and labeler workflow (#162) | 2025-02-27 19:46:31 +08:00 |
| nightly_benchmarks.yaml | [CI/Build] Upgrade CANN to 8.2.RC1 (#1653) | 2025-07-26 22:37:46 +08:00 |
| pre-commit.yml | bump default python version to 3.11 (#2072) | 2025-07-29 19:07:17 +08:00 |
| release_code.yml | bump default python version to 3.11 (#2072) | 2025-07-29 19:07:17 +08:00 |
| release_whl.yml | Bump torch version to 2.7.1 (#1562) | 2025-08-05 08:43:24 +08:00 |
| reminder_comment.yml | [misc] Add reminder comment when PR submitted (#2092) | 2025-07-30 10:14:33 +08:00 |
| vllm_ascend_doctest.yaml | [CI] Enable linux-aarch64-a2 (64GB) and tp2 * 2 max-parallel to speed up CI (#2065) | 2025-07-29 18:59:05 +08:00 |
| vllm_ascend_test_310p.yaml | [CI] Update image for 310p ci (#2155) | 2025-08-02 16:46:02 +08:00 |
| vllm_ascend_test_long_term.yaml | [CI] Enable linux-aarch64-a2 (64GB) and tp2 * 2 max-parallel to speed up CI (#2065) | 2025-07-29 18:59:05 +08:00 |
| vllm_ascend_test_pd.yaml | [CI/Build] Upgrade CANN to 8.2.RC1 (#1653) | 2025-07-26 22:37:46 +08:00 |
| vllm_ascend_test.yaml | [main] SP For Qwen3 MoE (#2209) | 2025-08-07 09:15:49 +08:00 |