[CI] Ds32 ep aime2025 (#8496)
Backport of #7882 to releases/v0.18.0. Adds aime2025 benchmark test for DeepSeek-V3.2-W8A8 EP with disaggregated prefill on A3 (4-node, 16 NPUs per node, accuracy benchmark baseline 66.67%). Signed-off-by: guozr <guozr1997@hotmail.com> Co-authored-by: guozr <guozr1997@hotmail.com>
This commit is contained in:
@@ -119,6 +119,9 @@ jobs:
|
||||
- name: multi-node-deepseek-v3.2-W8A8-EP
|
||||
config_file_path: DeepSeek-V3_2-W8A8-EP.yaml
|
||||
size: 4
|
||||
- name: multi-node-deepseek-v3.2-W8A8-EP-aime2025
|
||||
config_file_path: DeepSeek-V3_2-W8A8-EP-aime2025.yaml
|
||||
size: 4
|
||||
uses: ./.github/workflows/_e2e_nightly_multi_node.yaml
|
||||
with:
|
||||
soc_version: a3
|
||||
|
||||
Reference in New Issue
Block a user