Logo
Explore Help
Register Sign In
EngineX/xc-llm-ascend
3
0
Fork 0
You've already forked xc-llm-ascend
Code Issues Pull Requests Projects Releases Wiki Activity
Files
14bd55f30c3a7aab092b1cde2ad589f6d6b16f3e
xc-llm-ascend/tests/e2e/models/configs/Qwen3-Next-80B-A3B-Instruct.yaml

16 lines
327 B
YAML
Raw Normal View History

[Test] Add e2e test and accuracy test for Qwen3-Next-80B-A3B-Instruct (#3450) ### What this PR does / why we need it? Add e2e test and accuracy test for Qwen3-Next-80B-A3B-Instruct ### How was this patch tested? accuracy test: https://github.com/vllm-project/vllm-ascend/actions/runs/18771221544/job/53556027634?pr=3450 ci test: https://github.com/vllm-project/vllm-ascend/actions/runs/18771221530/job/53556027614?pr=3450 <img width="1703" height="562" alt="image" src="https://github.com/user-attachments/assets/973b6cfa-8240-41e3-893a-5024ff8d0693" /> - vLLM version: v0.11.0rc3 - vLLM main: https://github.com/vllm-project/vllm/commit/v0.11.0 Signed-off-by: hfadzxy <starmoon_zhang@163.com>
2025-10-25 10:57:56 +08:00
model_name: "Qwen/Qwen3-Next-80B-A3B-Instruct"
hardware: "Atlas A2 Series"
model: "vllm"
tasks:
- name: "ceval-valid_accountant"
metrics:
- name: "acc,none"
value: 0.98
max_model_len: 4096
tensor_parallel_size: 4
gpu_memory_utilization: 0.7
enable_expert_parallel: True
enforce_eager: True
batch_size: 1
num_fewshot: 5
Reference in New Issue Copy Permalink
Powered by Gitea Version: 1.24.3 Page: 172ms Template: 1ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API