09682e075118aaacb0a717f2b7078bad040599a9
xc-llm-ascend/tests/e2e/models/configs/Qwen3-Next-80B-A3B-Instruct.yaml


[Test] Add e2e test and accuracy test for Qwen3-Next-80B-A3B-Instruct (#3450)

### What this PR does / why we need it?
Add e2e test and accuracy test for Qwen3-Next-80B-A3B-Instruct

### How was this patch tested?
accuracy test: https://github.com/vllm-project/vllm-ascend/actions/runs/18771221544/job/53556027634?pr=3450
ci test: https://github.com/vllm-project/vllm-ascend/actions/runs/18771221530/job/53556027614?pr=3450

<img width="1703" height="562" alt="image" src="https://github.com/user-attachments/assets/973b6cfa-8240-41e3-893a-5024ff8d0693" />

- vLLM version: v0.11.0rc3
- vLLM main: https://github.com/vllm-project/vllm/commit/v0.11.0

Signed-off-by: hfadzxy <starmoon_zhang@163.com>
2025-10-25 10:57:56 +08:00
model_name: "Qwen/Qwen3-Next-80B-A3B-Instruct"
hardware: "Atlas A2 Series"
model: "vllm"
tasks:
- name: "ceval-valid_accountant"
  metrics:
  - name: "acc,none"
    value: 0.98
max_model_len: 4096
tensor_parallel_size: 4
gpu_memory_utilization: 0.7
enable_expert_parallel: True
enforce_eager: True
batch_size: 1
num_fewshot: 5
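
The following is a minimal sketch of how a config like the one above might be consumed by an accuracy test. It is not the repository's actual test code. The helper names `build_model_args` and `failed_metrics`, the choice of which keys are forwarded to the engine, the comma-separated `key=value` argument string, and the 3% relative tolerance are all assumptions made for illustration. The config values are copied into a plain dict so the sketch runs without a YAML parser.

```python
import math

# Values copied from the YAML config above (hardware/model fields omitted).
CONFIG = {
    "model_name": "Qwen/Qwen3-Next-80B-A3B-Instruct",
    "tasks": [
        {
            "name": "ceval-valid_accountant",
            "metrics": [{"name": "acc,none", "value": 0.98}],
        },
    ],
    "max_model_len": 4096,
    "tensor_parallel_size": 4,
    "gpu_memory_utilization": 0.7,
    "enable_expert_parallel": True,
    "enforce_eager": True,
    "batch_size": 1,
    "num_fewshot": 5,
}

# Assumption: these keys are engine options; batch_size and num_fewshot
# are treated as evaluation options and are not forwarded here.
ENGINE_KEYS = (
    "max_model_len",
    "tensor_parallel_size",
    "gpu_memory_utilization",
    "enable_expert_parallel",
    "enforce_eager",
)


def build_model_args(config):
    """Hypothetical helper: join engine options into 'k=v,k=v' form."""
    parts = [f"pretrained={config['model_name']}"]
    parts += [f"{key}={config[key]}" for key in ENGINE_KEYS if key in config]
    return ",".join(parts)


def failed_metrics(config, measured, rtol=0.03):
    """Hypothetical helper: list metrics outside the relative tolerance.

    `measured` maps task name -> metric name -> measured value.
    The default rtol is an assumed value, not taken from the source.
    """
    failures = []
    for task in config["tasks"]:
        for metric in task["metrics"]:
            got = measured[task["name"]][metric["name"]]
            if not math.isclose(got, metric["value"], rel_tol=rtol):
                failures.append((task["name"], metric["name"], got))
    return failures
```

For example, `failed_metrics(CONFIG, {"ceval-valid_accountant": {"acc,none": 0.96}})` returns an empty list under the assumed tolerance, while a measured accuracy of 0.90 would be reported as a failure. The check here is symmetric around the expected value; a real harness might instead only fail on regressions below it.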