[TEST]add a qwen3-30b acc case with mooncake mempool (#6244)
### What this PR does / why we need it?
This PR adds a case of qwen3-30b w8a8 with mooncake mempool, we need to
test it regual
### Does this PR introduce _any_ user-facing change?
No
### How was this patch tested?
by running the test
- vLLM version: v0.14.1
- vLLM main:
d68209402d
Signed-off-by: jiangyunfan1 <jiangyunfan1@h-partners.com>
This commit is contained in:
1
.github/workflows/misc/model_list.json
vendored
1
.github/workflows/misc/model_list.json
vendored
@@ -202,6 +202,7 @@
|
||||
"vllm-ascend/Qwen3-235B-A22B-W8A8",
|
||||
"vllm-ascend/Qwen3-235B-A22B-w8a8",
|
||||
"vllm-ascend/Qwen3-30B-A3B",
|
||||
"vllm-ascend/Qwen3-a3B_eagle3",
|
||||
"vllm-ascend/Qwen3-30B-A3B-Puring",
|
||||
"vllm-ascend/Qwen3-30B-A3B-W8A8",
|
||||
"vllm-ascend/Qwen3-30B-A3B-W8A8-Pruning",
|
||||
|
||||
Reference in New Issue
Block a user