[CPU] [BF16] Call fused_experts_cpu, weight_packed_linear and bmm_cpu kernel in DeepSeek model (#6641)

Co-authored-by: Thien Tran <gau.nernst@yahoo.com.sg>
Chunyuan WU
2025-06-25 16:43:33 +08:00
committed by GitHub
parent bc2e5645c4
commit 7eb47b0f3d
9 changed files with 343 additions and 23 deletions

@@ -28,7 +28,7 @@ class Mod(nn.Module):
 class TestGemm(CustomTestCase):
     M = [1, 101]
-    N = [32 * 13]
+    N = [16, 32 * 13]
     K = [32 * 16]
     has_bias = [False, True]
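
A rough sketch of what the widened `N` list means for coverage, assuming the test sweeps the Cartesian product of these class attributes. The loop itself is not shown in this hunk, and `shape_grid` is a hypothetical helper introduced only for illustration:

```python
from itertools import product


def shape_grid(M, N, K, has_bias):
    """Return every (m, n, k, bias) tuple a product-style sweep would visit."""
    return list(product(M, N, K, has_bias))


# Parameter lists before and after the change in this hunk.
before = shape_grid([1, 101], [32 * 13], [32 * 16], [False, True])
after = shape_grid([1, 101], [16, 32 * 13], [32 * 16], [False, True])

# The sweep doubles in size, and every new case uses the small N = 16.
new_cases = [case for case in after if case not in before]
print(len(before), len(after), len(new_cases))
```

Under that assumption, the added value `16` introduces a narrow-output case for each existing `(M, K, has_bias)` combination without changing the cases already covered.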