enginex-ascend-910-llama.cpp/ggml/src/ggml-cuda
R0CKSTAR 716301d1b0 musa: enable fp16 mma (all) and cublas on qy2 (#13842)
* musa: enable fp16 mma (all) and cublas on qy2

Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>

* Update ggml/src/ggml-cuda/ggml-cuda.cu

Co-authored-by: Johannes Gäßler <johannesg@5d6.de>

* Address review comments

Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>

* Address review comments

Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>

* musa: disable MUL_MAT_ID (q2_k × f32) due to precision issues

Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>

---------

Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
Co-authored-by: Johannes Gäßler <johannesg@5d6.de>
2025-06-26 12:11:59 +08:00