[SYCL] refactor (#6408)
* seperate lower precision GEMM from the main files * fix workgroup size hardcode
This commit is contained in:
1024
ggml-sycl/mmvq.cpp
Normal file
1024
ggml-sycl/mmvq.cpp
Normal file
File diff suppressed because it is too large
Load Diff
Reference in New Issue
Block a user