Files
enginex-ascend-910-llama.cpp/ggml/src/ggml-cuda
Daniele 0a423800ff CUDA: revert part of the RDNA1 optimizations (#8309)
The change on the launch_bounds was causing a small performance drop in perplexity of 25 t/s
2024-07-05 09:06:09 +02:00
..
2024-07-04 01:02:58 +02:00