enginex-ascend-910-llama.cpp/ggml at 38355c6c8e43204e11a22daa7483082c0ff01e71 - enginex-ascend-910-llama.cpp - Gitea: Git with a cup of tea

EngineX-Ascend/enginex-ascend-910-llama.cpp

Files

History

Aman Gupta 38355c6c8e CUDA: use registers instead of smem in topk-moe (#16647 )

Uses the technique used in the vulkan PR #16641. Neat trick!

2025-10-18 11:52:53 +02:00

..

ggml: Skip backend library linking code when GGML_BACKEND_DL=ON (#15094 )

2025-08-07 13:45:41 +02:00

rpc : report actual free memory (#16616 )

2025-10-17 18:02:52 +03:00

CUDA: use registers instead of smem in topk-moe (#16647 )

2025-10-18 11:52:53 +02:00

.gitignore

vulkan : cmake integration (#8119 )

2024-07-13 18:12:39 +02:00

CMakeLists.txt

ggml webgpu: profiling, CI updates, reworking of command submission (#16452 )

2025-10-07 13:48:56 -07:00