enginex-ascend-910-llama.cpp/ggml at 7538246e7ce0606694c38055cc2fc9f60535be6c

EngineX-Ascend/enginex-ascend-910-llama.cpp

Sigbjørn Skjæret 7538246e7c cuda : add f32 to bf16 copy op (#12806)

This allows BF16 KV-cache on CUDA.

2025-04-08 23:21:31 +02:00

scripts : update sync + fix cmake merge

2025-03-27 10:09:29 +02:00

metal : improve FA + improve MoE (#12612)

2025-03-28 20:21:59 +02:00

cuda : add f32 to bf16 copy op (#12806)

2025-04-08 23:21:31 +02:00

.gitignore

vulkan : cmake integration (#8119)

2024-07-13 18:12:39 +02:00

CMakeLists.txt

ggml : add logging for native build options/vars (whisper/2935)

2025-03-30 08:33:31 +03:00