enginex-ascend-910-llama.cpp/fattn.cuh at 5143fa895e7725c5bd2135daf7d8f793d98fa91c - enginex-ascend-910-llama.cpp - Gitea: Git with a cup of tea

EngineX-Ascend/enginex-ascend-910-llama.cpp

Files

Johannes Gäßler 13aeb7aef2 CUDA: refactor FA support/selection code (#15454 )

2025-08-20 23:14:14 +02:00

6 lines

185 B

Plaintext

Raw Blame History

 #include "common.cuh"
 void ggml_cuda_flash_attn_ext(ggml_backend_cuda_context & ctx, ggml_tensor * dst);
 bool ggml_cuda_flash_attn_ext_supported(int device, const ggml_tensor * dst);