This website requires JavaScript.
Explore
Help
Register
Sign In
EngineX-Ascend
/
enginex-ascend-910-llama.cpp
Watch
10
Star
0
Fork
0
You've already forked enginex-ascend-910-llama.cpp
Code
Issues
Pull Requests
Actions
4
Projects
Releases
Wiki
Activity
6,169
Commits
1
Branch
1
Tag
df36bce667bf14f8e538645547754386f9516326
Commit Graph
2 Commits
Author
SHA1
Message
Date
Johannes Gäßler
0cf6725e9f
CUDA: FA support for Deepseek (Ampere or newer) (
#13306
)
...
* CUDA: FA support for Deepseek (Ampere or newer) * do loop unrolling via C++ template
2025-05-09 13:34:58 +02:00
Johannes Gäßler
5fa07c2f93
CUDA: optimize FA for GQA + large batches (
#12014
)
2025-02-22 12:20:17 +01:00