This website requires JavaScript.
Explore
Help
Register
Sign In
EngineX-Ascend
/
enginex-ascend-910-llama.cpp
Watch
10
Star
0
Fork
0
You've already forked enginex-ascend-910-llama.cpp
Code
Issues
Pull Requests
Actions
4
Projects
Releases
Wiki
Activity
2,968
Commits
1
Branch
1
Tag
38c03478a37e460ecd3a21155b338a83bfed7f90
Commit Graph
3 Commits
Author
SHA1
Message
Date
Johannes Gäßler
38c03478a3
CUDA: fix FA out-of-bounds writes (
#7465
)
2024-05-22 17:58:25 +02:00
Johannes Gäßler
133d99c599
CUDA: deduplicate FlashAttention code (
#7352
)
2024-05-18 12:36:25 +02:00
Johannes Gäßler
0fc1e820a9
CUDA: faster large batch FA without tensor cores (
#7314
)
2024-05-17 18:54:52 +02:00