EngineX-Ascend / enginex-ascend-910-llama.cpp
Files at commit 116efee0eef09d8c3c4c60b52fa01b56ddeb432c
enginex-ascend-910-llama.cpp / ggml
Latest commit 116efee0ee by Ivan, 2024-09-24 02:14:24 +02:00:
cuda: add q8_0->f32 cpy operation (#9571)
llama: enable K-shift for quantized KV cache. It will fail on unsupported backends or quant types.
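The head commit (#9571) adds a CUDA copy (cpy) path from the Q8_0 quantized type to F32. The sketch below shows roughly what such a conversion involves, assuming ggml's Q8_0 layout (blocks of 32 signed 8-bit quants sharing one fp16 scale d, dequantized as x = d * q); the kernel name, struct, and launch shape are illustrative assumptions, not llama.cpp's actual implementation.

    // Hedged sketch of a Q8_0 -> F32 copy kernel. Assumes ggml's Q8_0
    // block layout; all names here are hypothetical, not llama.cpp's own.
    #include <cuda_fp16.h>
    #include <stdint.h>

    #define QK8_0 32

    typedef struct {
        __half d;          // per-block scale
        int8_t qs[QK8_0];  // 32 signed 8-bit quants
    } block_q8_0;

    __global__ void cpy_q8_0_to_f32(const block_q8_0 *src, float *dst, int n) {
        int i = blockIdx.x * blockDim.x + threadIdx.x;  // one thread per element
        if (i >= n) return;                             // n = total element count
        const block_q8_0 *b = &src[i / QK8_0];          // block owning element i
        dst[i] = __half2float(b->d) * (float) b->qs[i % QK8_0];
    }

The extended commit message suggests why this pairs with the K-shift change: shifting a quantized K cache plausibly requires reading its entries back as floats first, which is why the operation fails on backends or quant types that lack such a dequantizing copy.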
Name            Last commit                                                                Last updated
cmake           llama : reorganize source code + improve CMake (#8006)                     2024-06-26 18:33:02 +03:00
include         ggml/examples: add backend support for numerical optimization (ggml/949)   2024-09-20 21:15:05 +03:00
src             cuda: add q8_0->f32 cpy operation (#9571)                                  2024-09-24 02:14:24 +02:00
.gitignore      vulkan : cmake integration (#8119)                                         2024-07-13 18:12:39 +02:00
CMakeLists.txt  cmake : do not hide GGML options + rename option (#9465)                   2024-09-16 10:27:50 +03:00