This website requires JavaScript.
Explore
Help
Register
Sign In
EngineX-Ascend
/
enginex-ascend-910-llama.cpp
Watch
10
Star
0
Fork
0
You've already forked enginex-ascend-910-llama.cpp
Code
Issues
Pull Requests
Actions
4
Projects
Releases
Wiki
Activity
Files
3cd3a395323fa9cdf6ecfa1fea290bf228d4e856
enginex-ascend-910-llama.cpp
/
gguf-py
/
gguf
History
Xuan-Son Nguyen
fbdfefe74e
llama : gemma3 : use output tensor if it exists in model weight (
#12506
)
...
* llama : gemma3 : use output tensor if it exists in model weight * also add to the llm_tensor_names
2025-03-22 23:28:19 +01:00
..
scripts
Refactor gguf scripts to improve metadata handling (
#11909
)
2025-02-26 08:04:48 -05:00
__init__.py
convert-*.py: GGUF Naming Convention Refactor and Metadata Override Refactor (
#7499
)
2024-07-18 20:40:15 +10:00
constants.py
llama : gemma3 : use output tensor if it exists in model weight (
#12506
)
2025-03-22 23:28:19 +01:00
gguf_reader.py
Refactor gguf scripts to improve metadata handling (
#11909
)
2025-02-26 08:04:48 -05:00
gguf_writer.py
llama: Add support for RWKV v7 architecture (
#12412
)
2025-03-18 07:27:50 +08:00
gguf.py
gguf-py: Refactor and allow reading/modifying existing GGUF files (
#3981
)
2023-11-11 08:04:50 +03:00
lazy.py
gguf-py : simplify support for quant types (
#8838
)
2024-08-08 13:33:09 -04:00
metadata.py
convert : fix Norway problem when parsing YAML (
#12114
)
2025-02-28 17:44:46 +01:00
py.typed
convert : various script cleanups/fixes + merges and special token handling (
#2842
)
2023-08-30 11:25:50 +03:00
quants.py
ggml-quants : ternary packing for TriLMs and BitNet b1.58 (
#8151
)
2024-09-05 21:48:47 -04:00
tensor_mapping.py
llama: Add support for RWKV v7 architecture (
#12412
)
2025-03-18 07:27:50 +08:00
utility.py
repo : update links to new url (
#11886
)
2025-02-15 16:40:57 +02:00
vocab.py
convert : Support chat_template.json (
#12460
)
2025-03-19 08:58:13 +01:00