Update README.md

This commit is contained in:
SB
2026-01-02 09:43:29 +00:00
committed by system
parent c6de86d960
commit 3c2f21f41d

View File

@@ -58,7 +58,7 @@ This repo contains GGUF format model files for [SpeakLeash](https://speakleash.o
* **q5_k_m:** Uses Q6_K for half of the attention.wv and feed_forward.w2 tensors, else Q5_K
* **q6_k:** Uses Q8_K for all tensors
* **q8_0:** Almost indistinguishable from float16. High resource use and slow. Not recommended for most users.
* **16bit:** Converted fp16 to GGUF format.
* **16bit:** Converted to FP16 and BF16 GGUF format.
### Ollama Modfile
The GGUF file can be used with [Ollama](https://ollama.com/). To do this, you need to import the model using the configuration defined in the Modfile. For model eg. Bielik-11B-v3.0-Instruct.Q4_K_M.gguf (full path to model location) Modfile looks like: