Update README.md
@@ -58,7 +58,7 @@ This repo contains GGUF format model files for [SpeakLeash](https://speakleash.o
* **q5_k_m:** Uses Q6_K for half of the attention.wv and feed_forward.w2 tensors, else Q5_K
* **q6_k:** Uses Q8_K for all tensors
* **q8_0:** Almost indistinguishable from float16. High resource use and slow. Not recommended for most users.
-* **16bit:** Converted fp16 to GGUF format.
+* **16bit:** Converted to FP16 and BF16 GGUF format.
### Ollama Modfile
The GGUF file can be used with [Ollama](https://ollama.com/). To do this, you need to import the model using the configuration defined in the Modfile. For model eg. Bielik-11B-v3.0-Instruct.Q4_K_M.gguf (full path to model location) Modfile looks like:
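The diff hunk ends at this point, so the repository's own Modfile body is not shown. As a minimal sketch of the import step described above, the following Python helper builds a one-line file using Ollama's `FROM` instruction (Ollama's documentation calls this file a Modelfile) and assembles the matching `ollama create` command. The helper names, the model name `bielik-local`, and the `/path/to/` prefix are hypothetical, and any template or parameter settings the real README may define are not reproduced here.

```python
def build_modelfile(gguf_path: str) -> str:
    """Return minimal Modelfile text pointing Ollama at a local GGUF file."""
    # FROM names the weights file to import; a full path avoids
    # depending on the directory the command is run from.
    return f"FROM {gguf_path}\n"


def create_command(model_name: str, modelfile_path: str) -> list[str]:
    """Return the argv for registering the model with Ollama."""
    # Equivalent to running: ollama create <model_name> -f <modelfile_path>
    return ["ollama", "create", model_name, "-f", modelfile_path]


if __name__ == "__main__":
    # Hypothetical location; substitute the real path to the downloaded file.
    print(build_modelfile("/path/to/Bielik-11B-v3.0-Instruct.Q4_K_M.gguf"), end="")
    print(" ".join(create_command("bielik-local", "Modelfile")))
```

Saving the first printed line to a file named `Modelfile` and running the second printed line would register the model under the chosen name, assuming Ollama is installed and the GGUF path exists.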