Gemmasutra-Mini-2B-v1-GGUF/README.md


---
license: mit
language:
- en
pipeline_tag: text-generation
---

My own (ZeroWw) quantizations.
output and embed tensors quantized to f16.
all other tensors quantized to q5_k or q6_k.

Result:
both f16.q6 and f16.q5 are smaller than q8_0 standard quantization
and they perform as well as the pure f16.

Updated on: Sat Aug 03, 17:55:22
初始化项目，由ModelHub XC社区提供模型 Model: ZeroWw/Gemmasutra-Mini-2B-v1-GGUF Source: Original Platform 2026-05-10 14:40:46 +08:00
			`---`
			`license: mit`
			`language:`
			`- en`
			`pipeline_tag: text-generation`
			`---`

			`My own (ZeroWw) quantizations.`
			`output and embed tensors quantized to f16.`
			`all other tensors quantized to q5_k or q6_k.`

			`Result:`
			`both f16.q6 and f16.q5 are smaller than q8_0 standard quantization`
			`and they perform as well as the pure f16.`

			`Updated on: Sat Aug 03, 17:55:22`