Go to file

ModelHub XC 0da4f955c7 初始化项目，由ModelHub XC社区提供模型

Model: Shinzmann/naija-petro-GGUF
Source: Original Platform

2026-06-11 05:14:16 +08:00

.gitattributes

初始化项目，由ModelHub XC社区提供模型

2026-06-11 05:14:16 +08:00

config.json

初始化项目，由ModelHub XC社区提供模型

2026-06-11 05:14:16 +08:00

Modelfile

初始化项目，由ModelHub XC社区提供模型

2026-06-11 05:14:16 +08:00

Qwen3-32B.BF16-00002-of-00002.gguf

初始化项目，由ModelHub XC社区提供模型

2026-06-11 05:14:16 +08:00

Qwen3-32B.Q4_K_M.gguf

初始化项目，由ModelHub XC社区提供模型

2026-06-11 05:14:16 +08:00

Qwen3-32B.Q5_K_M.gguf

初始化项目，由ModelHub XC社区提供模型

2026-06-11 05:14:16 +08:00

Qwen3-32B.Q8_0.gguf

初始化项目，由ModelHub XC社区提供模型

2026-06-11 05:14:16 +08:00

README.md

初始化项目，由ModelHub XC社区提供模型

2026-06-11 05:14:16 +08:00

README.md

license, language, base_model, pipeline_tag, tags

license

language

base_model

pipeline_tag

Naija-Petro (32B) — GGUF

GGUF quantizations of Shinzmann/naija-petro (the 32B variant) for inference with llama.cpp, Ollama, LM Studio, and compatible runtimes.

See the full model card for training details, intended use, and limitations. For a lighter footprint, consider the 8B GGUF. For Nigeria-specific accuracy, use these weights with the Naija-Petro RAG system.

Available quantizations

File suffix	Method	Notes
`Q4_K_M`	4-bit (k-quant, medium)	Smallest; recommended default for 32B on limited RAM/VRAM
`Q5_K_M`	5-bit (k-quant, medium)	Higher quality, larger
`Q8_0`	8-bit	Near-lossless; largest and slowest

32B GGUF files are large. Q4_K_M is the practical choice for most machines; ensure you have enough RAM/VRAM + disk for the chosen quant.

Usage

Ollama

ollama run hf.co/Shinzmann/naija-petro-GGUF:Q4_K_M

llama.cpp

./llama-cli -hf Shinzmann/naija-petro-GGUF:Q4_K_M \
  -p "What are the screening criteria for CO2 EOR?" -c 4096

License

Apache-2.0 (inherited from Qwen3-32B). Validate outputs with qualified engineers before operational use.