---
license: apache-2.0
language:
- en
base_model: Shinzmann/naija-petro-8b
pipeline_tag: text-generation
tags:
- petroleum-engineering
- oil-and-gas
- nigeria
- qwen3
- gguf
- llama-cpp
- ollama
- quantized
---

# Naija-Petro 8B — GGUF

GGUF quantizations of [`Shinzmann/naija-petro-8b`](https://huggingface.co/Shinzmann/naija-petro-8b) for CPU/edge inference with [llama.cpp](https://github.com/ggerganov/llama.cpp), [Ollama](https://ollama.com), LM Studio, and compatible runtimes.

See the full [model card](https://huggingface.co/Shinzmann/naija-petro-8b) for training details, intended use, and limitations. For Nigeria-specific accuracy, use these weights with the [Naija-Petro RAG system](https://github.com/Mystique1337/naija-petro).

## Available quantizations

| File suffix | Method | Notes |
|---|---|---|
| `Q4_K_M` | 4-bit (k-quant, medium) | Best size/quality trade-off — recommended default |
| `Q8_0`   | 8-bit | Near-lossless; larger and slower |
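
To gauge whether a quant will fit on a given machine, a rough file-size estimate is parameters × bits per weight ÷ 8. The sketch below rests on assumptions that are not stated in this card: about 8.2 billion parameters (the figure published for Qwen3-8B), 8.5 bits per weight for `Q8_0` (32 int8 weights plus one fp16 scale per block), and roughly 4.85 bits per weight for `Q4_K_M`, which mixes tensor types and therefore varies by model. The function name is illustrative; the actual file sizes listed in this repository are authoritative.

```python
# Rough GGUF size estimate: parameters * bits-per-weight / 8.
# ASSUMPTIONS (not from this model card): ~8.2e9 parameters and the
# approximate average bits-per-weight values below.
APPROX_BITS_PER_WEIGHT = {
    "Q4_K_M": 4.85,  # approximate; k-quants mix tensor types
    "Q8_0": 8.5,     # 32 int8 weights + one fp16 scale per block
}


def estimate_size_gb(n_params: float, quant: str) -> float:
    """Return an approximate file size in GB (10**9 bytes)."""
    bits = APPROX_BITS_PER_WEIGHT[quant]
    return n_params * bits / 8 / 1e9


if __name__ == "__main__":
    for quant in APPROX_BITS_PER_WEIGHT:
        print(f"{quant}: ~{estimate_size_gb(8.2e9, quant):.1f} GB")
```

Runtime memory use will be higher than the file size, because the KV cache and compute buffers grow with the context length.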

## Usage

**Ollama**
```bash
ollama run hf.co/Shinzmann/naija-petro-8b-GGUF:Q4_K_M
```
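
Once the model has been pulled, Ollama also serves it over a local HTTP API, which is convenient for scripting. The following is a minimal sketch using only the Python standard library. It assumes the Ollama server is running at its default address, `http://localhost:11434`; the helper names `build_chat_payload` and `ask_ollama` are illustrative and not part of any library.

```python
import json
import urllib.request

MODEL = "hf.co/Shinzmann/naija-petro-8b-GGUF:Q4_K_M"
OLLAMA_CHAT_URL = "http://localhost:11434/api/chat"  # assumed default address


def build_chat_payload(question: str, model: str = MODEL) -> dict:
    """Build a non-streaming request body for Ollama's /api/chat endpoint."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": question}],
        "stream": False,  # ask for one JSON object instead of a stream
    }


def ask_ollama(question: str, url: str = OLLAMA_CHAT_URL) -> str:
    """POST the question to a running Ollama server and return the reply text."""
    body = json.dumps(build_chat_payload(question)).encode("utf-8")
    request = urllib.request.Request(
        url, data=body, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)["message"]["content"]
```

Calling `ask_ollama("How do you interpret a Horner plot?")` returns the reply as a string; it needs the server to be running and the model to have been pulled already.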

**llama.cpp**
```bash
# -hf fetches the chosen quant from Hugging Face on first run and caches it (no manual download needed)
./llama-cli -hf Shinzmann/naija-petro-8b-GGUF:Q4_K_M \
  -p "Explain the material balance equation for an undersaturated reservoir." \
  -c 4096
```

**Python (llama-cpp-python)**
```python
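# Requires: pip install llama-cpp-python huggingface-hub
from llama_cpp import Llama

# Download (and cache) the Q4_K_M file from the Hub, then load it.
llm = Llama.from_pretrained(
    repo_id="Shinzmann/naija-petro-8b-GGUF",
    filename="*Q4_K_M.gguf",  # glob pattern matched against files in the repo
    n_ctx=4096,               # context window in tokens
)

response = llm.create_chat_completion(
    messages=[
        {"role": "system", "content": "You are Naija-Petro, an expert petroleum-engineering AI assistant."},
        {"role": "user", "content": "How do you interpret a Horner plot?"},
    ]
)
print(response["choices"][0]["message"]["content"])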
```
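
For long answers it can be more pleasant to print text as it is generated. The following is a hedged sketch: `create_chat_completion` accepts `stream=True` and then yields chunks whose `choices[0]["delta"]` may carry a `content` fragment. The helpers `iter_text` and `stream_answer` are illustrative names, and `llm` is assumed to be a loaded `Llama` instance such as the one in the example above.

```python
from typing import Any, Dict, Iterable, Iterator


def iter_text(chunks: Iterable[Dict[str, Any]]) -> Iterator[str]:
    """Yield only the text fragments from streamed chat-completion chunks."""
    for chunk in chunks:
        delta = chunk["choices"][0].get("delta", {})
        text = delta.get("content")
        if text:  # role-only and final chunks carry no content
            yield text


def stream_answer(llm: Any, question: str) -> str:
    """Print the reply as it is generated and return the full text."""
    chunks = llm.create_chat_completion(
        messages=[{"role": "user", "content": question}],
        stream=True,
    )
    parts = []
    for text in iter_text(chunks):
        print(text, end="", flush=True)
        parts.append(text)
    print()
    return "".join(parts)
```

With the model loaded, `stream_answer(llm, "Explain the material balance equation for an undersaturated reservoir.")` prints the reply incrementally and returns the complete string.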

## License

Apache-2.0 (inherited from Qwen3-8B). Have qualified engineers validate model outputs before any operational use.