---
license: apache-2.0
language:
- en
base_model: Shinzmann/naija-petro-8b
pipeline_tag: text-generation
tags:
- petroleum-engineering
- oil-and-gas
- nigeria
- qwen3
- gguf
- llama-cpp
- ollama
- quantized
---
# Naija-Petro 8B — GGUF
GGUF quantizations of [`Shinzmann/naija-petro-8b`](https://huggingface.co/Shinzmann/naija-petro-8b) for CPU/edge inference with [llama.cpp](https://github.com/ggerganov/llama.cpp), [Ollama](https://ollama.com), LM Studio, and compatible runtimes.
See the full [model card](https://huggingface.co/Shinzmann/naija-petro-8b) for training details, intended use, and limitations. For Nigeria-specific accuracy, use these weights with the [Naija-Petro RAG system](https://github.com/Mystique1337/naija-petro).
## Available quantizations
| File suffix | Method | Notes |
|---|---|---|
| `Q4_K_M` | 4-bit (k-quant, medium) | Best size/quality trade-off — recommended default |
| `Q8_0` | 8-bit | Near-lossless; larger and slower |
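
To gauge whether a quant fits in memory, file size can be approximated from bits per weight. The sketch below is a rough estimate, not a measurement of the files in this repository. The parameter count (about 8.2 billion, taken from the upstream Qwen3-8B card) and the average of about 4.85 bits per weight for `Q4_K_M` are assumptions. `Q8_0` stores 32 int8 weights plus one fp16 scale per block, which works out to 8.5 bits per weight.

```python
# Rough GGUF size estimate from bits per weight (bpw).
# ASSUMPTIONS: the parameter count and the Q4_K_M bpw are approximate values,
# not measured from the files in this repository.
N_PARAMS = 8.2e9  # assumed approximate parameter count of the Qwen3-8B base

BITS_PER_WEIGHT = {
    "Q4_K_M": 4.85,  # assumed typical average; depends on the tensor mix
    "Q8_0": 8.5,     # 32 int8 weights + one fp16 scale = 34 bytes per 32 weights
}


def estimate_size_gb(n_params: float, bpw: float) -> float:
    """Return the approximate file size in GB (10^9 bytes), ignoring metadata."""
    return n_params * bpw / 8 / 1e9


for name, bpw in BITS_PER_WEIGHT.items():
    print(f"{name}: ~{estimate_size_gb(N_PARAMS, bpw):.1f} GB")
```

Runtime memory use will be higher than the file size, since the KV cache and compute buffers are allocated on top of the weights.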
## Usage
**Ollama**
```bash
ollama run hf.co/Shinzmann/naija-petro-8b-GGUF:Q4_K_M
```
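Once Ollama has pulled the model, it can also be queried from Python through Ollama's local REST API. This is a minimal sketch using only the standard library. It assumes the Ollama server is listening on its default port (11434) and that the model is registered under the same name used in the `ollama run` command; the helper names `build_chat_payload` and `ask` are illustrative, not part of any library.

```python
import json
import urllib.request

# ASSUMPTIONS: Ollama is serving on its default local port, and the model was
# pulled under the same name used in the `ollama run` command.
OLLAMA_URL = "http://localhost:11434/api/chat"
MODEL = "hf.co/Shinzmann/naija-petro-8b-GGUF:Q4_K_M"


def build_chat_payload(question: str, model: str = MODEL) -> dict:
    """Build a non-streaming request body for Ollama's /api/chat endpoint."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": question}],
        "stream": False,  # ask for one JSON object instead of a chunk stream
    }


def ask(question: str, url: str = OLLAMA_URL) -> str:
    """POST the question to the local Ollama server and return the reply text."""
    request = urllib.request.Request(
        url,
        data=json.dumps(build_chat_payload(question)).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        body = json.load(response)
    return body["message"]["content"]
```

With the server running, `print(ask("What is a Horner plot used for?"))` prints the model's reply.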
**llama.cpp**
```bash
# -hf downloads the requested quant from Hugging Face (and caches it) before running
./llama-cli -hf Shinzmann/naija-petro-8b-GGUF:Q4_K_M \
  -p "Explain the material balance equation for an undersaturated reservoir." \
  -c 4096
```
**Python (llama-cpp-python)**
```python
from llama_cpp import Llama

# from_pretrained downloads the matching file from the Hub (needs huggingface-hub installed)
llm = Llama.from_pretrained(
    repo_id="Shinzmann/naija-petro-8b-GGUF",
    filename="*Q4_K_M.gguf",  # glob pattern selecting the Q4_K_M file
    n_ctx=4096,               # context window in tokens
)

response = llm.create_chat_completion(messages=[
    {"role": "system", "content": "You are Naija-Petro, an expert petroleum-engineering AI assistant."},
    {"role": "user", "content": "How do you interpret a Horner plot?"},
])
print(response["choices"][0]["message"]["content"])
```
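For long answers it can be more pleasant to print tokens as they are generated. The sketch below assumes the OpenAI-style chunk layout that `create_chat_completion(..., stream=True)` yields in llama-cpp-python, where each chunk carries a `delta` dict that may or may not contain `content`. The helper name `stream_text` is illustrative.

```python
from typing import Iterable


def stream_text(chunks: Iterable[dict]) -> str:
    """Print streamed text pieces as they arrive and return the full reply."""
    parts = []
    for chunk in chunks:
        delta = chunk["choices"][0].get("delta", {})
        text = delta.get("content")  # role-only and final chunks carry no content
        if text:
            print(text, end="", flush=True)
            parts.append(text)
    print()  # finish the line once the stream ends
    return "".join(parts)


# Usage with the `llm` object from the example above (not executed here):
# reply = stream_text(llm.create_chat_completion(
#     messages=[{"role": "user", "content": "How do you interpret a Horner plot?"}],
#     stream=True,
# ))
```

Keeping the chunk handling in a plain function means it can be exercised with hand-built chunk dicts, without loading the model.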
## License
Apache-2.0 (inherited from Qwen3-8B). Validate outputs with qualified engineers before operational use.