Files
PlayerAI-1.2B-v1.5-GGUF/README.md

212 lines
5.9 KiB
Markdown
Raw Normal View History

---
license: apache-2.0
language:
- en
base_model:
- LiquidAI/LFM2.5-1.2B-Instruct
- YoussefElsafi/PlayerAI-1.2B-v1.5
tags:
- gguf
- quantized
- conversational
- llama-cpp
pipeline_tag: text-generation
---
**PlayerAI-1.2B-v1.5-GGUF** contains GGUF quantized versions of [PlayerAI-1.2B-v1.5](https://huggingface.co/YoussefElsafi/PlayerAI-1.2B), a fine-tuned conversational language model designed for immersive, human-like interaction in multiplayer social environments.
This version improves conversational coherence, tone stability, and multi-turn consistency compared to previous releases, while remaining optimized for lightweight local inference.
---
## Available Quantizations
| File | Quant | Size | Quality | Recommended For |
|------|-------|------|---------|-----------------|
| `PlayerAI-1.2B-v1.5-Q2_K.gguf` | Q2_K | 483 MB | Lowest | Very limited RAM |
| `PlayerAI-1.2B-v1.5-Q3_K_S.gguf` | Q3_K_S | 558 MB | Very Low | Minimal RAM |
| `PlayerAI-1.2B-v1.5-Q3_K_M.gguf` | Q3_K_M | 600 MB | Low | Low RAM |
| `PlayerAI-1.2B-v1.5-Q3_K_L.gguf` | Q3_K_L | 635 MB | Low-Med | Low RAM |
| `PlayerAI-1.2B-v1.5-iQ4_XS.gguf` | IQ4_XS | 669 MB | Medium | Better than Q4 at same size |
| `PlayerAI-1.2B-v1.5-iQ4_NL.gguf` | IQ4_NL | 700 MB | Medium | Better than Q4 at same size |
| `PlayerAI-1.2B-v1.5-Q4_K_S.gguf` | Q4_K_S | 700 MB | Medium | Balanced |
| `PlayerAI-1.2B-v1.5-Q4_K_M.gguf` | Q4_K_M | 731 MB | Medium | ⭐ Recommended |
| `PlayerAI-1.2B-v1.5-Q5_K_S.gguf` | Q5_K_S | 825 MB | Good | High quality |
| `PlayerAI-1.2B-v1.5-Q5_K_M.gguf` | Q5_K_M | 843 MB | Good | High quality |
| `PlayerAI-1.2B-v1.5-Q6_K.gguf` | Q6_K | 963 MB | High | Near lossless |
| `PlayerAI-1.2B-v1.5-Q8_0.gguf` | Q8_0 | 1.25 GB | Very High | Best quality |
| `PlayerAI-1.2B-v1.5-BF16.gguf` | BF16 | 2.34 GB | Native precision | Reference |
| `PlayerAI-1.2B-v1.5-F16.gguf` | F16 | 2.34 GB | Full | Reference / conversion |
---
## Which One Should I Pick?
Since this is a **1.2B model**, all quantizations are lightweight enough for local use.
```
Any device with 1GB+ RAM → Q4_K_M ⭐ (recommended)
Best quality → Q8_0
Lowest size → Q2_K
Balanced performance → Q3_K_M / Q4_K_S
No limits → F16 / BF16
````
---
## How to Use
### With llama.cpp CLI
```bash
hf download YoussefElsafi/PlayerAI-1.2B-v1.5-GGUF \
PlayerAI-1.2B-v1.5-Q4_K_M.gguf \
--local-dir ./PlayerAI-GGUF
./llama.cpp/build/bin/llama-cli \
-m ./PlayerAI-GGUF/PlayerAI-1.2B-v1.5-Q4_K_M.gguf \
-p "User: hi\nAI:" \
-n 100 \
--temp 0.8 \
--top-p 0.9
````
---
### With llama-cpp-python
```python
from llama_cpp import Llama
llm = Llama.from_pretrained(
repo_id="YoussefElsafi/PlayerAI-1.2B-v1.5-GGUF",
filename="PlayerAI-1.2B-v1.5-Q4_K_M.gguf",
n_ctx=512,
verbose=False,
)
SYSTEM_PROMPT = (
"You are a human-like player in a multiplayer chat environment. "
"Respond casually, with short informal messages and natural tone."
)
response = llm.create_chat_completion(
messages=[
{"role": "system", "content": SYSTEM_PROMPT},
{"role": "user", "content": "hi wsp"},
],
max_tokens=80,
temperature=0.8,
top_p=0.9,
)
print(response["choices"][0]["message"]["content"])
```
---
## Model Overview
* **Base Model:** LiquidAI/LFM2.5-1.2B-Instruct
* **Parent Model:** PlayerAI-1.2B-v1.5
* **Parameters:** ~1.2B
* **Architecture:** Decoder-only Transformer
* **Training Type:** Supervised fine-tuning (full model)
* **Context Style:** Multi-turn conversational sequences
* **Primary Objective:** Social realism in dialogue generation
---
## Intended Use
This model is intended for research and experimental use cases involving:
* Multiplayer conversational agents
* Social simulation environments
* NPC dialogue systems
* Human-like chat behavior modeling
* Interactive roleplay systems
It is not intended for:
* factual question answering
* structured instruction following
* safety-critical systems
* deterministic reasoning tasks
---
## Example Interactions
**Note:** All assistant messages are generated by PlayerAI-1.2B-v1.5.
### Example 1 — Single Turn
![example1](https://cdn-uploads.huggingface.co/production/uploads/66777fbec5ec2bdc9468394a/gsl9yw3V6hKzBR--lDv3g.png)
### Example 2 — Short Conversation
![example2](https://cdn-uploads.huggingface.co/production/uploads/66777fbec5ec2bdc9468394a/Wyx8U-eDV5__0390cxXD1.png)
### Example 3 — Extended Context Chain
![example3](https://cdn-uploads.huggingface.co/production/uploads/66777fbec5ec2bdc9468394a/pevyW8RBUOjAF2wia1ct0.png)
### Example 4 — Nonsense Interaction
![example4](https://cdn-uploads.huggingface.co/production/uploads/66777fbec5ec2bdc9468394a/_CjPIzMPgFW9aijihOFRl.png)
### Example 5 — Reverse psychology
![example5](https://cdn-uploads.huggingface.co/production/uploads/66777fbec5ec2bdc9468394a/zUl1rrp3-JwuEHXS1-Giv.png)
---
## Behavior Characteristics
The model exhibits:
* informal conversational tone
* short and adaptive responses
* occasional ambiguity or inconsistency
* strong dependence on recent dialogue context
* variability in emotional and linguistic style
These properties are intentional and aligned with the social simulation objective.
---
## Limitations
* Not suitable for factual reasoning tasks
* May produce inconsistent outputs in long contexts
* Limited stability in structured instruction formats
* Not optimized for deterministic responses
* Can exhibit conversational drift
---
## Ethical Considerations
This model is intended for research and simulation purposes.
* Outputs may appear human-like in social contexts
* Behavior is optimized for realism, not correctness
* Conversational ambiguity is an intentional feature
Appropriate safeguards should be applied depending on deployment context.
---
## Attribution (Optional)
If you use PlayerAI in a project, attribution is appreciated but not required:
"**Powered by PlayerAI**"
---
## License
Apache 2.0