Initialize project; model provided by the ModelHub XC community

Model: QuantFactory/Llama-3.1-8B-ArliAI-RPMax-v1.2-GGUF
Source: Original Platform
This commit is contained in:
ModelHub XC
2026-06-04 02:20:12 +08:00
commit eb1c4ae9ec
17 changed files with 178 additions and 0 deletions

49
.gitattributes vendored Normal file

@@ -0,0 +1,49 @@
*.7z filter=lfs diff=lfs merge=lfs -text
*.arrow filter=lfs diff=lfs merge=lfs -text
*.bin filter=lfs diff=lfs merge=lfs -text
*.bz2 filter=lfs diff=lfs merge=lfs -text
*.ckpt filter=lfs diff=lfs merge=lfs -text
*.ftz filter=lfs diff=lfs merge=lfs -text
*.gz filter=lfs diff=lfs merge=lfs -text
*.h5 filter=lfs diff=lfs merge=lfs -text
*.joblib filter=lfs diff=lfs merge=lfs -text
*.lfs.* filter=lfs diff=lfs merge=lfs -text
*.mlmodel filter=lfs diff=lfs merge=lfs -text
*.model filter=lfs diff=lfs merge=lfs -text
*.msgpack filter=lfs diff=lfs merge=lfs -text
*.npy filter=lfs diff=lfs merge=lfs -text
*.npz filter=lfs diff=lfs merge=lfs -text
*.onnx filter=lfs diff=lfs merge=lfs -text
*.ot filter=lfs diff=lfs merge=lfs -text
*.parquet filter=lfs diff=lfs merge=lfs -text
*.pb filter=lfs diff=lfs merge=lfs -text
*.pickle filter=lfs diff=lfs merge=lfs -text
*.pkl filter=lfs diff=lfs merge=lfs -text
*.pt filter=lfs diff=lfs merge=lfs -text
*.pth filter=lfs diff=lfs merge=lfs -text
*.rar filter=lfs diff=lfs merge=lfs -text
*.safetensors filter=lfs diff=lfs merge=lfs -text
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
*.tar.* filter=lfs diff=lfs merge=lfs -text
*.tar filter=lfs diff=lfs merge=lfs -text
*.tflite filter=lfs diff=lfs merge=lfs -text
*.tgz filter=lfs diff=lfs merge=lfs -text
*.wasm filter=lfs diff=lfs merge=lfs -text
*.xz filter=lfs diff=lfs merge=lfs -text
*.zip filter=lfs diff=lfs merge=lfs -text
*.zst filter=lfs diff=lfs merge=lfs -text
*tfevents* filter=lfs diff=lfs merge=lfs -text
Llama-3.1-8B-ArliAI-RPMax-v1.2.Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.1-8B-ArliAI-RPMax-v1.2.Q4_1.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.1-8B-ArliAI-RPMax-v1.2.Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.1-8B-ArliAI-RPMax-v1.2.Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.1-8B-ArliAI-RPMax-v1.2.Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.1-8B-ArliAI-RPMax-v1.2.Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.1-8B-ArliAI-RPMax-v1.2.Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.1-8B-ArliAI-RPMax-v1.2.Q5_0.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.1-8B-ArliAI-RPMax-v1.2.Q5_1.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.1-8B-ArliAI-RPMax-v1.2.Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.1-8B-ArliAI-RPMax-v1.2.Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.1-8B-ArliAI-RPMax-v1.2.Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.1-8B-ArliAI-RPMax-v1.2.Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.1-8B-ArliAI-RPMax-v1.2.Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:5ed5bd72d11acd927228449d16ff1c84dbd6b2dafbfff83d131fbad1372d7987
size 3179131552

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:b93b3fb77e4ad15118be388f60117d55d437e7c9c67b0491c0be0f01a3307e2f
size 4321956512

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:ea976f5719d202b22b57a8865e103ec0b3a593ef4f9f28cf15c7827370dc6bfa
size 4018918048

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:de9e8b46ef07e4ed6eb81a36fe78f6ede707e8a404e27eb13f7b24d4188988fa
size 3664499360

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:d60df059ffb259234226314bfa71cac62aaadf5f8448dbc9a7588ce8f2bc5fc5
size 4661211808

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:a18b346a3cdd4a27f3e95838c55d8acbe10094c96a2e9708fdab284d70613266
size 5130252960

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:f7d928e09faf9385c9c0700abe630434201d1cbc80573ef1d2a762b5a6d470cb
size 4920734368

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:a0677c3c43614bf72166527cdd0ccb99bf00e6764764e4cf8287ab387aecc506
size 4692669088

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:507f8a5f129385e378db74cb183b49d0fc6390662c829e768f8beb57bfecd86a
size 5599294112

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:e272fc6c7700ba40de1064001d0c78701a983762ed460f63a744df16ef6c047e
size 6068335264

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:b3ed74de8dcf9a080650ff07bed95059f185f8b81c1f1d82186b831ce0a16043
size 5732987552

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:8f9824977936332b75442b18aee579e75c79dc7f8777edb3d86edfe73046c339
size 5599294112

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:3fa7df6e0b433d4f4c066dc88a469a9163ef512c8cffc3c43c24784dd6f9c5f3
size 6596006560

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:7b0c36916318cc34cd53f40c104cf21e60eb8ba1b913a927d7f1d2f9d014b3bb
size 8540770976

86
README.md Normal file

@@ -0,0 +1,86 @@
---
license: llama3.1
---
[![QuantFactory Banner](https://lh7-rt.googleusercontent.com/docsz/AD_4nXeiuCm7c8lEwEJuRey9kiVZsRn2W-b4pWlu3-X534V3YmVuVc2ZL-NXg2RkzSOOS2JXGHutDuyyNAUtdJI65jGTo8jT9Y99tMi4H4MqL44Uc5QKG77B0d6-JfIkZHFaUA71-RtjyYZWVIhqsNZcx8-OMaA?key=xt3VSDoCbmTY7o-cwwOFwQ)](https://hf.co/QuantFactory)
# QuantFactory/Llama-3.1-8B-ArliAI-RPMax-v1.2-GGUF
This is a quantized version of [ArliAI/Llama-3.1-8B-ArliAI-RPMax-v1.2](https://huggingface.co/ArliAI/Llama-3.1-8B-ArliAI-RPMax-v1.2) created using llama.cpp.
# Original Model Card
# Llama-3.1-8B-ArliAI-RPMax-v1.2
=====================================
## RPMax Series Overview
| [2B](https://huggingface.co/ArliAI/Gemma-2-2B-ArliAI-RPMax-v1.1) |
[3.8B](https://huggingface.co/ArliAI/Phi-3.5-mini-3.8B-ArliAI-RPMax-v1.1) |
[8B](https://huggingface.co/ArliAI/Llama-3.1-8B-ArliAI-RPMax-v1.2) |
[9B](https://huggingface.co/ArliAI/Gemma-2-9B-ArliAI-RPMax-v1.1) |
[12B](https://huggingface.co/ArliAI/Mistral-Nemo-12B-ArliAI-RPMax-v1.2) |
[20B](https://huggingface.co/ArliAI/InternLM2_5-20B-ArliAI-RPMax-v1.1) |
[22B](https://huggingface.co/ArliAI/Mistral-Small-22B-ArliAI-RPMax-v1.1) |
[70B](https://huggingface.co/ArliAI/Llama-3.1-70B-ArliAI-RPMax-v1.1) |
RPMax is a series of models trained on a diverse set of curated creative writing and RP datasets, with a focus on variety and deduplication. This model is designed to be highly creative and non-repetitive: no two entries in the dataset share characters or situations, so the model does not latch on to a single personality and can understand and act appropriately for any character or situation.
Early testers report that these models do not feel like other RP models: they have a different style and generally do not feel in-bred.
You can access the model at https://arliai.com and ask questions at https://www.reddit.com/r/ArliAI/
We also have a models ranking page at https://www.arliai.com/models-ranking
Ask questions in our new Discord Server! https://discord.com/invite/t75KbPgwhk
## Model Description
Llama-3.1-8B-ArliAI-RPMax-v1.2 is a variant of the Meta-Llama-3.1-8B model.
The v1.2 update is a retrain on an incrementally improved RPMax dataset, which is deduplicated further and filtered better to cut out irrelevant description text that came from card-sharing sites.
### Specs
* **Context Length**: 128K
* **Parameters**: 8B
### Training Details
* **Sequence Length**: 8192
* **Training Duration**: Approximately 1 day on 2x3090Ti
* **Epochs**: 1 epoch, to minimize repetition sickness
* **LoRA**: 64-rank 128-alpha, resulting in ~2% trainable weights
* **Learning Rate**: 0.00001
* **Gradient accumulation**: Very low, at 32, for better learning
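
The "~2% trainable weights" figure can be sanity-checked with some arithmetic. This is a rough sketch, not the authors' training configuration: the layer shapes and total parameter count are those of the publicly released Llama 3.1 8B architecture, and the assumption that LoRA adapters are attached to every attention and MLP projection is ours, since the card does not list the target modules.

```python
# Assumed Llama 3.1 8B shapes (from the public base-model config, not from this card).
HIDDEN = 4096          # model width
KV_DIM = 1024          # 8 key/value heads x 128 head dim (grouped-query attention)
INTERMEDIATE = 14336   # MLP inner width
LAYERS = 32
TOTAL_PARAMS = 8_030_261_248

# Values stated in the training details above.
RANK = 64
ALPHA = 128

# (in_features, out_features) per projection.
# Assumption: LoRA is applied to all seven linear projections in each layer.
PROJECTIONS = {
    "q_proj": (HIDDEN, HIDDEN),
    "k_proj": (HIDDEN, KV_DIM),
    "v_proj": (HIDDEN, KV_DIM),
    "o_proj": (HIDDEN, HIDDEN),
    "gate_proj": (HIDDEN, INTERMEDIATE),
    "up_proj": (HIDDEN, INTERMEDIATE),
    "down_proj": (INTERMEDIATE, HIDDEN),
}


def lora_param_count(rank: int) -> int:
    """A LoRA pair adds rank * (in + out) weights per adapted matrix."""
    per_layer = sum(rank * (n_in + n_out) for n_in, n_out in PROJECTIONS.values())
    return per_layer * LAYERS


adapter_params = lora_param_count(RANK)
fraction = adapter_params / TOTAL_PARAMS
scaling = ALPHA / RANK  # factor applied to the low-rank update

print(f"adapter parameters: {adapter_params:,}")
print(f"fraction of base weights: {fraction:.2%}")
print(f"LoRA scaling (alpha / rank): {scaling}")
```

Under these assumptions the adapter comes out at roughly 2% of the base weights, consistent with the figure quoted above, and the 128/64 ratio means the low-rank update is scaled by 2.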
## Quantization
The model is available in the following formats (we recommend using full weights or GPTQ):
* **FP16**: https://huggingface.co/ArliAI/Llama-3.1-8B-ArliAI-RPMax-v1.2
* **GGUF**: https://huggingface.co/ArliAI/Llama-3.1-8B-ArliAI-RPMax-v1.2-GGUF
## Suggested Prompt Format
Llama 3 Instruct Format
Example:
```
<|begin_of_text|><|start_header_id|>system<|end_header_id|>
You are [character]. You have a personality of [personality description]. [Describe scenario]<|eot_id|><|start_header_id|>user<|end_header_id|>
{{ user_message_1 }}<|eot_id|><|start_header_id|>assistant<|end_header_id|>
{{ model_answer_1 }}<|eot_id|><|start_header_id|>user<|end_header_id|>
{{ user_message_2 }}<|eot_id|><|start_header_id|>assistant<|end_header_id|>
```
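
The template above can also be assembled programmatically. The following is a minimal sketch: `build_llama3_prompt` is a hypothetical helper name, not part of any library, and the blank line emitted after each header follows Meta's published Llama 3 Instruct template, which is an assumption relative to the single line break shown in the example.

```python
from typing import List, Tuple


def build_llama3_prompt(system: str, turns: List[Tuple[str, str]]) -> str:
    """Render a chat as a Llama 3 Instruct prompt string.

    `turns` is a list of (role, content) pairs with role "user" or "assistant".
    The result ends with an open assistant header so the model writes the next reply.
    """

    def block(role: str, content: str) -> str:
        # One complete message: header, blank line, content, end-of-turn token.
        return f"<|start_header_id|>{role}<|end_header_id|>\n\n{content}<|eot_id|>"

    parts = ["<|begin_of_text|>", block("system", system)]
    parts.extend(block(role, content) for role, content in turns)
    # Leave the final assistant turn open for generation.
    parts.append("<|start_header_id|>assistant<|end_header_id|>\n\n")
    return "".join(parts)


prompt = build_llama3_prompt(
    "You are [character]. You have a personality of [personality description]. [Describe scenario]",
    [
        ("user", "Hello there."),
        ("assistant", "Well met, traveler."),
        ("user", "Where are we?"),
    ],
)
print(prompt)
```

Many runtimes apply the chat template stored in the model file and may add `<|begin_of_text|>` themselves, so a hand-built string like this is mainly useful with raw completion endpoints; check for a duplicated begin-of-text token if you use it elsewhere.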

1
configuration.json Normal file

@@ -0,0 +1 @@
{"framework": "pytorch", "task": "others", "allow_remote": true}