初始化项目，由ModelHub XC社区提供模型

Model: osmapi/Nidum-Llama-3.2-3B-Uncensored-GGUF Source: Original Platform
2026-06-05 22:18:18 +08:00
commit 4a89495f59
17 changed files with 228 additions and 0 deletions
--- a/.gitattributes
+++ b/.gitattributes
@@ -0,0 +1,36 @@
+*.7z filter=lfs diff=lfs merge=lfs -text
+*.arrow filter=lfs diff=lfs merge=lfs -text
+*.bin filter=lfs diff=lfs merge=lfs -text
+*.bz2 filter=lfs diff=lfs merge=lfs -text
+*.ckpt filter=lfs diff=lfs merge=lfs -text
+*.ftz filter=lfs diff=lfs merge=lfs -text
+*.gz filter=lfs diff=lfs merge=lfs -text
+*.h5 filter=lfs diff=lfs merge=lfs -text
+*.joblib filter=lfs diff=lfs merge=lfs -text
+*.lfs.* filter=lfs diff=lfs merge=lfs -text
+*.mlmodel filter=lfs diff=lfs merge=lfs -text
+*.model filter=lfs diff=lfs merge=lfs -text
+*.msgpack filter=lfs diff=lfs merge=lfs -text
+*.npy filter=lfs diff=lfs merge=lfs -text
+*.npz filter=lfs diff=lfs merge=lfs -text
+*.onnx filter=lfs diff=lfs merge=lfs -text
+*.ot filter=lfs diff=lfs merge=lfs -text
+*.parquet filter=lfs diff=lfs merge=lfs -text
+*.pb filter=lfs diff=lfs merge=lfs -text
+*.pickle filter=lfs diff=lfs merge=lfs -text
+*.pkl filter=lfs diff=lfs merge=lfs -text
+*.pt filter=lfs diff=lfs merge=lfs -text
+*.pth filter=lfs diff=lfs merge=lfs -text
+*.rar filter=lfs diff=lfs merge=lfs -text
+*.safetensors filter=lfs diff=lfs merge=lfs -text
+saved_model/**/* filter=lfs diff=lfs merge=lfs -text
+*.tar.* filter=lfs diff=lfs merge=lfs -text
+*.tar filter=lfs diff=lfs merge=lfs -text
+*.tflite filter=lfs diff=lfs merge=lfs -text
+*.tgz filter=lfs diff=lfs merge=lfs -text
+*.wasm filter=lfs diff=lfs merge=lfs -text
+*.xz filter=lfs diff=lfs merge=lfs -text
+*.zip filter=lfs diff=lfs merge=lfs -text
+*.zst filter=lfs diff=lfs merge=lfs -text
+*tfevents* filter=lfs diff=lfs merge=lfs -text
+*.gguf filter=lfs diff=lfs merge=lfs -text
--- a/Nidum-Llama-3.2-3B-Uncensored-F16.gguf
+++ b/Nidum-Llama-3.2-3B-Uncensored-F16.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:767f7d00431b076da787cf79a9179846a7cfbe4a6d60820d4930ce15c9888459
+size 6433688032
--- a/README.md
+++ b/README.md
@@ -0,0 +1,147 @@
+---
+license: apache-2.0
+base_model:
+- nidum/Nidum-Llama-3.2-3B-Uncensored
+- meta-llama/Llama-3.2-3B
+library_name: adapter-transformers
+tags:
+- chemistry
+- biology
+- legal
+- code
+- medical
+- finance
+- roleplay
+- uncensored
+- uncensored LLM
+pipeline_tag: text-generation
+---
+
+### Nidum-Llama-3.2-3B-Uncensored  
+
+### Welcome to Nidum!  
+At Nidum, we believe in pushing the boundaries of innovation by providing advanced and unrestricted AI models for every application. Dive into our world of possibilities and experience the freedom of **Nidum-Llama-3.2-3B-Uncensored**, tailored to meet diverse needs with exceptional performance.
+
+---
+
+[![GitHub Icon](https://upload.wikimedia.org/wikipedia/commons/thumb/9/95/Font_Awesome_5_brands_github.svg/232px-Font_Awesome_5_brands_github.svg.png)](https://github.com/NidumAI-Inc)  
+**Explore Nidum's Open-Source Projects on GitHub**: [https://github.com/NidumAI-Inc](https://github.com/NidumAI-Inc)
+
+---
+### Key Features
+
+1. **Uncensored Responses**: Capable of addressing any query without content restrictions, offering detailed and uninhibited answers.
+2. **Versatility**: Excels in diverse use cases, from complex technical queries to engaging casual conversations.
+3. **Advanced Contextual Understanding**: Draws from an expansive knowledge base for accurate and context-aware outputs.
+4. **Extended Context Handling**: Optimized for handling long-context interactions for improved continuity and depth.
+5. **Customizability**: Adaptable to specific tasks and user preferences through fine-tuning.
+
+---
+
+### Use Cases
+
+- **Open-Ended Q&A**  
+- **Creative Writing and Ideation**  
+- **Research Assistance**  
+- **Educational Queries**  
+- **Casual Conversations**  
+- **Mathematical Problem Solving**  
+- **Long-Context Dialogues**  
+
+---
+
+### How to Use
+
+To start using **Nidum-Llama-3.2-3B-Uncensored**, follow the sample code below:
+
+```python
+import torch
+from transformers import pipeline
+
+pipe = pipeline(
+    "text-generation",
+    model="nidum/Nidum-Llama-3.2-3B-Uncensored",
+    model_kwargs={"torch_dtype": torch.bfloat16},
+    device="cuda",  # replace with "mps" to run on a Mac device
+)
+
+messages = [
+    {"role": "user", "content": "Tell me something fascinating."},
+]
+
+outputs = pipe(messages, max_new_tokens=256)
+assistant_response = outputs[0]["generated_text"][-1]["content"].strip()
+print(assistant_response)
+```
+
+---
+#### Quantized Models Available for Download
+
+| **Quantized Model Version**                                                                                       | **Description**                                                                 |
+|-------------------------------------------------------------------------------------------------------------------|---------------------------------------------------------------------------------|
+| [**Nidum-Llama-3.2-3B-Uncensored-F16.gguf**](https://huggingface.co/nidum/Nidum-Llama-3.2-3B-Uncensored-GGUF/blob/main/Nidum-Llama-3.2-3B-Uncensored-F16.gguf) | Full 16-bit floating point precision for maximum accuracy on high-end GPUs.     |
+| [**model-Q2_K.gguf**](https://huggingface.co/nidum/Nidum-Llama-3.2-3B-Uncensored-GGUF/blob/main/model-Q2_K.gguf)               | Optimized for minimal memory usage with lower precision, suitable for edge cases.|
+| [**model-Q3_K_L.gguf**](https://huggingface.co/nidum/Nidum-Llama-3.2-3B-Uncensored-GGUF/blob/main/model-Q3_K_L.gguf)           | Balanced precision with enhanced memory efficiency for medium-range devices.    |
+| [**model-Q3_K_M.gguf**](https://huggingface.co/nidum/Nidum-Llama-3.2-3B-Uncensored-GGUF/blob/main/model-Q3_K_M.gguf)           | Mid-range quantization for moderate precision and memory usage balance.         |
+| [**model-Q3_K_S.gguf**](https://huggingface.co/nidum/Nidum-Llama-3.2-3B-Uncensored-GGUF/blob/main/model-Q3_K_S.gguf)           | Smaller quantization steps, offering moderate precision with reduced memory use.|
+| [**model-Q4_0_4_4.gguf**](https://huggingface.co/nidum/Nidum-Llama-3.2-3B-Uncensored-GGUF/blob/main/model-Q4_0_4_4.gguf)       | Performance-optimized for low memory, ideal for lightweight deployment.         |
+| [**model-Q4_0_4_8.gguf**](https://huggingface.co/nidum/Nidum-Llama-3.2-3B-Uncensored-GGUF/blob/main/model-Q4_0_4_8.gguf)       | Extended quantization balancing memory use and inference speed.                 |
+| [**model-Q4_0_8_8.gguf**](https://huggingface.co/nidum/Nidum-Llama-3.2-3B-Uncensored-GGUF/blob/main/model-Q4_0_8_8.gguf)       | Advanced memory precision targeting larger contexts.                            |
+| [**model-Q4_K_M.gguf**](https://huggingface.co/nidum/Nidum-Llama-3.2-3B-Uncensored-GGUF/blob/main/model-Q4_K_M.gguf)           | High-efficiency quantization for moderate GPU resources.                        |
+| [**model-Q4_K_S.gguf**](https://huggingface.co/nidum/Nidum-Llama-3.2-3B-Uncensored-GGUF/blob/main/model-Q4_K_S.gguf)           | Optimized for smaller-scale operations with compact memory footprint.           |
+| [**model-Q5_K_M.gguf**](https://huggingface.co/nidum/Nidum-Llama-3.2-3B-Uncensored-GGUF/blob/main/model-Q5_K_M.gguf)           | Balances performance and precision, ideal for robust inferencing environments.  |
+| [**model-Q5_K_S.gguf**](https://huggingface.co/nidum/Nidum-Llama-3.2-3B-Uncensored-GGUF/blob/main/model-Q5_K_S.gguf)           | Moderate quantization targeting performance with minimal resource usage.        |
+| [**model-Q6_K.gguf**](https://huggingface.co/nidum/Nidum-Llama-3.2-3B-Uncensored-GGUF/blob/main/model-Q6_K.gguf)               | High-precision quantization for accurate and stable inferencing tasks.          |
+| [**model-TQ1_0.gguf**](https://huggingface.co/nidum/Nidum-Llama-3.2-3B-Uncensored-GGUF/blob/main/model-TQ1_0.gguf)             | Experimental quantization for targeted applications in test environments.       |
+| [**model-TQ2_0.gguf**](https://huggingface.co/nidum/Nidum-Llama-3.2-3B-Uncensored-GGUF/blob/main/model-TQ2_0.gguf)             | High-performance tuning for experimental use cases and flexible precision.      |
+
+---
+### Datasets and Fine-Tuning
+
+The following fine-tuning datasets are leveraged to enhance specific model capabilities:
+
+- **Uncensored Data**: Enables unrestricted and uninhibited responses.
+- **RAG-Based Fine-Tuning**: Optimizes retrieval-augmented generation for knowledge-intensive tasks.
+- **Long Context Fine-Tuning**: Enhances the model's ability to process and maintain coherence in extended conversations.
+- **Math-Instruct Data**: Specially curated for precise and contextually accurate mathematical reasoning.
+
+---
+
+### Benchmarks  
+
+After fine-tuning with **uncensored data**, **Nidum-Llama-3.2-3B** demonstrates **superior performance compared to the original LLaMA model**, particularly in accuracy and handling diverse, unrestricted scenarios.
+
+#### Benchmark Summary Table
+
+| **Benchmark**    | **Metric**                       | **LLaMA 3.2 3B** | **Nidum 3.2 3B** | **Observation**                                                                                     |
+|-------------------|-----------------------------------|--------------|--------------|-----------------------------------------------------------------------------------------------------|
+| **GPQA**         | Exact Match (Flexible)           | 0.3          | 0.5          | Nidum 3B demonstrates significant improvement, particularly in **generative tasks**.                |
+|                  | Accuracy                         | 0.4          | 0.5          | Consistent improvement, especially in **zero-shot** scenarios.                                      |
+| **HellaSwag**    | Accuracy                         | 0.3          | 0.4          | Better performance in **common sense reasoning** tasks.                                             |
+|                  | Normalized Accuracy              | 0.3          | 0.4          | Enhanced ability to understand and predict context in sentence completion.                          |
+|                  | Normalized Accuracy (Stderr)     | 0.15275      | 0.1633       | Slightly improved consistency in normalized accuracy.                                               |
+|                  | Accuracy (Stderr)                | 0.15275      | 0.1633       | Shows robustness in reasoning accuracy compared to LLaMA 3B.                                        |
+
+---
+
+### Insights:
+1. **GPQA Results**: Fine-tuning on uncensored data has boosted **Nidum 3B's Exact Match and Accuracy**, particularly excelling in **generative** and **zero-shot** tasks involving domain-specific knowledge.
+2. **HellaSwag Results**: **Nidum 3B** consistently outperforms **LLaMA 3B** in **common sense reasoning benchmarks**, indicating enhanced contextual and semantic understanding.
+
+---
+
+### Contributing
+
+We welcome contributions to improve and extend the model’s capabilities. Stay tuned for updates on how to contribute.
+
+---
+
+### Contact
+
+For inquiries, collaborations, or further information, please reach out to us at **info@nidum.ai**.
+
+---
+
+### Explore the Possibilities
+
+Dive into unrestricted creativity and innovation with **Nidum Llama 3.2 3B Uncensored**!
--- a/model-Q2_K.gguf
+++ b/model-Q2_K.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:0eafb04b1efc61aeb5828da51e3f84529878df702adcebe604670d1150490d5b
+size 1363935712
--- a/model-Q3_K_L.gguf
+++ b/model-Q3_K_L.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:b5d496388a257ebef8d0515da29c571d11afc6d053df8395828eaac10f46154f
+size 1815347680
--- a/model-Q3_K_M.gguf
+++ b/model-Q3_K_M.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:01004ee1572e35af05173508bb4c1b3b693d65eda2fcf408cf0cb064b8f9a897
+size 1687159264
--- a/model-Q3_K_S.gguf
+++ b/model-Q3_K_S.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:76570e9965eb3affc896a375aa81df4e3993348f9b94b0edfaaaf5aec5626b47
+size 1542848992
--- a/model-Q4_0_4_4.gguf
+++ b/model-Q4_0_4_4.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:7b0e2a8cd2fa282ed7083d6493b519bc4abf4624c26843b8bb588cc809b02ac3
+size 1917190624
--- a/model-Q4_0_4_8.gguf
+++ b/model-Q4_0_4_8.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:90cfd651466305fe62d4e04de71fcade9d66a54faab3805f89e7815a6dba8eee
+size 1917190624
--- a/model-Q4_0_8_8.gguf
+++ b/model-Q4_0_8_8.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:4b6d63532e6da1a64056e9a12b798bd78befc1540a178a8e1ff9be0b3afac950
+size 1917190624
--- a/model-Q4_K_M.gguf
+++ b/model-Q4_K_M.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:f71ffc4383890374d140a69cd8d6da6179ab4cc00cc9609deadcb3c8766bb387
+size 2019377632
--- a/model-Q4_K_S.gguf
+++ b/model-Q4_K_S.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:dcde8577e5963a0a10f7b3de5fa299e2d24bc03bd2bb22b2c953440b0c23a9df
+size 1928200672
--- a/model-Q5_K_M.gguf
+++ b/model-Q5_K_M.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:7c91827f6e1285b60b901f38a6100ac977af1570c9389b547f4470c3f7e7bc3d
+size 2322153952
--- a/model-Q5_K_S.gguf
+++ b/model-Q5_K_S.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:a132d8b1ca9243358ee0277d1b6b03be71e87196e402f553828f2b6db4b3127e
+size 2269512160
--- a/model-Q6_K.gguf
+++ b/model-Q6_K.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:b8a147548a4ccdc51b958a7fc09f311283893621845ccb1f15db413ed06cd9ae
+size 2643853792
--- a/model-TQ1_0.gguf
+++ b/model-TQ1_0.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:b494f733a564dae0020a78178c33bbfa8bf5d23627a618bf60d574c68df2c2ec
+size 926286304
--- a/model-TQ2_0.gguf
+++ b/model-TQ2_0.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:153693c8664c367ce62a3d2247d4308ff1994cfda6e4354b476b5b3c7cb1577f
+size 1058406880