Initialize project; model provided by the ModelHub XC community
Model: tripathyShaswata/sarvam-1-v0.5-GGUF Source: Original Platform
37
.gitattributes
vendored
Normal file
@@ -0,0 +1,37 @@
*.7z filter=lfs diff=lfs merge=lfs -text
*.arrow filter=lfs diff=lfs merge=lfs -text
*.bin filter=lfs diff=lfs merge=lfs -text
*.bz2 filter=lfs diff=lfs merge=lfs -text
*.ckpt filter=lfs diff=lfs merge=lfs -text
*.ftz filter=lfs diff=lfs merge=lfs -text
*.gz filter=lfs diff=lfs merge=lfs -text
*.h5 filter=lfs diff=lfs merge=lfs -text
*.joblib filter=lfs diff=lfs merge=lfs -text
*.lfs.* filter=lfs diff=lfs merge=lfs -text
*.mlmodel filter=lfs diff=lfs merge=lfs -text
*.model filter=lfs diff=lfs merge=lfs -text
*.msgpack filter=lfs diff=lfs merge=lfs -text
*.npy filter=lfs diff=lfs merge=lfs -text
*.npz filter=lfs diff=lfs merge=lfs -text
*.onnx filter=lfs diff=lfs merge=lfs -text
*.ot filter=lfs diff=lfs merge=lfs -text
*.parquet filter=lfs diff=lfs merge=lfs -text
*.pb filter=lfs diff=lfs merge=lfs -text
*.pickle filter=lfs diff=lfs merge=lfs -text
*.pkl filter=lfs diff=lfs merge=lfs -text
*.pt filter=lfs diff=lfs merge=lfs -text
*.pth filter=lfs diff=lfs merge=lfs -text
*.rar filter=lfs diff=lfs merge=lfs -text
*.safetensors filter=lfs diff=lfs merge=lfs -text
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
*.tar.* filter=lfs diff=lfs merge=lfs -text
*.tar filter=lfs diff=lfs merge=lfs -text
*.tflite filter=lfs diff=lfs merge=lfs -text
*.tgz filter=lfs diff=lfs merge=lfs -text
*.wasm filter=lfs diff=lfs merge=lfs -text
*.xz filter=lfs diff=lfs merge=lfs -text
*.zip filter=lfs diff=lfs merge=lfs -text
*.zst filter=lfs diff=lfs merge=lfs -text
*tfevents* filter=lfs diff=lfs merge=lfs -text
sarvam-1-v0.5-Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
sarvam-1-v0.5-f16.gguf filter=lfs diff=lfs merge=lfs -text
82
README.md
Normal file
@@ -0,0 +1,82 @@
---
license: apache-2.0
base_model: sarvamai/sarvam-1-v0.5
tags:
- gguf
- llama-cpp
- quantized
- ollama
- indic
- indian-languages
- hindi
- multilingual
- sarvam
pipeline_tag: text-generation
language:
- en
- hi
- bn
- ta
- te
- kn
- ml
- mr
- pa
- gu
- or
---

# Sarvam-1-v0.5 GGUF

GGUF versions of [sarvamai/sarvam-1-v0.5](https://huggingface.co/sarvamai/sarvam-1-v0.5) for local inference with **llama.cpp**, **Ollama**, **LM Studio**, and **GPT4All**.

Sarvam-1 is an Indian multilingual LLM built by [Sarvam AI](https://www.sarvam.ai/). It supports **22 Indian languages**, including Hindi, Bengali, Tamil, Telugu, Kannada, Malayalam, Marathi, Punjabi, Gujarati, and Odia, and is based on the Llama architecture with 3.1B parameters.

## Available Quantizations

| File | Quant | Size | RAM Needed | Use Case |
|------|-------|------|------------|----------|
| `sarvam-1-v0.5-Q8_0.gguf` | Q8_0 | 2.5 GiB | ~4 GB | Near-lossless quality at roughly half the size of F16 |
| `sarvam-1-v0.5-f16.gguf` | F16 | 4.7 GiB | ~6 GB | Unquantized 16-bit weights, maximum quality |
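
The sizes in the table follow from the byte counts recorded for the two files in this repository's Git LFS pointers (2667903296 and 5019826496 bytes) when expressed in binary gigabytes (GiB). A minimal sketch of that conversion, assuming `awk` is available; the helper name `bytes_to_gib` is hypothetical and not part of any tool mentioned here:

```bash
# Hypothetical helper: convert a byte count to GiB, rounded to one decimal place.
bytes_to_gib() {
  awk -v b="$1" 'BEGIN { printf "%.1f GiB\n", b / (1024 * 1024 * 1024) }'
}

bytes_to_gib 2667903296   # Q8_0 file, prints "2.5 GiB"
bytes_to_gib 5019826496   # F16 file,  prints "4.7 GiB"
```

In decimal units the same files come to about 2.67 GB and 5.02 GB, which is what most download tools will report.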

## How to Use

### With llama.cpp

```bash
# -m: model file, -p: prompt ("What is the capital of India?"), -n: max tokens to generate
./llama-cli -m sarvam-1-v0.5-Q8_0.gguf -p "भारत की राजधानी क्या है?" -n 256
```
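
If the repository was cloned without Git LFS, the `.gguf` path holds a small text pointer instead of the model, and loading it will fail. A minimal sketch of a pre-flight check, relying on the fact that GGUF files begin with the four ASCII bytes `GGUF`; the helper name `is_gguf` is hypothetical:

```bash
# Hypothetical helper: succeed only if the file starts with the GGUF magic bytes.
is_gguf() {
  [ "$(head -c 4 "$1" 2>/dev/null)" = "GGUF" ]
}

if is_gguf sarvam-1-v0.5-Q8_0.gguf; then
  echo "looks like a GGUF model file"
else
  echo "missing, or not a GGUF file (possibly an un-fetched Git LFS pointer)"
fi
```

This only inspects the header; it does not confirm that the download is complete.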

### With Ollama

```bash
# Create a Modelfile that points at the local GGUF file
echo 'FROM ./sarvam-1-v0.5-Q8_0.gguf' > Modelfile
# Register the model under the name "sarvam", then start an interactive session
ollama create sarvam -f Modelfile
ollama run sarvam
```

### With LM Studio

1. Download `sarvam-1-v0.5-Q8_0.gguf`
2. Open LM Studio → Load Model → Select the file
3. Start chatting in English or any supported Indian language

## Model Details

- **Architecture:** Llama
- **Parameters:** 3.1B
- **Hidden Size:** 2048
- **Layers:** 28
- **Attention Heads:** 16
- **Context Length:** See the original model card
- **Languages:** English + 22 Indian languages (Hindi, Bengali, Tamil, Telugu, Kannada, Malayalam, Marathi, Punjabi, Gujarati, Odia, and more)
- **License:** Apache 2.0

## Original Model

Built by [Sarvam AI](https://www.sarvam.ai/), India's leading AI research company. See the original model at [sarvamai/sarvam-1-v0.5](https://huggingface.co/sarvamai/sarvam-1-v0.5).

## Quantized by

[Shaswata Tripathy](https://shaswatatripathy.com) | [GitHub](https://github.com/ShaswataTripathy) | [Medium](https://medium.com/@tripathyshaswata) | [LinkedIn](https://linkedin.com/in/shaswata-tripathy) | [Hugging Face](https://huggingface.co/tripathyShaswata)
3
sarvam-1-v0.5-Q8_0.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:f10e69074500afec2b6dd270275f7c983bf65ae76a06d5a84cdacafe1e7488bc
size 2667903296
3
sarvam-1-v0.5-f16.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:1512a51e93ab93a86f1d3d3491eba5ac8e5b1ecf526ff2a8429c6cb10f51dedd
size 5019826496
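
Each pointer file stores only the SHA-256 (`oid`) and byte count (`size`) of the real GGUF file, so a download can be checked against those two values. A minimal sketch, assuming GNU `sha256sum` is available (on macOS, `shasum -a 256` is the usual substitute); the helper name `verify_lfs_object` is hypothetical:

```bash
# Hypothetical helper: succeed only if FILE matches the pointer's oid and size.
verify_lfs_object() {
  local file=$1 expected_oid=$2 expected_size=$3
  local actual_size actual_oid
  actual_size=$(wc -c < "$file" | tr -d '[:space:]')
  actual_oid=$(sha256sum "$file" | cut -d ' ' -f 1)
  [ "$actual_size" = "$expected_size" ] && [ "$actual_oid" = "$expected_oid" ]
}

# Values copied from the pointer for sarvam-1-v0.5-f16.gguf.
if [ -f sarvam-1-v0.5-f16.gguf ]; then
  if verify_lfs_object sarvam-1-v0.5-f16.gguf \
      1512a51e93ab93a86f1d3d3491eba5ac8e5b1ecf526ff2a8429c6cb10f51dedd \
      5019826496; then
    echo "checksum and size match"
  else
    echo "mismatch: re-download the file"
  fi
fi
```

Hashing a multi-gigabyte file takes a while, so comparing the size first is a cheap way to spot a truncated download.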