commit 95bcd7f783b7cbd0725ceb5bd1fde1b2ae182f7b Author: ModelHub XC Date: Wed May 6 10:54:45 2026 +0800 初始化项目,由ModelHub XC社区提供模型 Model: tripathyShaswata/sarvam-1-v0.5-GGUF Source: Original Platform diff --git a/.gitattributes b/.gitattributes new file mode 100644 index 0000000..208f855 --- /dev/null +++ b/.gitattributes @@ -0,0 +1,37 @@ +*.7z filter=lfs diff=lfs merge=lfs -text +*.arrow filter=lfs diff=lfs merge=lfs -text +*.bin filter=lfs diff=lfs merge=lfs -text +*.bz2 filter=lfs diff=lfs merge=lfs -text +*.ckpt filter=lfs diff=lfs merge=lfs -text +*.ftz filter=lfs diff=lfs merge=lfs -text +*.gz filter=lfs diff=lfs merge=lfs -text +*.h5 filter=lfs diff=lfs merge=lfs -text +*.joblib filter=lfs diff=lfs merge=lfs -text +*.lfs.* filter=lfs diff=lfs merge=lfs -text +*.mlmodel filter=lfs diff=lfs merge=lfs -text +*.model filter=lfs diff=lfs merge=lfs -text +*.msgpack filter=lfs diff=lfs merge=lfs -text +*.npy filter=lfs diff=lfs merge=lfs -text +*.npz filter=lfs diff=lfs merge=lfs -text +*.onnx filter=lfs diff=lfs merge=lfs -text +*.ot filter=lfs diff=lfs merge=lfs -text +*.parquet filter=lfs diff=lfs merge=lfs -text +*.pb filter=lfs diff=lfs merge=lfs -text +*.pickle filter=lfs diff=lfs merge=lfs -text +*.pkl filter=lfs diff=lfs merge=lfs -text +*.pt filter=lfs diff=lfs merge=lfs -text +*.pth filter=lfs diff=lfs merge=lfs -text +*.rar filter=lfs diff=lfs merge=lfs -text +*.safetensors filter=lfs diff=lfs merge=lfs -text +saved_model/**/* filter=lfs diff=lfs merge=lfs -text +*.tar.* filter=lfs diff=lfs merge=lfs -text +*.tar filter=lfs diff=lfs merge=lfs -text +*.tflite filter=lfs diff=lfs merge=lfs -text +*.tgz filter=lfs diff=lfs merge=lfs -text +*.wasm filter=lfs diff=lfs merge=lfs -text +*.xz filter=lfs diff=lfs merge=lfs -text +*.zip filter=lfs diff=lfs merge=lfs -text +*.zst filter=lfs diff=lfs merge=lfs -text +*tfevents* filter=lfs diff=lfs merge=lfs -text +sarvam-1-v0.5-Q8_0.gguf filter=lfs diff=lfs merge=lfs -text +sarvam-1-v0.5-f16.gguf filter=lfs diff=lfs merge=lfs -text diff --git a/README.md b/README.md new file mode 100644 index 0000000..1ebc6bf --- /dev/null +++ b/README.md @@ -0,0 +1,82 @@ +--- +license: apache-2.0 +base_model: sarvamai/sarvam-1-v0.5 +tags: + - gguf + - llama-cpp + - quantized + - ollama + - indic + - indian-languages + - hindi + - multilingual + - sarvam +pipeline_tag: text-generation +language: + - en + - hi + - bn + - ta + - te + - kn + - ml + - mr + - pa + - gu + - or +--- + +# Sarvam-1-v0.5 GGUF + +GGUF quantized versions of [sarvamai/sarvam-1-v0.5](https://huggingface.co/sarvamai/sarvam-1-v0.5) for local inference with **llama.cpp**, **Ollama**, **LM Studio**, and **GPT4All**. + +Sarvam-1 is an Indian multilingual LLM built by [Sarvam AI](https://www.sarvam.ai/) — supporting **22 Indian languages** including Hindi, Bengali, Tamil, Telugu, Kannada, Malayalam, Marathi, Punjabi, Gujarati, and Odia. Based on Llama architecture with 3.1B parameters. + +## Available Quantizations + +| File | Quant | Size | RAM Needed | Use Case | +|------|-------|------|------------|----------| +| `sarvam-1-v0.5-Q8_0.gguf` | Q8_0 | 2.5 GB | ~4 GB | Best quality, near-lossless | +| `sarvam-1-v0.5-f16.gguf` | F16 | 4.7 GB | ~6 GB | Full precision, maximum quality | + +## How to Use + +### With llama.cpp + +```bash +./llama-cli -m sarvam-1-v0.5-Q8_0.gguf -p "भारत की राजधानी क्या है?" -n 256 +``` + +### With Ollama + +```bash +# Create a Modelfile +echo 'FROM ./sarvam-1-v0.5-Q8_0.gguf' > Modelfile +ollama create sarvam -f Modelfile +ollama run sarvam +``` + +### With LM Studio + +1. Download the Q8_0 file +2. Open LM Studio → Load Model → Select the file +3. Start chatting in English or any supported Indian language + +## Model Details + +- **Architecture:** Llama +- **Parameters:** 3.1B +- **Hidden Size:** 2048 +- **Layers:** 28 +- **Attention Heads:** 16 +- **Context Length:** Check original model card +- **Languages:** English + 22 Indian languages (Hindi, Bengali, Tamil, Telugu, Kannada, Malayalam, Marathi, Punjabi, Gujarati, Odia, and more) +- **License:** Apache 2.0 + +## Original Model + +Built by [Sarvam AI](https://www.sarvam.ai/) — India's leading AI research company. See the original model at [sarvamai/sarvam-1-v0.5](https://huggingface.co/sarvamai/sarvam-1-v0.5). + +## Quantized by + +[Shaswata Tripathy](https://shaswatatripathy.com) | [GitHub](https://github.com/ShaswataTripathy) | [Medium](https://medium.com/@tripathyshaswata) | [LinkedIn](https://linkedin.com/in/shaswata-tripathy) | [Hugging Face](https://huggingface.co/tripathyShaswata) diff --git a/sarvam-1-v0.5-Q8_0.gguf b/sarvam-1-v0.5-Q8_0.gguf new file mode 100644 index 0000000..df2b797 --- /dev/null +++ b/sarvam-1-v0.5-Q8_0.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:f10e69074500afec2b6dd270275f7c983bf65ae76a06d5a84cdacafe1e7488bc +size 2667903296 diff --git a/sarvam-1-v0.5-f16.gguf b/sarvam-1-v0.5-f16.gguf new file mode 100644 index 0000000..128d37c --- /dev/null +++ b/sarvam-1-v0.5-f16.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:1512a51e93ab93a86f1d3d3491eba5ac8e5b1ecf526ff2a8429c6cb10f51dedd +size 5019826496