Go to file

ModelHub XC ad3389eca6 初始化项目，由ModelHub XC社区提供模型

Model: sh4lu-z/Sinhala-Qwen3-v7500
Source: Original Platform

2026-05-19 11:46:37 +08:00

.gitattributes

初始化项目，由ModelHub XC社区提供模型

2026-05-19 11:46:37 +08:00

added_tokens.json

初始化项目，由ModelHub XC社区提供模型

2026-05-19 11:46:37 +08:00

chat_template.jinja

初始化项目，由ModelHub XC社区提供模型

2026-05-19 11:46:37 +08:00

config.json

初始化项目，由ModelHub XC社区提供模型

2026-05-19 11:46:37 +08:00

merges.txt

初始化项目，由ModelHub XC社区提供模型

2026-05-19 11:46:37 +08:00

model.safetensors

初始化项目，由ModelHub XC社区提供模型

2026-05-19 11:46:37 +08:00

README.md

初始化项目，由ModelHub XC社区提供模型

2026-05-19 11:46:37 +08:00

special_tokens_map.json

初始化项目，由ModelHub XC社区提供模型

2026-05-19 11:46:37 +08:00

tokenizer_config.json

初始化项目，由ModelHub XC社区提供模型

2026-05-19 11:46:37 +08:00

tokenizer.json

初始化项目，由ModelHub XC社区提供模型

2026-05-19 11:46:37 +08:00

vocab.json

初始化项目，由ModelHub XC社区提供模型

2026-05-19 11:46:37 +08:00

README.md

base_model, language, pipeline_tag, library_name, license, datasets, tags

base_model

language

pipeline_tag

library_name

license

datasets

Sinhala Qwen 3 - Fine-Tuned Model (v7500)

Model Description

This is a fine-tuned version of the Qwen 3 base model, specifically trained to understand and generate the Sinhala language.

Originally, the base model had absolutely no understanding of Sinhala. Through custom fine-tuning using the Unsloth library, this model has been taught the foundational elements of the language from scratch. It is now capable of basic Sinhala comprehension and text generation, marking a significant step in low-resource language adaptation for this architecture.

Model Details

Base Model: Qwen 3 (Qwen3ForCausalLM)
Language(s): Sinhala (si), English (en)
Fine-Tuning Library: Unsloth (unsloth_version: 2026.1.4)
Checkpoint/Version: v7500 (Trained up to 7500 steps)
Status: Experimental / Early Stage

Intended Use

This model is intended for researchers and developers working on Natural Language Processing (NLP) for the Sinhala language. It can be used as a starting point for further fine-tuning, vocabulary expansion, or basic Sinhala text generation tasks.

Note: Since the model was trained from scratch to learn a completely new language, it might still make grammatical errors or occasionally struggle with complex sentence structures. It represents a foundation that is continuously being improved.

How to Use

You can use this model directly with the transformers library in Python:

from transformers import AutoModelForCausalLM, AutoTokenizer


model_id = "sh4lu-z/Sinhala-Qwen3-v7500" 

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

prompt = "ඔබට කොහොමද?"
inputs = tokenizer(prompt, return_tensors="pt").to("cuda")
outputs = model.generate(**inputs, max_new_tokens=50)

print(tokenizer.decode(outputs[0], skip_special_tokens=True))

Training Procedure

This model was trained using the Unsloth library to significantly speed up the fine-tuning process while maintaining accuracy.

README.md

Sinhala Qwen 3 - Fine-Tuned Model (v7500)

Model Description

Model Details

Intended Use

How to Use

Training Procedure

`Framework:` PyTorch / Transformers

`Hardware:` Fine-tuned on GPU instances

README.md

Sinhala Qwen 3 - Fine-Tuned Model (v7500)

Model Description

Model Details

Intended Use

How to Use

Training Procedure

Framework: PyTorch / Transformers

Hardware: Fine-tuned on GPU instances

`Framework:` PyTorch / Transformers

`Hardware:` Fine-tuned on GPU instances