Go to file

ModelHub XC 84bf965720 初始化项目，由ModelHub XC社区提供模型

Model: damerajee/Gaja-v2.00
Source: Original Platform

2026-05-26 20:25:05 +08:00

.gitattributes

初始化项目，由ModelHub XC社区提供模型

2026-05-26 20:25:05 +08:00

config.json

初始化项目，由ModelHub XC社区提供模型

2026-05-26 20:25:05 +08:00

generation_config.json

初始化项目，由ModelHub XC社区提供模型

2026-05-26 20:25:05 +08:00

model-00001-of-00004.safetensors

初始化项目，由ModelHub XC社区提供模型

2026-05-26 20:25:05 +08:00

model-00002-of-00004.safetensors

初始化项目，由ModelHub XC社区提供模型

2026-05-26 20:25:05 +08:00

model-00003-of-00004.safetensors

初始化项目，由ModelHub XC社区提供模型

2026-05-26 20:25:05 +08:00

model-00004-of-00004.safetensors

初始化项目，由ModelHub XC社区提供模型

2026-05-26 20:25:05 +08:00

model.safetensors.index.json

初始化项目，由ModelHub XC社区提供模型

2026-05-26 20:25:05 +08:00

README.md

初始化项目，由ModelHub XC社区提供模型

2026-05-26 20:25:05 +08:00

special_tokens_map.json

初始化项目，由ModelHub XC社区提供模型

2026-05-26 20:25:05 +08:00

tokenizer_config.json

初始化项目，由ModelHub XC社区提供模型

2026-05-26 20:25:05 +08:00

tokenizer.json

初始化项目，由ModelHub XC社区提供模型

2026-05-26 20:25:05 +08:00

README.md

language, license, library_name, tags, datasets, pipeline_tag, model-index

language

license

library_name

tags

datasets

pipeline_tag

model-index

llama2

transformers

hindi

english

Bilingual

sarvamai/samvaad-hi-v1

text-generation

name

results

Gaja-v2.00

task

dataset

metrics

source

type	name
text-generation	Text Generation

name

type

config

split

args

AI2 Reasoning Challenge (25-Shot)

ai2_arc

ARC-Challenge

test

num_few_shot
25

type	value	name
acc_norm	51.79	normalized accuracy

url	name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=damerajee/Gaja-v2.00	Open LLM Leaderboard

task

dataset

metrics

source

type	name
text-generation	Text Generation

name

type

split

args

HellaSwag (10-Shot)

hellaswag

validation

num_few_shot
10

type	value	name
acc_norm	75.79	normalized accuracy

url	name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=damerajee/Gaja-v2.00	Open LLM Leaderboard

task

dataset

metrics

source

type	name
text-generation	Text Generation

name

type

config

split

args

MMLU (5-Shot)

cais/mmlu

all

test

num_few_shot
5

type	value	name
acc	40.69	accuracy

url	name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=damerajee/Gaja-v2.00	Open LLM Leaderboard

task

dataset

metrics

source

type	name
text-generation	Text Generation

name

type

config

split

args

TruthfulQA (0-shot)

truthful_qa

multiple_choice

validation

num_few_shot
0

type	value
mc2	41.5

url	name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=damerajee/Gaja-v2.00	Open LLM Leaderboard

task

dataset

metrics

source

type	name
text-generation	Text Generation

name

type

config

split

args

Winogrande (5-shot)

winogrande

winogrande_xl

validation

num_few_shot
5

type	value	name
acc	71.9	accuracy

url	name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=damerajee/Gaja-v2.00	Open LLM Leaderboard

task

dataset

metrics

source

type	name
text-generation	Text Generation

name

type

config

split

args

GSM8k (5-shot)

gsm8k

main

test

num_few_shot
5

type	value	name
acc	0.23	accuracy

url	name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=damerajee/Gaja-v2.00	Open LLM Leaderboard

Model

🐘 Gaja

Gaja is a Hindi/Hinglish chat model, initially trained on SarvamAI's OpenHathi model and further fine-tuned for conversational interactions.

Additional Information

It outperforms Airavata, AI4Bharat's chat version, on Huggingface OpenLLM benchmark suite.
It was fine-tuned on only 5k samples

Inference

hey guys thanks to Bhabha AI, you guys can finally try my model

Additional Information

The code for this can be found in The github code - Github

💬 Prompt template

<|im_start|>user
{}<|im_end|> 
<|im_start|>assistant
{}<|im_end|>

😎 Features:

Language Support: Gaja is designed to understand and generate responses in both Hindi and Hinglish, catering to a diverse range of users.
Base Model: Built upon SarvamAI's OpenHathi model, Gaja inherits its foundational capabilities while being optimized for conversational tasks.
Fine-tuning: Gaja has undergone fine-tuning specifically for chat-based interactions, enhancing its ability to engage in meaningful conversations with users.
Experimental Platform: With its flexibility and adaptability, Gaja serves as a valuable platform for conducting experiments and exploring innovative approaches to chatbot development.

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	46.98
AI2 Reasoning Challenge (25-Shot)	51.79
HellaSwag (10-Shot)	75.79
MMLU (5-Shot)	40.69
TruthfulQA (0-shot)	41.50
Winogrande (5-shot)	71.90
GSM8k (5-shot)	0.23