Go to file

ModelHub XC 8b35c21c45 初始化项目，由ModelHub XC社区提供模型

Model: fierysurf/Ambari-7B-base-v0.1-sharded
Source: Original Platform

2026-05-02 06:37:07 +08:00

.gitattributes

初始化项目，由ModelHub XC社区提供模型

2026-05-02 06:37:07 +08:00

config.json

初始化项目，由ModelHub XC社区提供模型

2026-05-02 06:37:07 +08:00

generation_config.json

初始化项目，由ModelHub XC社区提供模型

2026-05-02 06:37:07 +08:00

model-00001-of-00003.safetensors

初始化项目，由ModelHub XC社区提供模型

2026-05-02 06:37:07 +08:00

model-00002-of-00003.safetensors

初始化项目，由ModelHub XC社区提供模型

2026-05-02 06:37:07 +08:00

model-00003-of-00003.safetensors

初始化项目，由ModelHub XC社区提供模型

2026-05-02 06:37:07 +08:00

model.safetensors.index.json

初始化项目，由ModelHub XC社区提供模型

2026-05-02 06:37:07 +08:00

README.md

初始化项目，由ModelHub XC社区提供模型

2026-05-02 06:37:07 +08:00

special_tokens_map.json

初始化项目，由ModelHub XC社区提供模型

2026-05-02 06:37:07 +08:00

tokenizer_config.json

初始化项目，由ModelHub XC社区提供模型

2026-05-02 06:37:07 +08:00

tokenizer.json

初始化项目，由ModelHub XC社区提供模型

2026-05-02 06:37:07 +08:00

tokenizer.model

初始化项目，由ModelHub XC社区提供模型

2026-05-02 06:37:07 +08:00

README.md

license, language, metrics, pipeline_tag, tags

license

language

metrics

pipeline_tag

Ambari-7B-Base-v0.1 (sharded)

Overview

Ambari-7B-Base-v0.1 is the first bilingual English/Kannada model in the Ambari series, developed and released by Cognitivelab.in. Based on the Llama2 model by Meta, this 7B parameter model is the outcome of the pretraining stage, involving training on approximately 500 million new Kannada tokens.

Usage

To use the Ambari-7B-Base-v0.1 model, you can follow the example code below:

# Usage
import torch
from transformers import LlamaTokenizer, LlamaForCausalLM

model = LlamaForCausalLM.from_pretrained('Cognitive-Lab/Ambari-7B-Base-v0.1')
tokenizer = LlamaTokenizer.from_pretrained('Cognitive-Lab/Ambari-7B-Base-v0.1')

prompt = "ಕನ್ನಡದ ಇತಿಹಾಸವನ್ನು ವಿವರವಾಗಿ ತಿಳಿಸಿ"
inputs = tokenizer(prompt, return_tensors="pt")

# Generate
generate_ids = model.generate(inputs.input_ids, max_length=30)
decoded_output = tokenizer.batch_decode(generate_ids, skip_special_tokens=True, clean_up_tokenization_spaces=False)[0]

print(decoded_output)

Important: The provided model serves as a foundation and is not designed for independent use. We strongly advise conducting finetuning tailored to your particular task(s) of interest before deploying it in a production environment. Feel free to customize the code according to your specific use case, ensuring that the model undergoes finetuning for optimal performance in your desired application.