91 lines
2.9 KiB
Markdown
91 lines
2.9 KiB
Markdown
|
|
---
|
|||
|
|
base_model: unsloth/Qwen2.5-1.5B-Instruct
|
|||
|
|
tags:
|
|||
|
|
- text-generation-inference
|
|||
|
|
- transformers
|
|||
|
|
- unsloth
|
|||
|
|
- qwen2.5
|
|||
|
|
license: apache-2.0
|
|||
|
|
language:
|
|||
|
|
- en
|
|||
|
|
- hi
|
|||
|
|
---
|
|||
|
|
|
|||
|
|
As calling operations scale, it becomes clear that dialing and talking is not enough.
|
|||
|
|
Even with a strong voice AI + telephony architecture, the real value shows up only when post-call actions are captured and executed in a robust, dependable and consistent way. Closing the loop matters more than just connecting the call.
|
|||
|
|
|
|||
|
|
To support that, we’re releasing our Hindi + English transcript analytics model tuned specifically for call transcripts:
|
|||
|
|
|
|||
|
|
|
|||
|
|
You can plug it into your calling or voice AI stack to automatically extract:
|
|||
|
|
|
|||
|
|
• Enum-based classifications (e.g., call outcome, intent, disposition)
|
|||
|
|
• Conversation summaries
|
|||
|
|
• Action items / follow-ups
|
|||
|
|
|
|||
|
|
It’s built to handle real-world Hindi, English, and mixed Hinglish calls, including noisy transcripts.
|
|||
|
|
|
|||
|
|
Finetuning Parameters:
|
|||
|
|
```
|
|||
|
|
rank = 64
|
|||
|
|
lora_alpha = rank*2,
|
|||
|
|
target_modules = ["q_proj", "k_proj", "v_proj", "o_proj",
|
|||
|
|
"gate_proj", "up_proj", "down_proj",],
|
|||
|
|
SFTConfig(
|
|||
|
|
dataset_text_field = "prompt",
|
|||
|
|
per_device_train_batch_size = 32,
|
|||
|
|
gradient_accumulation_steps = 1, # Use GA to mimic batch size!
|
|||
|
|
warmup_steps = 5,
|
|||
|
|
num_train_epochs = 2,
|
|||
|
|
learning_rate = 2e-4,
|
|||
|
|
logging_steps = 50,
|
|||
|
|
optim = "adamw_8bit",
|
|||
|
|
weight_decay = 0.001,
|
|||
|
|
lr_scheduler_type = "linear",
|
|||
|
|
seed = SEED,
|
|||
|
|
report_to = "wandb",
|
|||
|
|
eval_strategy="steps",
|
|||
|
|
eval_steps=200,
|
|||
|
|
)
|
|||
|
|
The model was finetuned on ~100,000 curated transcripts across different domanins and language preferences
|
|||
|
|
```
|
|||
|
|

|
|||
|
|
|
|||
|
|
|
|||
|
|
Provide the below schema for best output:
|
|||
|
|
```
|
|||
|
|
response_schema = {
|
|||
|
|
"type": "object",
|
|||
|
|
"properties": {
|
|||
|
|
"key_points": {
|
|||
|
|
"type": "array",
|
|||
|
|
"items": {"type": "string"},
|
|||
|
|
"nullable": True,
|
|||
|
|
},
|
|||
|
|
"action_items": {
|
|||
|
|
"type": "array",
|
|||
|
|
"items": {"type": "string"},
|
|||
|
|
"nullable": True,
|
|||
|
|
},
|
|||
|
|
"summary": {"type": "string"},
|
|||
|
|
"classification": classification_schema,
|
|||
|
|
},
|
|||
|
|
"required": ["summary", "classification"],
|
|||
|
|
}
|
|||
|
|
```
|
|||
|
|
|
|||
|
|
- **Developed by:** RinggAI
|
|||
|
|
- **License:** apache-2.0
|
|||
|
|
- **Finetuned from model :** unsloth/Qwen2.5-1.5B-Instruct
|
|||
|
|
- Parameter decision where made using
|
|||
|
|
**Schulman, J., & Thinking Machines Lab. (2025).**
|
|||
|
|
*LoRA Without Regret.*
|
|||
|
|
Thinking Machines Lab: Connectionism.
|
|||
|
|
DOI: 10.64434/tml.20250929
|
|||
|
|
Link: https://thinkingmachines.ai/blog/lora/
|
|||
|
|
|
|||
|
|
|
|||
|
|
|
|||
|
|
[<img style="border-radius: 20px;" src="https://storage.googleapis.com/desivocal-prod/desi-vocal/logo.png" width="200"/>](https://ringg.ai)
|
|||
|
|
[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
|