---
license: mit
datasets:
- openslr/openslr
- seanghay/km-speech-corpus
- ylacombe/english_dialects
- google/fleurs
language:
- km
- en
metrics:
- wer
base_model:
- openai/whisper-base
new_version: Vira21/Whisper-Base-KhmerV2
pipeline_tag: automatic-speech-recognition
library_name: transformers
---
# Whisper-Base-KhmerV2

This model is a fine-tuned variant of [openai/whisper-base](https://huggingface.co/openai/whisper-base) for automatic speech recognition. It targets improved transcription accuracy for Khmer and English, with a focus on non-English speech and dialectal variation. The card metadata lists openslr/openslr, seanghay/km-speech-corpus, ylacombe/english_dialects, and google/fleurs as its datasets.

You can try live transcription and the model's multilingual support in the demo Space: [Whisper-Base-KhmerV2 Demo](https://huggingface.co/spaces/Vira21/Whisper-Base-KhmerV2).
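For local use, the metadata names `transformers` as the library and `automatic-speech-recognition` as the pipeline tag, so loading could look roughly like the sketch below. It rests on assumptions: the repo ID is taken from the `new_version` field and may not match where the checkpoint is actually hosted, `sample.wav` is a hypothetical file, and the helper names `build_asr` and `transcribe` are illustrative only.

```python
import sys

# Assumption: repo ID copied from the card's `new_version` field.
MODEL_ID = "Vira21/Whisper-Base-KhmerV2"


def build_asr(model_id: str = MODEL_ID):
    """Create a transformers ASR pipeline (downloads weights on first use)."""
    # Imported here so the helpers below can be used without transformers installed.
    from transformers import pipeline

    return pipeline("automatic-speech-recognition", model=model_id)


def transcribe(asr, audio_path: str, language: str = "khmer") -> str:
    """Run any ASR callable on one audio file and return the stripped text."""
    result = asr(
        audio_path,
        # Whisper-specific generation options: force the language and
        # transcribe rather than translate.
        generate_kwargs={"language": language, "task": "transcribe"},
    )
    return result["text"].strip()


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage: python transcribe.py sample.wav
    print(transcribe(build_asr(), sys.argv[1]))
```

Passing `language="english"` instead should switch the forced decoding language. Reading audio from a file path typically also needs `ffmpeg` available on the system.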
- **Metrics**:
  - **WER (Word Error Rate)**: 0.4529
  - **Training Loss**: 0.1012
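For readers unfamiliar with the metric, WER is conventionally the number of word substitutions, deletions, and insertions needed to turn the reference transcript into the model output, divided by the number of reference words. The sketch below is a minimal, dependency-free illustration of that definition, not the evaluation script behind the figure above. The function name `word_error_rate` and the whitespace tokenization are assumptions; Khmer is written without spaces between words, so the reported score depends on a segmentation step this card does not describe.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.split()  # assumption: words are whitespace-separated
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(
                min(
                    prev[j] + 1,         # deletion: reference word missing
                    curr[j - 1] + 1,     # insertion: extra hypothesis word
                    prev[j - 1] + cost,  # match or substitution
                )
            )
        prev = curr
    return prev[-1] / len(ref)


# One substitution (sat -> sit) and one deletion (the) over 6 reference words.
print(round(word_error_rate("the cat sat on the mat", "the cat sit on mat"), 4))  # 0.3333
```

Under this definition, a WER of 0.4529 corresponds to roughly 45 word-level errors per 100 reference words.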