This model is a fine-tuned version of openai/whisper-large-v3-turbo on the JacobLinCool/common_voice_19_0_zh-TW dataset.
It achieves the following results on the evaluation set:
Loss: 0.1786
Wer: 32.5554
Cer: 8.6009
Decode Runtime: 90.9833
Wer Runtime: 0.1257
Cer Runtime: 0.1534
Model description
This is an open-source Traditional Chinese (Taiwan) automatic speech recognition (ASR) model.
Intended uses & limitations
This model is designed to be a prompt-free ASR model for Traditional Chinese. Due to its inherited language identification (LID) system from Whisper, which supports other Chinese language variants under the same language token (zh), we expect that performance may degrade when transcribing Simplified Chinese.
The model is free to use under the MIT license.
Training and evaluation data
This model was trained on the Common Voice Corpus 19.0 Chinese (Taiwan) Subset, containing about 50k training examples (44 hours) and 5k test examples (5 hours). This dataset is four times larger than the combination of training and validation set (train+validation) of mozilla-foundation/common_voice_16_1, which includes about 12k examples.