Go to file

ModelHub XC 6551b486bd 初始化项目，由ModelHub XC社区提供模型

Model: ayoubkirouane/whisper-small-ar
Source: Original Platform

2026-05-08 11:39:10 +08:00

.gitattributes

初始化项目，由ModelHub XC社区提供模型

2026-05-08 11:39:10 +08:00

added_tokens.json

初始化项目，由ModelHub XC社区提供模型

2026-05-08 11:39:10 +08:00

config.json

初始化项目，由ModelHub XC社区提供模型

2026-05-08 11:39:10 +08:00

generation_config.json

初始化项目，由ModelHub XC社区提供模型

2026-05-08 11:39:10 +08:00

merges.txt

初始化项目，由ModelHub XC社区提供模型

2026-05-08 11:39:10 +08:00

normalizer.json

初始化项目，由ModelHub XC社区提供模型

2026-05-08 11:39:10 +08:00

preprocessor_config.json

初始化项目，由ModelHub XC社区提供模型

2026-05-08 11:39:10 +08:00

pytorch_model.bin

初始化项目，由ModelHub XC社区提供模型

2026-05-08 11:39:10 +08:00

README.md

初始化项目，由ModelHub XC社区提供模型

2026-05-08 11:39:10 +08:00

special_tokens_map.json

初始化项目，由ModelHub XC社区提供模型

2026-05-08 11:39:10 +08:00

tokenizer_config.json

初始化项目，由ModelHub XC社区提供模型

2026-05-08 11:39:10 +08:00

training_args.bin

初始化项目，由ModelHub XC社区提供模型

2026-05-08 11:39:10 +08:00

vocab.json

初始化项目，由ModelHub XC社区提供模型

2026-05-08 11:39:10 +08:00

README.md

language, license, base_model, datasets, model-index, library_name, pipeline_tag

language

license

base_model

datasets

model-index

library_name

pipeline_tag

apache-2.0

openai/whisper-small

mozilla-foundation/common_voice_11_0

name

results

Whisper-small-ar

transformers

automatic-speech-recognition

Arabic-Whisper Small

Description

Whisper-small-ar is an Automatic Speech Recognition (ASR) model fine-tuned specifically for the Arabic language using the Whisper model architecture. ASR models are designed to convert spoken language into written text. This model has been fine-tuned on the Mozilla Common Voice dataset (version 11.0) to transcribe spoken Arabic speech into textual form.

Key Features

Arabic Language Support: Whisper-small-ar is optimized for recognizing and transcribing the Arabic language accurately. It can handle various Arabic dialects and accents.
Transformer Architecture: The model is built on a powerful Transformer-based encoder-decoder architecture, which has demonstrated state-of-the-art performance in various natural language processing tasks, including ASR.
Fine-tuned for Arabic ASR: The model has undergone a fine-tuning process on a substantial amount of Arabic speech data, making it well-suited for a wide range of ASR applications in Arabic, such as transcription of podcasts, call center recordings, and more.
Open-Source: Whisper-small-ar is open-source and available for use by the research and developer community, facilitating the advancement of ASR technology for the Arabic language.
Compatible with Hugging Face Transformers: You can easily integrate and utilize this model in your ASR projects using the Hugging Face Transformers library.

Use Cases

Whisper-small-ar can be employed in a variety of ASR use cases, including:

Transcription Services: Convert spoken Arabic content, such as audio recordings, podcasts, or videos, into written text for indexing, search, or translation purposes.
Voice Assistants: Enhance voice-activated systems and virtual assistants with accurate Arabic speech recognition capabilities.
Language Processing Applications: Integrate the model into applications involving Arabic language processing, such as sentiment analysis, keyword extraction, and more.
Multilingual ASR: Combine Whisper-small-ar with other multilingual ASR models for applications requiring recognition of multiple languages.

Usage

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("automatic-speech-recognition", model="ayoubkirouane/whisper-small-ar")

def transcribe(audio):
    text = pipe(audio)["text"]
    return text