53 lines
1.6 KiB
Markdown
53 lines
1.6 KiB
Markdown
# Introduction
|
|
|
|
This repository supports running the following functions **locally**
|
|
|
|
- Speech-to-text (i.e., ASR); both streaming and non-streaming are supported
|
|
- Text-to-speech (i.e., TTS)
|
|
- Speaker identification
|
|
- Speaker verification
|
|
- Spoken language identification
|
|
- Audio tagging
|
|
- VAD (e.g., [silero-vad](https://github.com/snakers4/silero-vad))
|
|
|
|
on the following platforms and operating systems:
|
|
|
|
- x86, ``x86_64``, 32-bit ARM, 64-bit ARM (arm64, aarch64), RISC-V (riscv64)
|
|
- Linux, macOS, Windows, openKylin
|
|
- Android, WearOS
|
|
- iOS
|
|
- NodeJS
|
|
- WebAssembly
|
|
- [Raspberry Pi](https://www.raspberrypi.com/)
|
|
- [RV1126](https://www.rock-chips.com/uploads/pdf/2022.8.26/191/RV1126%20Brief%20Datasheet.pdf)
|
|
- [LicheePi4A](https://sipeed.com/licheepi4a)
|
|
- [VisionFive 2](https://www.starfivetech.com/en/site/boards)
|
|
- [旭日X3派](https://developer.horizon.ai/api/v1/fileData/documents_pi/index.html)
|
|
- etc
|
|
|
|
with the following APIs
|
|
|
|
- C++
|
|
- C
|
|
- Python
|
|
- Go
|
|
- ``C#``
|
|
- Javascript
|
|
- Java
|
|
- Kotlin
|
|
- Swift
|
|
|
|
# Useful links
|
|
|
|
- Documentation: https://k2-fsa.github.io/sherpa/onnx/
|
|
- APK for the text-to-speech engine: https://k2-fsa.github.io/sherpa/onnx/tts/apk-engine.html
|
|
- APK for speaker identification: https://k2-fsa.github.io/sherpa/onnx/speaker-identification/apk.html
|
|
- APK for speech recognition: https://github.com/k2-fsa/sherpa-onnx/releases/
|
|
- Bilibili 演示视频: https://search.bilibili.com/all?keyword=%E6%96%B0%E4%B8%80%E4%BB%A3Kaldi
|
|
|
|
# How to reach us
|
|
|
|
Please see
|
|
https://k2-fsa.github.io/sherpa/social-groups.html
|
|
for 新一代 Kaldi **微信交流群** and **QQ 交流群**.
|