enginex_bi_series-sherpa-onnx

EngineX-Iluvatar/enginex_bi_series-sherpa-onnx

Archived

This repository has been archived on 2025-08-26. You can view files and clone it, but cannot push or open issues or pull requests.

Go to file

Fangjun Kuang f93f0ca94d Use a separate thread to initialize models for lazarus examples. (#1270 )

So that the main thread is not blocked and the user interface is responsive.

2024-08-18 14:59:48 +08:00

.github

Use a separate thread to initialize models for lazarus examples. (#1270 )

2024-08-18 14:59:48 +08:00

android

Enable to stop TTS generation (#1041 )

2024-06-22 18:18:36 +08:00

c-api-examples

Add more C API examples (#1255 )

2024-08-14 10:52:47 +08:00

cmake

Fix style issues for online punctuation source files (#1225 )

2024-08-06 17:43:24 +08:00

dart-api-examples

Release v1.10.22 (#1267 )

2024-08-16 22:40:49 +08:00

dotnet-examples

Add test about whisper large-v3 for .Net (#1187 )

2024-07-29 20:49:38 +08:00

ffmpeg-examples

Fix ffmpeg c api example (#1185 )

2024-07-29 14:27:55 +08:00

flutter

flutter: add lang, emotion, event to OfflineRecognizerResult (#1268 )

2024-08-17 07:21:59 +08:00

flutter-examples

Release v1.10.22 (#1267 )

2024-08-16 22:40:49 +08:00

go-api-examples

Add Go API for SenseVoice (#1154 )

2024-07-20 23:41:53 +08:00

ios-swift

Revert to onnxruntime 1.17.1 (#1131 )

2024-07-15 14:24:08 +08:00

ios-swiftui

Add MeloTTS example for ios (#1223 )

2024-08-06 14:48:54 +08:00

java-api-examples

Pascal API for streaming ASR (#1246 )

2024-08-12 19:55:51 +08:00

kotlin-api-examples

Add Java and Kotlin API for sense voice (#1164 )

2024-07-22 14:08:40 +08:00

lazarus-examples

Use a separate thread to initialize models for lazarus examples. (#1270 )

2024-08-18 14:59:48 +08:00

mfc-examples

fix building MFC examples (#1178 )

2024-07-28 14:07:25 +08:00

nodejs-addon-examples

Release v1.10.22 (#1267 )

2024-08-16 22:40:49 +08:00

nodejs-examples

Add WebAssembly for SenseVoice (#1158 )

2024-07-21 15:39:55 +08:00

pascal-api-examples

Build generating subtitles APPs for more models (#1265 )

2024-08-16 20:11:24 +08:00

python-api-examples

Fix python two pass ASR examples (#1230 )

2024-08-07 18:35:38 +08:00

rust-api-examples

Update README to include Rust. (#1212 )

2024-08-04 12:20:05 +08:00

scripts

Build generating subtitles APPs for more models (#1265 )

2024-08-16 20:11:24 +08:00

sherpa-onnx

Use a separate thread to initialize models for lazarus examples. (#1270 )

2024-08-18 14:59:48 +08:00

swift-api-examples

Add blank penalty for various language bindings. (#1234 )

2024-08-08 10:43:31 +08:00

toolchains

Support RISC-V (#609 )

2024-02-26 06:57:18 +08:00

wasm

Add blank penalty for various language bindings. (#1234 )

2024-08-08 10:43:31 +08:00

.clang-format

add java wrapper suppport (#117 )

2023-04-15 22:17:28 +08:00

.clang-tidy

Support clang-tidy (#1034 )

2024-06-19 20:51:57 +08:00

.flake8

add offline websocket server/client (#98 )

2023-03-29 21:48:45 +08:00

.gitignore

Add Pascal API for reading wave files (#1243 )

2024-08-11 22:43:42 +08:00

build-aarch64-linux-gnu.sh

Fix the alsa-lib version to v1.2.12 (#1048 )

2024-06-23 20:20:38 +08:00

build-android-arm64-v8a.sh

Revert to onnxruntime 1.17.1 (#1131 )

2024-07-15 14:24:08 +08:00

build-android-armv7-eabi.sh

Revert to onnxruntime 1.17.1 (#1131 )

2024-07-15 14:24:08 +08:00

build-android-x86-64.sh

Revert to onnxruntime 1.17.1 (#1131 )

2024-07-15 14:24:08 +08:00

build-android-x86.sh

Revert to onnxruntime 1.17.1 (#1131 )

2024-07-15 14:24:08 +08:00

build-arm-linux-gnueabihf.sh

Fix the alsa-lib version to v1.2.12 (#1048 )

2024-06-23 20:20:38 +08:00

build-ios-no-tts.sh

Add blank penalty for various language bindings. (#1234 )

2024-08-08 10:43:31 +08:00

build-ios-shared.sh

Revert to onnxruntime 1.17.1 (#1131 )

2024-07-15 14:24:08 +08:00

build-ios.sh

Add MeloTTS example for ios (#1223 )

2024-08-06 14:48:54 +08:00

build-riscv64-linux-gnu.sh

Fix the alsa-lib version to v1.2.12 (#1048 )

2024-06-23 20:20:38 +08:00

build-swift-macos.sh

Fix CI errors. (#993 )

2024-06-12 11:42:19 +08:00

build-wasm-simd-asr.sh

Add WebAssembly for ASR (#604 )

2024-02-23 17:39:11 +08:00

build-wasm-simd-kws.sh

small fixes to wasm kws. (#672 )

2024-03-18 15:28:10 +08:00

build-wasm-simd-nodejs.sh

return timestamps for WebAssembly (#737 )

2024-04-05 20:24:27 +08:00

build-wasm-simd-tts.sh

Add WebAssembly for ASR (#604 )

2024-02-23 17:39:11 +08:00

CHANGELOG.md

Release v1.10.22 (#1267 )

2024-08-16 22:40:49 +08:00

CMakeLists.txt

Release v1.10.22 (#1267 )

2024-08-16 22:40:49 +08:00

CPPLINT.cfg

Use static libraries for MFC examples (#210 )

2023-07-13 14:52:43 +08:00

LICENSE

Use standard apache 2.0 license (#53 )

2023-02-22 11:30:46 +08:00

MANIFEST.in

Fix building wheels from source. (#632 )

2024-03-04 16:39:51 +08:00

README.md

Build generating subtitles APPs for more models (#1265 )

2024-08-16 20:11:24 +08:00

release.sh

Publish pre-compiled libs for Android. (#217 )

2023-07-15 12:25:18 +08:00

setup.py

Provide pre-built wheels with CUDA support. (#1143 )

2024-07-17 22:59:13 +08:00

README.md

Supported functions

Speech recognition	Speech synthesis	Speaker verification	Speaker identification
✔️	✔️	✔️	✔️

Spoken Language identification	Audio tagging	Voice activity detection
✔️	✔️	✔️

Keyword spotting	Add punctuation
✔️	✔️

Supported platforms

Architecture	Android	iOS	Windows	macOS	linux
x64	✔️		✔️	✔️	✔️
x86	✔️		✔️
arm64	✔️	✔️	✔️	✔️	✔️
arm32	✔️				✔️
riscv64					✔️

Supported programming languages

1. C++	2. C	3. Python	4. JavaScript
✔️	✔️	✔️	✔️

5. Java	6. C#	7. Kotlin	8. Swift
✔️	✔️	✔️	✔️

9. Go	10. Dart	11. Rust	12. Pascal
✔️	✔️	✔️	✔️

For Rust support, please see https://github.com/thewh1teagle/sherpa-rs

It also supports WebAssembly.

Introduction

This repository supports running the following functions locally

Speech-to-text (i.e., ASR); both streaming and non-streaming are supported
Text-to-speech (i.e., TTS)
Speaker identification
Speaker verification
Spoken language identification
Audio tagging
VAD (e.g., silero-vad)
Keyword spotting

on the following platforms and operating systems:

x86, x86_64, 32-bit ARM, 64-bit ARM (arm64, aarch64), RISC-V (riscv64)
Linux, macOS, Windows, openKylin
Android, WearOS
iOS
NodeJS
WebAssembly
Raspberry Pi
RV1126
LicheePi4A
VisionFive 2
旭日X3派
etc

with the following APIs

C++, C, Python, Go, C#
Java, Kotlin, JavaScript
Swift, Rust
Dart, Object Pascal

Links for pre-built Android APKs

Description	URL	中国用户
Streaming speech recognition	Address	点此
Text-to-speech	Address	点此
Voice activity detection (VAD)	Address	点此
VAD + non-streaming speech recognition	Address	点此
Two-pass speech recognition	Address	点此
Audio tagging	Address	点此
Audio tagging (WearOS)	Address	点此
Speaker identification	Address	点此
Spoken language identification	Address	点此
Keyword spotting	Address	点此

Links for pre-built Flutter APPs

Real-time speech recognition

Description	URL	中国用户
Streaming speech recognition	Address	点此

Text-to-speech

Description	URL	中国用户
Android (arm64-v8a, armeabi-v7a, x86_64)	Address	点此
Linux (x64)	Address	点此
macOS (x64)	Address	点此
macOS (arm64)	Address	点此
Windows (x64)	Address	点此

Note: You need to build from source for iOS.

Links for pre-built Lazarus APPs

Generating subtitles

Description	URL	中国用户
Generate subtitles (生成字幕)	Address	点此

Links for pre-trained models

Description	URL
Speech recognition (speech to text, ASR)	Address
Text-to-speech (TTS)	Address
VAD	Address
Keyword spotting	Address
Audio tagging	Address
Speaker identification (Speaker ID)	Address
Spoken language identification (Language ID)	See multi-lingual Whisper ASR models from Speech recognition
Punctuation	Address

Useful links

Documentation: https://k2-fsa.github.io/sherpa/onnx/
Bilibili 演示视频: https://search.bilibili.com/all?keyword=%E6%96%B0%E4%B8%80%E4%BB%A3Kaldi

How to reach us

Please see https://k2-fsa.github.io/sherpa/social-groups.html for 新一代 Kaldi 微信交流群 and QQ 交流群.

Languages

C++ 38.3%

Python 16.3%

Shell 7.6%

Kotlin 5.1%

JavaScript 5.1%

Other 27.4%