enginex_bi_series-sherpa-onnx

EngineX-Iluvatar/enginex_bi_series-sherpa-onnx


This repository has been archived on 2025-08-26. You can view files and clone it, but cannot push or open issues or pull requests.


Fangjun Kuang 803c02db0a publish all pre-built wheels to huggingface (#1142 )

pypi.org provides only 10GB of free space for open-source projects.

Each new release of sherpa-onnx occupies about 800MB, so we have to delete previous releases; otherwise, pypi.org refuses to accept new releases because of the storage limit.

To let users install previous versions, we also publish wheels to huggingface and users can find them at

https://k2-fsa.github.io/sherpa/onnx/cpu.html
and
https://k2-fsa.github.io/sherpa/onnx/cpu-cn.html (for users without access to huggingface.co)

2024-07-17 14:41:27 +08:00
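
The storage arithmetic and the install step described in the commit message above can be sketched roughly as follows. This is only an illustration: it assumes the two pages listed above can be passed to pip's `--find-links` option, and the version string used in the example is a placeholder, not a recommendation. The helper names are hypothetical.

```python
import sys

# Figures quoted in the commit message above.
PYPI_QUOTA_MB = 10 * 1000   # "10GB of free space"
RELEASE_SIZE_MB = 800       # "about 800MB" per release

# Wheel index pages named in the commit message.
WHEEL_INDEX = "https://k2-fsa.github.io/sherpa/onnx/cpu.html"
WHEEL_INDEX_CN = "https://k2-fsa.github.io/sherpa/onnx/cpu-cn.html"


def releases_that_fit(quota_mb=PYPI_QUOTA_MB, release_mb=RELEASE_SIZE_MB):
    """Number of whole releases that fit in the quota."""
    return quota_mb // release_mb


def pip_install_command(version, use_cn_mirror=False):
    """Build a pip command that looks for wheels on one of the index pages.

    Assumption: the pages are usable with pip's --find-links option.
    """
    index = WHEEL_INDEX_CN if use_cn_mirror else WHEEL_INDEX
    return [
        sys.executable, "-m", "pip", "install",
        f"sherpa-onnx=={version}",
        "--find-links", index,
    ]


if __name__ == "__main__":
    # Only about a dozen releases fit on pypi.org at once.
    print(releases_that_fit())
    # "1.10.16" is a placeholder version used for illustration.
    print(" ".join(pip_install_command("1.10.16")))
```

To actually run the install, the returned list could be passed to `subprocess.run(cmd, check=True)`.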

.github

publish all pre-built wheels to huggingface (#1142 )

2024-07-17 14:41:27 +08:00

android

Enable to stop TTS generation (#1041 )

2024-06-22 18:18:36 +08:00

c-api-examples

Build sherpa-onnx as a single shared library (#1078 )

2024-07-06 16:41:54 +08:00

cmake

Revert to onnxruntime 1.17.1 (#1131 )

2024-07-15 14:24:08 +08:00

dart-api-examples

Add C++ runtime for MeloTTS (#1138 )

2024-07-16 15:55:02 +08:00

dotnet-examples

Add microphone example for .Net keyword spotting (#1120 )

2024-07-13 14:56:39 +08:00

ffmpeg-examples

Build sherpa-onnx as a single shared library (#1078 )

2024-07-06 16:41:54 +08:00

flutter

Add C++ runtime for MeloTTS (#1138 )

2024-07-16 15:55:02 +08:00

flutter-examples

Add C++ runtime for MeloTTS (#1138 )

2024-07-16 15:55:02 +08:00

go-api-examples

Fix publishing apks to huggingface (#1121 )

2024-07-13 16:14:00 +08:00

ios-swift

Revert to onnxruntime 1.17.1 (#1131 )

2024-07-15 14:24:08 +08:00

ios-swiftui

Revert to onnxruntime 1.17.1 (#1131 )

2024-07-15 14:24:08 +08:00

java-api-examples

Support onnxruntime 1.18.0 (#906 )

2024-07-10 17:05:26 +08:00

kotlin-api-examples

Support onnxruntime 1.18.0 (#906 )

2024-07-10 17:05:26 +08:00

mfc-examples

Support onnxruntime 1.18.0 (#906 )

2024-07-10 17:05:26 +08:00

nodejs-addon-examples

Provide npm package for 32-bit Windows x86 (#1141 )

2024-07-17 12:33:15 +08:00

nodejs-examples

Support onnxruntime 1.18.0 (#906 )

2024-07-10 17:05:26 +08:00

python-api-examples

Fix publishing apks to huggingface (#1121 )

2024-07-13 16:14:00 +08:00

scripts

Provide npm package for 32-bit Windows x86 (#1141 )

2024-07-17 12:33:15 +08:00

sherpa-onnx

Fix hotwords OOV log (#1139 )

2024-07-16 19:41:31 +08:00

swift-api-examples

Add Swift API for adding punctuations to text. (#1132 )

2024-07-15 15:30:40 +08:00

toolchains

Support RISC-V (#609 )

2024-02-26 06:57:18 +08:00

wasm

Inverse text normalization API of streaming ASR for various programming languages (#1022 )

2024-06-18 13:42:17 +08:00

.clang-format

add java wrapper suppport (#117 )

2023-04-15 22:17:28 +08:00

.clang-tidy

Support clang-tidy (#1034 )

2024-06-19 20:51:57 +08:00

.flake8

add offline websocket server/client (#98 )

2023-03-29 21:48:45 +08:00

.gitignore

Support onnxruntime 1.18.0 (#906 )

2024-07-10 17:05:26 +08:00

build-aarch64-linux-gnu.sh

Fix the alsa-lib version to v1.2.12 (#1048 )

2024-06-23 20:20:38 +08:00

build-android-arm64-v8a.sh

Revert to onnxruntime 1.17.1 (#1131 )

2024-07-15 14:24:08 +08:00

build-android-armv7-eabi.sh

Revert to onnxruntime 1.17.1 (#1131 )

2024-07-15 14:24:08 +08:00

build-android-x86-64.sh

Revert to onnxruntime 1.17.1 (#1131 )

2024-07-15 14:24:08 +08:00

build-android-x86.sh

Revert to onnxruntime 1.17.1 (#1131 )

2024-07-15 14:24:08 +08:00

build-arm-linux-gnueabihf.sh

Fix the alsa-lib version to v1.2.12 (#1048 )

2024-06-23 20:20:38 +08:00

build-ios-no-tts.sh

Revert to onnxruntime 1.17.1 (#1131 )

2024-07-15 14:24:08 +08:00

build-ios-shared.sh

Revert to onnxruntime 1.17.1 (#1131 )

2024-07-15 14:24:08 +08:00

build-ios.sh

Revert to onnxruntime 1.17.1 (#1131 )

2024-07-15 14:24:08 +08:00

build-riscv64-linux-gnu.sh

Fix the alsa-lib version to v1.2.12 (#1048 )

2024-06-23 20:20:38 +08:00

build-swift-macos.sh

Fix CI errors. (#993 )

2024-06-12 11:42:19 +08:00

build-wasm-simd-asr.sh

Add WebAssembly for ASR (#604 )

2024-02-23 17:39:11 +08:00

build-wasm-simd-kws.sh

small fixes to wasm kws. (#672 )

2024-03-18 15:28:10 +08:00

build-wasm-simd-nodejs.sh

return timestamps for WebAssembly (#737 )

2024-04-05 20:24:27 +08:00

build-wasm-simd-tts.sh

Add WebAssembly for ASR (#604 )

2024-02-23 17:39:11 +08:00

CHANGELOG.md

Add C++ runtime for MeloTTS (#1138 )

2024-07-16 15:55:02 +08:00

CMakeLists.txt

Add C++ runtime for MeloTTS (#1138 )

2024-07-16 15:55:02 +08:00

CPPLINT.cfg

Use static libraries for MFC examples (#210 )

2023-07-13 14:52:43 +08:00

LICENSE

Use standard apache 2.0 license (#53 )

2023-02-22 11:30:46 +08:00

MANIFEST.in

Fix building wheels from source. (#632 )

2024-03-04 16:39:51 +08:00

README.md

Fix Flutter TTS example for iOS (#1090 )

2024-07-08 15:22:09 +08:00

release.sh

Publish pre-compiled libs for Android. (#217 )

2023-07-15 12:25:18 +08:00

setup.py

Support spoken language identification with whisper (#694 )

2024-03-24 22:57:00 +08:00

README.md

Supported functions

Speech recognition	Speech synthesis	Speaker verification	Speaker identification
✔️	✔️	✔️	✔️

Spoken Language identification	Audio tagging	Voice activity detection	Keyword spotting
✔️	✔️	✔️	✔️

Supported platforms

Architecture	Android	iOS	Windows	macOS	Linux
x64	✔️		✔️	✔️	✔️
x86	✔️		✔️
arm64	✔️	✔️	✔️	✔️	✔️
arm32	✔️				✔️
riscv64					✔️

Supported programming languages

C++	C	Python	C#	Java	JavaScript	Kotlin	Swift	Go	Dart
✔️	✔️	✔️	✔️	✔️	✔️	✔️	✔️	✔️	✔️

It also supports WebAssembly.

Introduction

This repository supports running the following functions locally:

Speech-to-text (i.e., ASR); both streaming and non-streaming are supported
Text-to-speech (i.e., TTS)
Speaker identification
Speaker verification
Spoken language identification
Audio tagging
VAD (e.g., silero-vad)
Keyword spotting

on the following platforms and operating systems:

x86, x86_64, 32-bit ARM, 64-bit ARM (arm64, aarch64), RISC-V (riscv64)
Linux, macOS, Windows, openKylin
Android, WearOS
iOS
Node.js
WebAssembly
Raspberry Pi
RV1126
LicheePi4A
VisionFive 2
Sunrise X3 Pi (旭日X3派)
etc.

with the following APIs:

C++, C, Python, Go, C#
Java, Kotlin, JavaScript
Swift
Dart
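
As a rough illustration of the Python API for non-streaming speech-to-text listed above, the sketch below reads a 16-bit mono WAV file and passes it to an offline recognizer. It assumes the `sherpa_onnx` and `numpy` packages are installed and that a transducer model has been downloaded. The model file paths are placeholders supplied by the caller, and `read_wave` and `transcribe` are hypothetical helper names. The calls shown follow the pattern used in `python-api-examples`, but check that directory for the authoritative usage.

```python
import array
import sys
import wave


def read_wave(path):
    """Read a 16-bit mono WAV file; return (sample_rate, samples in [-1, 1))."""
    with wave.open(path, "rb") as f:
        assert f.getnchannels() == 1, "expected mono audio"
        assert f.getsampwidth() == 2, "expected 16-bit samples"
        sample_rate = f.getframerate()
        pcm = array.array("h")
        pcm.frombytes(f.readframes(f.getnframes()))
    if sys.byteorder == "big":
        pcm.byteswap()  # WAV sample data is little-endian
    return sample_rate, [s / 32768.0 for s in pcm]


def transcribe(wav_path, encoder, decoder, joiner, tokens):
    """Non-streaming ASR with a transducer model (all paths are placeholders)."""
    # Imported lazily so read_wave() can be used without these packages.
    import numpy as np
    import sherpa_onnx

    recognizer = sherpa_onnx.OfflineRecognizer.from_transducer(
        encoder=encoder,
        decoder=decoder,
        joiner=joiner,
        tokens=tokens,
        num_threads=1,
    )
    sample_rate, samples = read_wave(wav_path)
    stream = recognizer.create_stream()
    stream.accept_waveform(sample_rate, np.asarray(samples, dtype=np.float32))
    recognizer.decode_stream(stream)
    return stream.result.text
```

Streaming recognition uses a different recognizer class and feeds audio in chunks; this sketch covers only the non-streaming case.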

Links for pre-built Android APKs

Description	URL	For users in China
Streaming speech recognition	Address	Click here
Text-to-speech	Address	Click here
Voice activity detection (VAD)	Address	Click here
VAD + non-streaming speech recognition	Address	Click here
Two-pass speech recognition	Address	Click here
Audio tagging	Address	Click here
Audio tagging (WearOS)	Address	Click here
Speaker identification	Address	Click here
Spoken language identification	Address	Click here
Keyword spotting	Address	Click here

Links for pre-built Flutter apps

Real-time speech recognition

Description	URL	For users in China
Streaming speech recognition	Address	Click here

Text-to-speech

Description	URL	For users in China
Android (arm64-v8a, armeabi-v7a, x86_64)	Address	Click here
Linux (x64)	Address	Click here
macOS (x64)	Address	Click here
macOS (arm64)	Address	Click here
Windows (x64)	Address	Click here

Note: You need to build from source for iOS.

Links for pre-trained models

Description	URL
Speech recognition (speech to text, ASR)	Address
Text-to-speech (TTS)	Address
VAD	Address
Keyword spotting	Address
Audio tagging	Address
Speaker identification (Speaker ID)	Address
Spoken language identification (Language ID)	See multi-lingual Whisper ASR models from Speech recognition
Punctuation	Address

Useful links

Documentation: https://k2-fsa.github.io/sherpa/onnx/
Bilibili demo videos: https://search.bilibili.com/all?keyword=%E6%96%B0%E4%B8%80%E4%BB%A3Kaldi

How to reach us

Please see https://k2-fsa.github.io/sherpa/social-groups.html for the Next-gen Kaldi (新一代 Kaldi) WeChat and QQ discussion groups.

Languages

C++ 38.3%

Python 16.3%

Shell 7.6%

Kotlin 5.1%

JavaScript 5.1%

Other 27.4%