EngineX-Iluvatar/enginex_bi_series-sherpa-onnx: *此项目已归档，勿使用* - enginex_bi_series-sherpa-onnx - Gitea: Git with a cup of tea

EngineX-Iluvatar/enginex_bi_series-sherpa-onnx

Archived

This repository has been archived on 2025-08-26. You can view files and clone it, but cannot push or open issues or pull requests.

Go to file

Fangjun Kuang c2dce19140 Update README to include Rust. (#1212 )

2024-08-04 12:20:05 +08:00

describe how to add new words for MeloTTS models (#1209 )

2024-08-03 11:19:02 +08:00

Enable to stop TTS generation (#1041 )

2024-06-22 18:18:36 +08:00

Refactor C API to prefix each API with SherpaOnnx. (#1171 )

2024-07-26 18:47:02 +08:00

feat: add directml support (#1153 )

2024-07-22 23:50:48 +08:00

dart-api-examples

Add speaker identification and verification exmaple for Dart API (#1194 )

2024-07-31 13:53:52 +08:00

dotnet-examples

Add test about whisper large-v3 for .Net (#1187 )

2024-07-29 20:49:38 +08:00

ffmpeg-examples

Fix ffmpeg c api example (#1185 )

2024-07-29 14:27:55 +08:00

Add speaker identification and verification exmaple for Dart API (#1194 )

2024-07-31 13:53:52 +08:00

flutter-examples

Add Chinese+English tts example for flutter (#1192 )

2024-07-30 18:38:43 +08:00

go-api-examples

Add Go API for SenseVoice (#1154 )

2024-07-20 23:41:53 +08:00

Revert to onnxruntime 1.17.1 (#1131 )

2024-07-15 14:24:08 +08:00

Revert to onnxruntime 1.17.1 (#1131 )

2024-07-15 14:24:08 +08:00

java-api-examples

Add speaker identification and verification exmaple for Dart API (#1194 )

2024-07-31 13:53:52 +08:00

kotlin-api-examples

Add Java and Kotlin API for sense voice (#1164 )

2024-07-22 14:08:40 +08:00

fix building MFC examples (#1178 )

2024-07-28 14:07:25 +08:00

nodejs-addon-examples

Dart API for adding punctuations to text (#1182 )

2024-07-29 12:41:52 +08:00

nodejs-examples

Add WebAssembly for SenseVoice (#1158 )

2024-07-21 15:39:55 +08:00

python-api-examples

Add more Python examples for SenseVoice (#1179 )

2024-07-28 21:54:38 +08:00

rust-api-examples

Update README to include Rust. (#1212 )

2024-08-04 12:20:05 +08:00

describe how to add new words for MeloTTS models (#1209 )

2024-08-03 11:19:02 +08:00

Remove libonnxruntime_providers_cuda.so as a dependency. (#1210 )

2024-08-03 16:25:23 +08:00

swift-api-examples

Refactor C API to prefix each API with SherpaOnnx. (#1171 )

2024-07-26 18:47:02 +08:00

Support RISC-V (#609 )

2024-02-26 06:57:18 +08:00

Refactor C API to prefix each API with SherpaOnnx. (#1171 )

2024-07-26 18:47:02 +08:00

.clang-format

add java wrapper suppport (#117 )

2023-04-15 22:17:28 +08:00

.clang-tidy

Support clang-tidy (#1034 )

2024-06-19 20:51:57 +08:00

.flake8

add offline websocket server/client (#98 )

2023-03-29 21:48:45 +08:00

.gitignore

Add VAD + Non-streaming ASR example for JavaScript API. (#1170 )

2024-07-26 12:42:08 +08:00

build-aarch64-linux-gnu.sh

Fix the alsa-lib version to v1.2.12 (#1048 )

2024-06-23 20:20:38 +08:00

build-android-arm64-v8a.sh

Revert to onnxruntime 1.17.1 (#1131 )

2024-07-15 14:24:08 +08:00

build-android-armv7-eabi.sh

Revert to onnxruntime 1.17.1 (#1131 )

2024-07-15 14:24:08 +08:00

build-android-x86-64.sh

Revert to onnxruntime 1.17.1 (#1131 )

2024-07-15 14:24:08 +08:00

build-android-x86.sh

Revert to onnxruntime 1.17.1 (#1131 )

2024-07-15 14:24:08 +08:00

build-arm-linux-gnueabihf.sh

Fix the alsa-lib version to v1.2.12 (#1048 )

2024-06-23 20:20:38 +08:00

build-ios-no-tts.sh

Revert to onnxruntime 1.17.1 (#1131 )

2024-07-15 14:24:08 +08:00

build-ios-shared.sh

Revert to onnxruntime 1.17.1 (#1131 )

2024-07-15 14:24:08 +08:00

build-ios.sh

Revert to onnxruntime 1.17.1 (#1131 )

2024-07-15 14:24:08 +08:00

build-riscv64-linux-gnu.sh

Fix the alsa-lib version to v1.2.12 (#1048 )

2024-06-23 20:20:38 +08:00

build-swift-macos.sh

Fix CI errors. (#993 )

2024-06-12 11:42:19 +08:00

build-wasm-simd-asr.sh

Add WebAssembly for ASR (#604 )

2024-02-23 17:39:11 +08:00

build-wasm-simd-kws.sh

small fixes to wasm kws. (#672 )

2024-03-18 15:28:10 +08:00

build-wasm-simd-nodejs.sh

return timestamps for WebAssembly (#737 )

2024-04-05 20:24:27 +08:00

build-wasm-simd-tts.sh

Add WebAssembly for ASR (#604 )

2024-02-23 17:39:11 +08:00

CHANGELOG.md

Dart API for adding punctuations to text (#1182 )

2024-07-29 12:41:52 +08:00

CMakeLists.txt

Dart API for adding punctuations to text (#1182 )

2024-07-29 12:41:52 +08:00

CPPLINT.cfg

Use static libraries for MFC examples (#210 )

2023-07-13 14:52:43 +08:00

LICENSE

Use standard apache 2.0 license (#53 )

2023-02-22 11:30:46 +08:00

MANIFEST.in

Fix building wheels from source. (#632 )

2024-03-04 16:39:51 +08:00

README.md

Update README to include Rust. (#1212 )

2024-08-04 12:20:05 +08:00

release.sh

Publish pre-compiled libs for Android. (#217 )

2023-07-15 12:25:18 +08:00

setup.py

Provide pre-built wheels with CUDA support. (#1143 )

2024-07-17 22:59:13 +08:00

README.md

Supported functions

Speech recognition	Speech synthesis	Speaker verification	Speaker identification
✔️	✔️	✔️	✔️

Spoken Language identification	Audio tagging	Voice activity detection
✔️	✔️	✔️

Keyword spotting	Add punctuation
✔️	✔️

Supported platforms

Architecture	Android	iOS	Windows	macOS	linux
x64	✔️		✔️	✔️	✔️
x86	✔️		✔️
arm64	✔️	✔️	✔️	✔️	✔️
arm32	✔️				✔️
riscv64					✔️

Supported programming languages

1. C++	2. C	3. Python	4. C#	5. Java	6. JavaScript
✔️	✔️	✔️	✔️	✔️	✔️

7. Kotlin	8. Swift	9. Go	10. Dart	11. Rust
✔️	✔️	✔️	✔️	✔️

For Rust support, please see https://github.com/thewh1teagle/sherpa-rs

It also supports WebAssembly.

Introduction

This repository supports running the following functions locally

Speech-to-text (i.e., ASR); both streaming and non-streaming are supported
Text-to-speech (i.e., TTS)
Speaker identification
Speaker verification
Spoken language identification
Audio tagging
VAD (e.g., silero-vad)
Keyword spotting

on the following platforms and operating systems:

x86, x86_64, 32-bit ARM, 64-bit ARM (arm64, aarch64), RISC-V (riscv64)
Linux, macOS, Windows, openKylin
Android, WearOS
iOS
NodeJS
WebAssembly
Raspberry Pi
RV1126
LicheePi4A
VisionFive 2
旭日X3派
etc

with the following APIs

C++, C, Python, Go, C#
Java, Kotlin, JavaScript
Swift
Dart

Links for pre-built Android APKs

Description	URL	中国用户
Streaming speech recognition	Address	点此
Text-to-speech	Address	点此
Voice activity detection (VAD)	Address	点此
VAD + non-streaming speech recognition	Address	点此
Two-pass speech recognition	Address	点此
Audio tagging	Address	点此
Audio tagging (WearOS)	Address	点此
Speaker identification	Address	点此
Spoken language identification	Address	点此
Keyword spotting	Address	点此

Links for pre-built Flutter APPs

Real-time speech recognition

Description	URL	中国用户
Streaming speech recognition	Address	点此

Text-to-speech

Description	URL	中国用户
Android (arm64-v8a, armeabi-v7a, x86_64)	Address	点此
Linux (x64)	Address	点此
macOS (x64)	Address	点此
macOS (arm64)	Address	点此
Windows (x64)	Address	点此

Note: You need to build from source for iOS.

Links for pre-trained models

Description	URL
Speech recognition (speech to text, ASR)	Address
Text-to-speech (TTS)	Address
VAD	Address
Keyword spotting	Address
Audio tagging	Address
Speaker identification (Speaker ID)	Address
Spoken language identification (Language ID)	See multi-lingual Whisper ASR models from Speech recognition
Punctuation	Address

Useful links

Documentation: https://k2-fsa.github.io/sherpa/onnx/
Bilibili 演示视频: https://search.bilibili.com/all?keyword=%E6%96%B0%E4%B8%80%E4%BB%A3Kaldi

How to reach us

Please see https://k2-fsa.github.io/sherpa/social-groups.html for 新一代 Kaldi 微信交流群 and QQ 交流群.

Languages

C++ 38.3%

Python 16.3%

Shell 7.6%

Kotlin 5.1%

JavaScript 5.1%

Other 27.4%