EngineX-Iluvatar/enginex_bi_series-sherpa-onnx

Archived

This repository has been archived on 2025-08-26. You can view files and clone it, but cannot push or open issues or pull requests.

Go to file

Karel Vesely 38c072dcb2 Track token scores (#571 )

* add export of per-token scores (ys, lm, context)

- for best path of the modified-beam-search decoding of transducer

* refactoring JSON export of OnlineRecognitionResult, extending pybind11 API of OnlineRecognitionResult

* export per-token scores also for greedy-search (online-transducer)

- export un-scaled lm_probs (modified-beam search, online-transducer)
- polishing

* fill lm_probs/context_scores only if LM/ContextGraph is present (make Result smaller)

2024-02-29 06:28:45 +08:00

Support RISC-V (#609 )

2024-02-26 06:57:18 +08:00

Add more Chinese TTS models (Mandarin and Cantonese) (#589 )

2024-02-20 15:05:35 +08:00

Use piper-phonemize to convert text to token IDs (#453 )

2023-11-30 23:57:43 +08:00

Use hub.nuaa.cf to replace huggingface URL to download dependencies. (#614 )

2024-02-28 17:48:51 +08:00

dotnet-examples

Use curl to replace wget for Windows. (#558 )

2024-01-29 10:46:34 +08:00

ffmpeg-examples

Fix typos in .Net APIs (#156 )

2023-05-14 22:32:01 +08:00

go-api-examples

Support streaming zipformer CTC (#496 )

2023-12-22 13:46:33 +08:00

Use piper-phonemize to convert text to token IDs (#453 )

2023-11-30 23:57:43 +08:00

Use piper-phonemize to convert text to token IDs (#453 )

2023-11-30 23:57:43 +08:00

java-api-examples

Fix #608 (#610 )

2024-02-26 13:49:37 +08:00

kotlin-api-examples

Fix CI tests for Python and JNI. (#554 )

2024-01-27 13:01:54 +08:00

Support Ukrainian VITS models from coqui-ai/TTS (#469 )

2023-12-06 19:37:11 +08:00

nodejs-examples

Support streaming zipformer CTC (#496 )

2023-12-22 13:46:33 +08:00

python-api-examples

add blank_penalty for online transducer (#548 )

2024-01-26 12:12:13 +08:00

Add more Chinese TTS models (Mandarin and Cantonese) (#589 )

2024-02-20 15:05:35 +08:00

Track token scores (#571 )

2024-02-29 06:28:45 +08:00

swift-api-examples

Add context biasing for mobile (#568 )

2024-02-01 21:33:22 +08:00

Support RISC-V (#609 )

2024-02-26 06:57:18 +08:00

Add WebAssembly for ASR (#604 )

2024-02-23 17:39:11 +08:00

.clang-format

add java wrapper suppport (#117 )

2023-04-15 22:17:28 +08:00

.flake8

add offline websocket server/client (#98 )

2023-03-29 21:48:45 +08:00

.gitignore

Track token scores (#571 )

2024-02-29 06:28:45 +08:00

build-aarch64-linux-gnu.sh

Support piper-phonemize (#452 )

2023-11-28 19:12:58 +08:00

build-android-arm64-v8a.sh

Download android onnxruntime libs from github. (#584 )

2024-02-19 10:32:58 +08:00

build-android-armv7-eabi.sh

Download android onnxruntime libs from github. (#584 )

2024-02-19 10:32:58 +08:00

build-android-x86-64.sh

Download android onnxruntime libs from github. (#584 )

2024-02-19 10:32:58 +08:00

build-android-x86.sh

Download android onnxruntime libs from github. (#584 )

2024-02-19 10:32:58 +08:00

build-apk-two-pass.sh

Fix whisper test script for the latest onnxruntime. (#494 )

2023-12-20 11:12:12 +08:00

build-apk-vad.sh

Add Android APK for Silero VAD (#335 )

2023-09-23 20:39:13 +08:00

build-apk.sh

Release pre-built APKs (#285 )

2023-08-18 14:28:44 +08:00

build-arm-linux-gnueabihf.sh

Support piper-phonemize (#452 )

2023-11-28 19:12:58 +08:00

build-ios.sh

Download ios-onnxruntime from github instead of huggingface. (#593 )

2024-02-21 10:51:41 +08:00

build-kws-apk.sh

change modelscope link to github for build-kws-apki (#540 )

2024-01-24 16:40:14 +08:00

build-riscv64-linux-gnu.sh

Support RISC-V (#609 )

2024-02-26 06:57:18 +08:00

build-swift-macos.sh

Use piper-phonemize to convert text to token IDs (#453 )

2023-11-30 23:57:43 +08:00

build-wasm-simd-asr.sh

Add WebAssembly for ASR (#604 )

2024-02-23 17:39:11 +08:00

build-wasm-simd-tts.sh

Add WebAssembly for ASR (#604 )

2024-02-23 17:39:11 +08:00

CMakeLists.txt

Add WebAssembly for ASR (#604 )

2024-02-23 17:39:11 +08:00

CPPLINT.cfg

Use static libraries for MFC examples (#210 )

2023-07-13 14:52:43 +08:00

LICENSE

Use standard apache 2.0 license (#53 )

2023-02-22 11:30:46 +08:00

README.md

Update README (#572 )

2024-02-03 09:20:08 +08:00

release.sh

Publish pre-compiled libs for Android. (#217 )

2023-07-15 12:25:18 +08:00

setup.py

Support using alsa to access the microphone with non-streaming ASR models (#517 )

2024-02-26 21:17:26 +08:00

README.md

Introduction

This repository supports running the following functions locally

Speech-to-text (i.e., ASR)
Text-to-speech (i.e., TTS)
Speaker identification

on the following platforms and operating systems:

Linux, macOS, Windows
Android
iOS
Raspberry Pi
etc

Useful links

Documentation: https://k2-fsa.github.io/sherpa/onnx/
APK for the text-to-speech engine: https://k2-fsa.github.io/sherpa/onnx/tts/apk-engine.html
APK for speaker identification: https://k2-fsa.github.io/sherpa/onnx/speaker-identification/apk.html

How to reach us

Please see https://k2-fsa.github.io/sherpa/social-groups.html for 新一代 Kaldi 微信交流群 and QQ 交流群.

Languages

C++ 38.3%

Python 16.3%

Shell 7.6%

Kotlin 5.1%

JavaScript 5.1%

Other 27.4%