This repository has been archived on 2025-08-26 . You can view files and clone it, but cannot push or open issues or pull requests.
38c072dcb26320d02b84a477340067d5a1a6f017
* add export of per-token scores (ys, lm, context) - for best path of the modified-beam-search decoding of transducer * refactoring JSON export of OnlineRecognitionResult, extending pybind11 API of OnlineRecognitionResult * export per-token scores also for greedy-search (online-transducer) - export un-scaled lm_probs (modified-beam search, online-transducer) - polishing * fill lm_probs/context_scores only if LM/ContextGraph is present (make Result smaller)
Introduction
This repository supports running the following functions locally
- Speech-to-text (i.e., ASR)
- Text-to-speech (i.e., TTS)
- Speaker identification
on the following platforms and operating systems:
- Linux, macOS, Windows
- Android
- iOS
- Raspberry Pi
- etc
Useful links
- Documentation: https://k2-fsa.github.io/sherpa/onnx/
- APK for the text-to-speech engine: https://k2-fsa.github.io/sherpa/onnx/tts/apk-engine.html
- APK for speaker identification: https://k2-fsa.github.io/sherpa/onnx/speaker-identification/apk.html
How to reach us
Please see https://k2-fsa.github.io/sherpa/social-groups.html for 新一代 Kaldi 微信交流群 and QQ 交流群.
Description
Languages
C++
38.3%
Python
16.3%
Shell
7.6%
Kotlin
5.1%
JavaScript
5.1%
Other
27.4%