This repository has been archived on 2025-08-26. You can view files and clone it, but cannot push or open issues or pull requests.
# Introduction
This directory contains examples for the Java API of sherpa-onnx.
# Usage
## Streaming speech recognition
```bash
./run-streaming-decode-file-ctc.sh
./run-streaming-decode-file-ctc-hlg.sh
./run-streaming-decode-file-paraformer.sh
./run-streaming-decode-file-transducer.sh
```
## Non-streaming speech recognition
```bash
./run-non-streaming-decode-file-paraformer.sh
./run-non-streaming-decode-file-transducer.sh
./run-non-streaming-decode-file-whisper.sh
./run-non-streaming-decode-file-nemo.sh
```
## Non-streaming text-to-speech
```bash
./run-non-streaming-tts-piper-en.sh
./run-non-streaming-tts-coqui-de.sh
./run-non-streaming-tts-vits-zh.sh
```
## Spoken language identification
```bash
./run-spoken-language-identification-whisper.sh
```
## Add punctuation to text
The punctuation model supports both English and Chinese.
```bash
./run-add-punctuation-zh-en.sh
```
## Audio tagging
```bash
./run-audio-tagging-zipformer-from-file.sh
./run-audio-tagging-ced-from-file.sh
```
## Speaker identification
```bash
./run-speaker-identification.sh
```
## VAD (Remove silence)
```bash
./run-vad-remove-slience.sh
```
## VAD + Non-streaming Paraformer for speech recognition
```bash
./run-vad-non-streaming-paraformer.sh
```
## Keyword spotter
```bash
./run-kws-from-file.sh
```
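
## Running several examples in a row

The scripts above are run one at a time. If you want to try a whole group in one go, a small wrapper like the following may help. This is only a sketch: `run_examples` is a hypothetical helper, not part of this directory, and it assumes each script can be run from the current directory and signals failure through a non-zero exit code.

```bash
#!/usr/bin/env bash

# run_examples (hypothetical helper): run each script passed as an argument,
# print one status line per script, and return the number of failed scripts.
run_examples() {
  local failed=0
  local script
  for script in "$@"; do
    # Skip anything that is missing or not executable instead of aborting.
    if [ ! -x "$script" ]; then
      echo "SKIP $script (not found or not executable)"
      continue
    fi
    if "$script"; then
      echo "OK   $script"
    else
      echo "FAIL $script"
      failed=$((failed + 1))
    fi
  done
  return "$failed"
}
```

For example, after sourcing the function, `run_examples ./run-streaming-decode-file-*.sh` would try every streaming example listed above and report which ones ran successfully.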