enginex_bi_series-sherpa-onnx/java-api-examples/README.md

# Introduction

This directory contains examples for the JAVA API of sherpa-onnx.

# Usage

## Non-streaming speaker diarization

```bash
./run-offline-speaker-diarization.sh
```

## Streaming Speech recognition

```
./run-streaming-asr-from-mic-transducer.sh
./run-streaming-decode-file-ctc.sh
./run-streaming-decode-file-ctc-hlg.sh
./run-streaming-decode-file-paraformer.sh
./run-streaming-decode-file-transducer.sh
```

## Non-Streaming Speech recognition

```bash
./run-non-streaming-decode-file-dolphin-ctc.sh
./run-non-streaming-decode-file-fire-red-asr.sh
./run-non-streaming-decode-file-moonshine.sh
./run-non-streaming-decode-file-nemo-canary.sh
./run-non-streaming-decode-file-nemo.sh
./run-non-streaming-decode-file-paraformer.sh
./run-non-streaming-decode-file-sense-voice.sh
./run-non-streaming-decode-file-tele-speech-ctc.sh
./run-non-streaming-decode-file-transducer-hotwords.sh
./run-non-streaming-decode-file-transducer.sh
./run-non-streaming-decode-file-whisper-multiple.sh
./run-non-streaming-decode-file-whisper.sh
./run-non-streaming-decode-file-zipformer-ctc.sh
```

## Non-Streaming Speech recognition with homophone replacer

```bash
./run-non-streaming-decode-file-sense-voice-with-hr.sh
```

## Non-Streaming text-to-speech

```bash
./run-non-streaming-tts-piper-en.sh
./run-non-streaming-tts-coqui-de.sh
./run-non-streaming-tts-vits-zh.sh
```

## Non-Streaming text-to-speech (Play as it is generating)

```bash
./run-non-streaming-tts-piper-en-with-callback.sh
```

## Spoken language identification

```bash
./run-spoken-language-identification-whisper.sh
```

## Add punctuations to text

The punctuation model supports both English and Chinese.

```bash
./run-add-punctuation-zh-en.sh
```

## Audio tagging

```bash
./run-audio-tagging-zipformer-from-file.sh
./run-audio-tagging-ced-from-file.sh
```

## Speaker identification

```bash
./run-speaker-identification.sh
```

## VAD with a microphone

```bash
./run-vad-from-mic.sh
```

## VAD with a microphone + Non-streaming SenseVoice for speech recognition

```bash
./run-vad-from-mic-non-streaming-sense-voice.sh
```

## VAD with a microphone + Non-streaming Paraformer for speech recognition

```bash
./run-vad-from-mic-non-streaming-paraformer.sh
```

## VAD with a microphone + Non-streaming Whisper tiny.en for speech recognition

```bash
./run-vad-from-mic-non-streaming-whisper.sh
```

## VAD (Remove silence)

```bash
./run-vad-remove-slience.sh
```

## VAD + Non-streaming Dolphin CTC for speech recognition

```bash
./run-vad-non-streaming-dolphin-ctc.sh
```

## VAD + Non-streaming SenseVoice for speech recognition

```bash
./run-vad-non-streaming-sense-voice.sh
```

## VAD + Non-streaming Paraformer for speech recognition

```bash
./run-vad-non-streaming-paraformer.sh
```

## Keyword spotter

```bash
./run-kws-from-file.sh
```
Refactor Java API (#806) 2024-04-24 18:41:48 +08:00			`# Introduction`
java decode example for microphone (#122) 2023-04-20 09:10:47 +08:00
Refactor Java API (#806) 2024-04-24 18:41:48 +08:00			`This directory contains examples for the JAVA API of sherpa-onnx.`
java decode example for microphone (#122) 2023-04-20 09:10:47 +08:00
Refactor Java API (#806) 2024-04-24 18:41:48 +08:00			`# Usage`
java decode example for microphone (#122) 2023-04-20 09:10:47 +08:00
Java API for speaker diarization (#1416) 2024-10-11 16:51:40 +08:00			`## Non-streaming speaker diarization`

			```bash
			`./run-offline-speaker-diarization.sh`
			```

Add Java API for non-streaming ASR (#807) 2024-04-24 21:03:26 +08:00			`## Streaming Speech recognition`

add java wrapper suppport (#117) 2023-04-15 22:17:28 +08:00			```
Add streaming ASR example from a microphone for Java API (#1047) 2024-06-23 19:43:53 +08:00			`./run-streaming-asr-from-mic-transducer.sh`
Refactor Java API (#806) 2024-04-24 18:41:48 +08:00			`./run-streaming-decode-file-ctc.sh`
Add CTC HLG decoding for JNI (#810) 2024-04-25 17:20:02 +08:00			`./run-streaming-decode-file-ctc-hlg.sh`
Refactor Java API (#806) 2024-04-24 18:41:48 +08:00			`./run-streaming-decode-file-paraformer.sh`
			`./run-streaming-decode-file-transducer.sh`
java decode example for microphone (#122) 2023-04-20 09:10:47 +08:00			```
Add Java API for non-streaming ASR (#807) 2024-04-24 21:03:26 +08:00
			`## Non-Streaming Speech recognition`

			```bash
Add Kotlin and Java API for Dolphin CTC models (#2086) 2025-04-02 21:16:14 +08:00			`./run-non-streaming-decode-file-dolphin-ctc.sh`
Add Java and Kotlin API for NeMo Canary models (#2359) Add support for the NeMo Canary model in both Java and Kotlin APIs, wiring it through JNI and updating examples and CI. - Introduce OfflineCanaryModelConfig in Kotlin and Java with builder patterns - Extend OfflineRecognizer to accept and apply the new canary config via setConfig - Update JNI binding (GetOfflineConfig) and getOfflineModelConfig mapping (type 32), plus examples and CI workflows 2025-07-08 13:45:26 +08:00			`./run-non-streaming-decode-file-fire-red-asr.sh`
			`./run-non-streaming-decode-file-moonshine.sh`
			`./run-non-streaming-decode-file-nemo-canary.sh`
			`./run-non-streaming-decode-file-nemo.sh`
Add Java API for non-streaming ASR (#807) 2024-04-24 21:03:26 +08:00			`./run-non-streaming-decode-file-paraformer.sh`
Add Java and Kotlin API for sense voice (#1164) 2024-07-22 14:08:40 +08:00			`./run-non-streaming-decode-file-sense-voice.sh`
Add Java and Kotlin API for NeMo Canary models (#2359) Add support for the NeMo Canary model in both Java and Kotlin APIs, wiring it through JNI and updating examples and CI. - Introduce OfflineCanaryModelConfig in Kotlin and Java with builder patterns - Extend OfflineRecognizer to accept and apply the new canary config via setConfig - Update JNI binding (GetOfflineConfig) and getOfflineModelConfig mapping (type 32), plus examples and CI workflows 2025-07-08 13:45:26 +08:00			`./run-non-streaming-decode-file-tele-speech-ctc.sh`
			`./run-non-streaming-decode-file-transducer-hotwords.sh`
Add Java API for non-streaming ASR (#807) 2024-04-24 21:03:26 +08:00			`./run-non-streaming-decode-file-transducer.sh`
Add Java and Kotlin API for NeMo Canary models (#2359) Add support for the NeMo Canary model in both Java and Kotlin APIs, wiring it through JNI and updating examples and CI. - Introduce OfflineCanaryModelConfig in Kotlin and Java with builder patterns - Extend OfflineRecognizer to accept and apply the new canary config via setConfig - Update JNI binding (GetOfflineConfig) and getOfflineModelConfig mapping (type 32), plus examples and CI workflows 2025-07-08 13:45:26 +08:00			`./run-non-streaming-decode-file-whisper-multiple.sh`
Add Java API for non-streaming ASR (#807) 2024-04-24 21:03:26 +08:00			`./run-non-streaming-decode-file-whisper.sh`
Add Java and Kotlin API for NeMo Canary models (#2359) Add support for the NeMo Canary model in both Java and Kotlin APIs, wiring it through JNI and updating examples and CI. - Introduce OfflineCanaryModelConfig in Kotlin and Java with builder patterns - Extend OfflineRecognizer to accept and apply the new canary config via setConfig - Update JNI binding (GetOfflineConfig) and getOfflineModelConfig mapping (type 32), plus examples and CI workflows 2025-07-08 13:45:26 +08:00			`./run-non-streaming-decode-file-zipformer-ctc.sh`
Add Java API for non-streaming ASR (#807) 2024-04-24 21:03:26 +08:00			```
Add Java API for text-to-speech (#811) 2024-04-26 09:26:39 +08:00
Add Kotlin and Java API for homophone replacer (#2166) * Add Kotlin API for homonphone replacer * Add Java API for homonphone replacer 2025-04-29 22:55:21 +08:00			`## Non-Streaming Speech recognition with homophone replacer`

			```bash
			`./run-non-streaming-decode-file-sense-voice-with-hr.sh`
			```
Add TTS example for Java API. (#1176) It plays the generated audio as it is still generating. 2024-07-28 12:07:19 +08:00
Add Java API for text-to-speech (#811) 2024-04-26 09:26:39 +08:00			`## Non-Streaming text-to-speech`

			```bash
			`./run-non-streaming-tts-piper-en.sh`
			`./run-non-streaming-tts-coqui-de.sh`
			`./run-non-streaming-tts-vits-zh.sh`
			```
Add Java API for spoken language identification with whisper multilingual models (#817) 2024-04-26 19:05:39 +08:00
Add TTS example for Java API. (#1176) It plays the generated audio as it is still generating. 2024-07-28 12:07:19 +08:00			`## Non-Streaming text-to-speech (Play as it is generating)`

			```bash
			`./run-non-streaming-tts-piper-en-with-callback.sh`
			```

Add Java API for spoken language identification with whisper multilingual models (#817) 2024-04-26 19:05:39 +08:00			`## Spoken language identification`

			```bash
			`./run-spoken-language-identification-whisper.sh`
			```
Add Java and Kotlin API for punctuation models (#818) 2024-04-26 22:06:48 +08:00
Add Java API for audio tagging (#820) 2024-04-28 22:26:04 +08:00			`## Add punctuations to text`
Add Java and Kotlin API for punctuation models (#818) 2024-04-26 22:06:48 +08:00
			`The punctuation model supports both English and Chinese.`

			```bash
			`./run-add-punctuation-zh-en.sh`
			```
Add Java API for audio tagging (#820) 2024-04-28 22:26:04 +08:00
			`## Audio tagging`

			```bash
			`./run-audio-tagging-zipformer-from-file.sh`
			`./run-audio-tagging-ced-from-file.sh`
			```
Add Java API for speaker identification (#822) 2024-04-29 21:23:56 +08:00
			`## Speaker identification`

			```bash
			`./run-speaker-identification.sh`
			```
Add VAD demo for Java API (#928) 2024-05-28 14:59:47 +08:00
Add VAD + microphone example for Java API. (#1045) 2024-06-23 18:34:18 +08:00			`## VAD with a microphone`

			```bash
			`./run-vad-from-mic.sh`
			```

Add Java and Kotlin API for sense voice (#1164) 2024-07-22 14:08:40 +08:00			`## VAD with a microphone + Non-streaming SenseVoice for speech recognition`

			```bash
			`./run-vad-from-mic-non-streaming-sense-voice.sh`
			```

Add VAD + Non-streaming ASR + microphone examples for Java API (#1046) 2024-06-23 19:09:21 +08:00			`## VAD with a microphone + Non-streaming Paraformer for speech recognition`

			```bash
			`./run-vad-from-mic-non-streaming-paraformer.sh`
			```

			`## VAD with a microphone + Non-streaming Whisper tiny.en for speech recognition`

			```bash
			`./run-vad-from-mic-non-streaming-whisper.sh`
			```

Add VAD demo for Java API (#928) 2024-05-28 14:59:47 +08:00			`## VAD (Remove silence)`

			```bash
			`./run-vad-remove-slience.sh`
			```

Add Kotlin and Java API for Dolphin CTC models (#2086) 2025-04-02 21:16:14 +08:00			`## VAD + Non-streaming Dolphin CTC for speech recognition`

			```bash
			`./run-vad-non-streaming-dolphin-ctc.sh`
			```

Add Java and Kotlin API for sense voice (#1164) 2024-07-22 14:08:40 +08:00			`## VAD + Non-streaming SenseVoice for speech recognition`

			```bash
			`./run-vad-non-streaming-sense-voice.sh`
			```

Add VAD demo for Java API (#928) 2024-05-28 14:59:47 +08:00			`## VAD + Non-streaming Paraformer for speech recognition`

			```bash
			`./run-vad-non-streaming-paraformer.sh`
			```
Add KWS examples for Java API (#930) 2024-05-28 15:49:54 +08:00
			`## Keyword spotter`

			```bash
			`./run-kws-from-file.sh`
			```