Commit Graph

53 Commits

Author SHA1 Message Date
Fangjun Kuang
103e93d9f6 Add Java and Kotlin API for NeMo Canary models (#2359)
Add support for the NeMo Canary model in both Java and Kotlin APIs, wiring it through
JNI and updating examples and CI.

- Introduce OfflineCanaryModelConfig in Kotlin and Java with builder patterns
- Extend OfflineRecognizer to accept and apply the new canary config via setConfig
- Update JNI binding (GetOfflineConfig) and getOfflineModelConfig mapping (type 32), 
   plus examples and CI workflows
2025-07-08 13:45:26 +08:00
Fangjun Kuang
3bf986d08d Support non-streaming zipformer CTC ASR models (#2340)
This PR adds support for non-streaming Zipformer CTC ASR models across 
multiple language bindings, WebAssembly, examples, and CI workflows.

- Introduces a new OfflineZipformerCtcModelConfig in C/C++, Python, Swift, Java, Kotlin, Go, Dart, Pascal, and C# APIs
- Updates initialization, freeing, and recognition logic to include Zipformer CTC in WASM and Node.js
- Adds example scripts and CI steps for downloading, building, and running Zipformer CTC models

Model doc is available at
https://k2-fsa.github.io/sherpa/onnx/pretrained_models/offline-ctc/icefall/zipformer.html
2025-07-04 15:57:07 +08:00
Fangjun Kuang
bda427f4b2 Add API to get version information (#2309) 2025-06-25 00:22:21 +08:00
Fangjun Kuang
e537094b07 Add Kotlin and Java API for homophone replacer (#2166)
* Add Kotlin API for homonphone replacer

* Add Java API for homonphone replacer
2025-04-29 22:55:21 +08:00
Fangjun Kuang
e3280027f9 Support decoding multiple streams in Java API. (#2149) 2025-04-25 11:18:57 +08:00
Fangjun Kuang
eee5575836 Add Kotlin and Java API for Dolphin CTC models (#2086) 2025-04-02 21:16:14 +08:00
Fangjun Kuang
0aacf02dd8 Add C++ runtime for vocos (#2014) 2025-03-17 17:05:15 +08:00
Fangjun Kuang
71824992a7 Add Java API for speech enhancement GTCRN models (#2009) 2025-03-16 15:13:20 +08:00
Fangjun Kuang
f5dfcf8d2f Add Kotlin and Java API for online punctuation models (#1936) 2025-02-27 16:52:36 +08:00
Fangjun Kuang
d148860d2c Add Kotlin and Java API for FireRedAsr AED model (#1870) 2025-02-17 10:50:25 +08:00
Fangjun Kuang
4372a7a7b0 Add Java and Koltin API for Kokoro TTS 1.0 (#1798) 2025-02-07 09:59:27 +08:00
Fangjun Kuang
8b989a851c Fix keyword spotting. (#1689)
Reset the stream right after detecting a keyword
2025-01-20 16:41:10 +08:00
Fangjun Kuang
99cef4198b Add Koltin and Java API for Kokoro TTS models (#1728) 2025-01-17 17:36:13 +08:00
Fangjun Kuang
6f085babcc Add Swift API for MatchaTTS models. (#1684) 2025-01-06 07:23:45 +08:00
Fangjun Kuang
a00d3b4821 Add Java API for Matcha-TTS models. (#1673) 2025-01-02 15:15:30 +08:00
Fangjun Kuang
bd4b223920 Add Kotlin and Java API for Moonshine models (#1474) 2024-10-26 22:30:29 +08:00
Fangjun Kuang
1af8ad89e6 Add Java API example for hotwords. (#1442) 2024-10-18 16:35:31 +08:00
Fangjun Kuang
1851ff6337 Java API for speaker diarization (#1416) 2024-10-11 16:51:40 +08:00
Fangjun Kuang
e7ffcbd677 Add APIs about max speech duration in VAD for various programming languages (#1349) 2024-09-14 12:30:13 +08:00
Fangjun Kuang
5791b695ea Pascal API for streaming ASR (#1246) 2024-08-12 19:55:51 +08:00
Fangjun Kuang
ec98110e11 Add speaker identification and verification exmaple for Dart API (#1194) 2024-07-31 13:53:52 +08:00
Fangjun Kuang
9e02f88dbb Non-streaming WebSocket client for Java. (#1190) 2024-07-30 17:21:33 +08:00
Fangjun Kuang
1f8e575133 Add TTS example for Java API. (#1176)
It plays the generated audio as it is still generating.
2024-07-28 12:07:19 +08:00
Fangjun Kuang
dd300b1de5 Add Java and Kotlin API for sense voice (#1164) 2024-07-22 14:08:40 +08:00
Fangjun Kuang
dd0ff2ca06 Support onnxruntime 1.18.0 (#906) 2024-07-10 17:05:26 +08:00
Fangjun Kuang
c2cc9dec58 Add Flush to VAD so that the last segment can be detected. (#1099) 2024-07-09 16:15:56 +08:00
Fangjun Kuang
7a663afb76 Add streaming ASR example from a microphone for Java API (#1047) 2024-06-23 19:43:53 +08:00
Fangjun Kuang
29abf242c3 Add VAD + Non-streaming ASR + microphone examples for Java API (#1046) 2024-06-23 19:09:21 +08:00
Fangjun Kuang
757a44b116 Add VAD + microphone example for Java API. (#1045) 2024-06-23 18:34:18 +08:00
Fangjun Kuang
6789c909d2 Inverse text normalization API of streaming ASR for various programming languages (#1022) 2024-06-18 13:42:17 +08:00
Fangjun Kuang
6e09933d99 Inverse text normalization API for other programming languages (#1019) 2024-06-17 17:02:39 +08:00
Fangjun Kuang
fd5a0d1e00 Add C++ runtime for Tele-AI/TeleSpeech-ASR (#970) 2024-06-05 00:26:40 +08:00
Fangjun Kuang
0196f1d546 Wrap VAD APIs to C# (#946) 2024-05-30 18:29:38 +08:00
Fangjun Kuang
5860e45b4c Add KWS examples for Java API (#930) 2024-05-28 15:49:54 +08:00
Fangjun Kuang
bcaa6df389 Add VAD demo for Java API (#928) 2024-05-28 14:59:47 +08:00
Fangjun Kuang
cff207623e Add Java API for speaker identification (#822) 2024-04-29 21:23:56 +08:00
Fangjun Kuang
88202f05bb Add Java API for audio tagging (#820) 2024-04-28 22:26:04 +08:00
Fangjun Kuang
5407f880c0 Add Java and Kotlin API for punctuation models (#818) 2024-04-26 22:06:48 +08:00
Fangjun Kuang
db25986240 Add Java API for spoken language identification with whisper multilingual models (#817) 2024-04-26 19:05:39 +08:00
Fangjun Kuang
15772d2150 Add Java API for text-to-speech (#811) 2024-04-26 09:26:39 +08:00
Fangjun Kuang
f7b3735621 Add CTC HLG decoding for JNI (#810) 2024-04-25 17:20:02 +08:00
Fangjun Kuang
83cd533f67 Add Java API for non-streaming ASR (#807) 2024-04-24 21:03:26 +08:00
Fangjun Kuang
c3a2e8a67c Refactor Java API (#806) 2024-04-24 18:41:48 +08:00
Fangjun Kuang
fb04366179 Fix #608 (#610)
Fix java tests.
2024-02-26 13:49:37 +08:00
Wei Kang
d7eab95439 Add java api for hotwords (#319)
* Add java api

* support websocket

* Fix kotlin
2023-09-18 22:44:29 +08:00
Wei Kang
4dfc11066a format to linux file (\r\n -> \n) (#320) 2023-09-18 16:57:12 +08:00
zhaomingwork
3ab135c1eb update Makefile for paraformer java (#277) 2023-08-16 22:11:50 +08:00
zhaomingwork
256a8ecb50 update java for paraformer (#276) 2023-08-16 20:16:51 +08:00
zhaomingwork
5c9aebd583 For java ci actions (#231) 2023-08-05 10:32:48 +08:00
zhaomingwork
5a549615df Java api update for adding modelType in config class (#228) 2023-07-30 17:04:18 +08:00