Commit Graph

33 Commits

Author SHA1 Message Date
Fangjun Kuang
103e93d9f6 Add Java and Kotlin API for NeMo Canary models (#2359)
Add support for the NeMo Canary model in both Java and Kotlin APIs, wiring it through
JNI and updating examples and CI.

- Introduce OfflineCanaryModelConfig in Kotlin and Java with builder patterns
- Extend OfflineRecognizer to accept and apply the new canary config via setConfig
- Update JNI binding (GetOfflineConfig) and getOfflineModelConfig mapping (type 32), plus examples and CI workflows
2025-07-08 13:45:26 +08:00
Fangjun Kuang
3bf986d08d Support non-streaming zipformer CTC ASR models (#2340)
This PR adds support for non-streaming Zipformer CTC ASR models across 
multiple language bindings, WebAssembly, examples, and CI workflows.

- Introduces a new OfflineZipformerCtcModelConfig in C/C++, Python, Swift, Java, Kotlin, Go, Dart, Pascal, and C# APIs
- Updates initialization, freeing, and recognition logic to include Zipformer CTC in WASM and Node.js
- Adds example scripts and CI steps for downloading, building, and running Zipformer CTC models

Model doc is available at
https://k2-fsa.github.io/sherpa/onnx/pretrained_models/offline-ctc/icefall/zipformer.html
2025-07-04 15:57:07 +08:00
Fangjun Kuang
bda427f4b2 Add API to get version information (#2309) 2025-06-25 00:22:21 +08:00
Fangjun Kuang
e537094b07 Add Kotlin and Java API for homophone replacer (#2166)
* Add Kotlin API for homophone replacer

* Add Java API for homophone replacer
2025-04-29 22:55:21 +08:00
Fangjun Kuang
e3280027f9 Support decoding multiple streams in Java API. (#2149) 2025-04-25 11:18:57 +08:00
Fangjun Kuang
eee5575836 Add Kotlin and Java API for Dolphin CTC models (#2086) 2025-04-02 21:16:14 +08:00
Fangjun Kuang
0aacf02dd8 Add C++ runtime for vocos (#2014) 2025-03-17 17:05:15 +08:00
Fangjun Kuang
71824992a7 Add Java API for speech enhancement GTCRN models (#2009) 2025-03-16 15:13:20 +08:00
Fangjun Kuang
f5dfcf8d2f Add Kotlin and Java API for online punctuation models (#1936) 2025-02-27 16:52:36 +08:00
Fangjun Kuang
d148860d2c Add Kotlin and Java API for FireRedAsr AED model (#1870) 2025-02-17 10:50:25 +08:00
Fangjun Kuang
4372a7a7b0 Add Java and Kotlin API for Kokoro TTS 1.0 (#1798) 2025-02-07 09:59:27 +08:00
Fangjun Kuang
99cef4198b Add Kotlin and Java API for Kokoro TTS models (#1728) 2025-01-17 17:36:13 +08:00
Fangjun Kuang
930986b06c Release v1.10.38 (#1688) 2025-01-06 11:31:18 +08:00
Fangjun Kuang
a00d3b4821 Add Java API for Matcha-TTS models. (#1673) 2025-01-02 15:15:30 +08:00
Fangjun Kuang
bd4b223920 Add Kotlin and Java API for Moonshine models (#1474) 2024-10-26 22:30:29 +08:00
Fangjun Kuang
1851ff6337 Java API for speaker diarization (#1416) 2024-10-11 16:51:40 +08:00
Fangjun Kuang
dd300b1de5 Add Java and Kotlin API for sense voice (#1164) 2024-07-22 14:08:40 +08:00
Fangjun Kuang
6789c909d2 Inverse text normalization API of streaming ASR for various programming languages (#1022) 2024-06-18 13:42:17 +08:00
Fangjun Kuang
6e09933d99 Inverse text normalization API for other programming languages (#1019) 2024-06-17 17:02:39 +08:00
Fangjun Kuang
5860e45b4c Add KWS examples for Java API (#930) 2024-05-28 15:49:54 +08:00
Fangjun Kuang
bcaa6df389 Add VAD demo for Java API (#928) 2024-05-28 14:59:47 +08:00
Fangjun Kuang
cff207623e Add Java API for speaker identification (#822) 2024-04-29 21:23:56 +08:00
Fangjun Kuang
88202f05bb Add Java API for audio tagging (#820) 2024-04-28 22:26:04 +08:00
Fangjun Kuang
5407f880c0 Add Java and Kotlin API for punctuation models (#818) 2024-04-26 22:06:48 +08:00
Fangjun Kuang
db25986240 Add Java API for spoken language identification with whisper multilingual models (#817) 2024-04-26 19:05:39 +08:00
Fangjun Kuang
15772d2150 Add Java API for text-to-speech (#811) 2024-04-26 09:26:39 +08:00
Fangjun Kuang
f7b3735621 Add CTC HLG decoding for JNI (#810) 2024-04-25 17:20:02 +08:00
Fangjun Kuang
83cd533f67 Add Java API for non-streaming ASR (#807) 2024-04-24 21:03:26 +08:00
Fangjun Kuang
c3a2e8a67c Refactor Java API (#806) 2024-04-24 18:41:48 +08:00
Fangjun Kuang
2a932aca86 Add ccache to CI (#375) 2023-10-20 11:17:37 +08:00
Fangjun Kuang
eead16e27f Fix CI for pip install (#371) 2023-10-19 10:43:14 +08:00
Fangjun Kuang
45b9d4ab37 Support whisper models (#238) 2023-08-07 12:34:18 +08:00
zhaomingwork
5c9aebd583 For java ci actions (#231) 2023-08-05 10:32:48 +08:00