Commit Graph

196 Commits

Author SHA1 Message Date
Fangjun Kuang
fd9a687ec2 Add Pascal/Go/C#/Dart API for NeMo Canary ASR models (#2367)
Add support for the new NeMo Canary ASR model across multiple language bindings by introducing a Canary model configuration and setter method on the offline recognizer.

- Define Canary model config in Pascal, Go, C#, Dart and update converter functions
- Add SetConfig API for offline recognizer (Pascal, Go, C#, Dart)
- Extend CI/workflows and example scripts to test non-streaming Canary decoding
2025-07-10 14:53:33 +08:00
Askars Salimbajevs
f0960342ad Add LODR support to online and offline recognizers (#2026)
This PR integrates LODR (Level-Ordered Deterministic Rescoring) support from Icefall into both online and offline recognizers, enabling LODR for LM shallow fusion and LM rescore.

- Extended OnlineLMConfig and OfflineLMConfig to include lodr_fst, lodr_scale, and lodr_backoff_id.
- Implemented LodrFst and LodrStateCost classes and wired them into RNN LM scoring in both online and offline code paths.
- Updated Python bindings, CLI entry points, examples, and CI test scripts to accept and exercise the new LODR options.
2025-07-09 16:23:46 +08:00
Fangjun Kuang
df4615ca1d Add C/CXX/JavaScript API for NeMo Canary models (#2357)
This PR introduces support for NeMo Canary models across C, C++, and JavaScript APIs 
by adding new Canary configuration structures, updating bindings, extending examples,
and enhancing CI workflows.

- Add OfflineCanaryModelConfig to all language bindings (C, C++, JS, ETS).
- Implement SetConfig methods and NAPI wrappers for updating recognizer config at runtime.
- Update examples and CI scripts to demonstrate and test NeMo Canary model usage.
2025-07-07 23:38:04 +08:00
Fangjun Kuang
0e738c356c Add C++ runtime and Python API for NeMo Canary models (#2352) 2025-07-07 17:03:49 +08:00
Fangjun Kuang
3bf986d08d Support non-streaming zipformer CTC ASR models (#2340)
This PR adds support for non-streaming Zipformer CTC ASR models across 
multiple language bindings, WebAssembly, examples, and CI workflows.

- Introduces a new OfflineZipformerCtcModelConfig in C/C++, Python, Swift, Java, Kotlin, Go, Dart, Pascal, and C# APIs
- Updates initialization, freeing, and recognition logic to include Zipformer CTC in WASM and Node.js
- Adds example scripts and CI steps for downloading, building, and running Zipformer CTC models

Model doc is available at
https://k2-fsa.github.io/sherpa/onnx/pretrained_models/offline-ctc/icefall/zipformer.html
2025-07-04 15:57:07 +08:00
Fangjun Kuang
bda427f4b2 Add API to get version information (#2309) 2025-06-25 00:22:21 +08:00
Fangjun Kuang
d57e4f84de Add Python API for source separation (#2283) 2025-06-05 20:44:26 +08:00
Fangjun Kuang
2b2788332e Add C++ support for UVR models (#2269) 2025-06-01 17:22:08 +08:00
Fangjun Kuang
99defc5b90 Add nodejs example for parakeet-tdt-0.6b-v2. (#2219) 2025-05-15 11:27:22 +08:00
Fangjun Kuang
85df96d528 Add Dart API for homophone replacer (#2167) 2025-04-30 23:15:28 +08:00
Fangjun Kuang
63d01a9534 Add Swift API for homophone replacer. (#2164) 2025-04-29 18:50:41 +08:00
Fangjun Kuang
51f8824219 Add homonphone replacer example for Python API. (#2161) 2025-04-29 15:59:34 +08:00
Fangjun Kuang
9d25c90a59 Add JavaScript API (node-addon) for homophone replacer (#2158) 2025-04-28 20:52:42 +08:00
Fangjun Kuang
a0aef1f6cd Add JavaScript API (WASM) for homophone replacer (#2157) 2025-04-28 20:47:49 +08:00
Fangjun Kuang
f64c58342b Support replacing homonphonic phrases (#2153) 2025-04-27 15:31:11 +08:00
Fangjun Kuang
be0f382a54 Support Giga AM transducer V2 (#2136) 2025-04-20 10:15:20 +08:00
Fangjun Kuang
07a5701af6 Add Dart API for Dolphin CTC models (#2095) 2025-04-03 15:59:38 +08:00
Fangjun Kuang
903e825eba Add Javascript (node-addon) API for Dolphin CTC models (#2094) 2025-04-03 15:03:33 +08:00
Fangjun Kuang
639ad1744f Add Javascript (WebAssembly) API for Dolphin CTC models (#2093) 2025-04-03 15:02:06 +08:00
Fangjun Kuang
74f402e490 Add Swift API for Dolphin CTC models (#2091) 2025-04-03 00:03:11 +08:00
Fangjun Kuang
2dc0f91904 Add C# API for Dolphin CTC models (#2089) 2025-04-02 23:36:22 +08:00
Fangjun Kuang
0de7e1b9f0 Add C++ and Python API for Dolphin CTC models (#2085) 2025-04-02 19:09:00 +08:00
Fangjun Kuang
0aacf02dd8 Add C++ runtime for vocos (#2014) 2025-03-17 17:05:15 +08:00
Fangjun Kuang
c972554ad1 Add JavaScript API (wasm) for speech enhancement GTCRN models (#2007) 2025-03-15 17:41:23 +08:00
Fangjun Kuang
6a97f8adcf Add JavaScript (node-addon) API for speech enhancement GTCRN models (#1996) 2025-03-12 15:52:01 +08:00
Fangjun Kuang
fd78a482df Add Dart API for speech enhancement GTCRN models (#1993) 2025-03-12 12:39:08 +08:00
Fangjun Kuang
d3e27d5e21 Add C# API for speech enhancement GTCRN models (#1990) 2025-03-11 18:58:17 +08:00
Fangjun Kuang
c12d1d88c0 Add Swift API for speech enhancement GTCRN models (#1989) 2025-03-11 18:03:13 +08:00
Fangjun Kuang
5d2d792b1d Add Python API for speech enhancement GTCRN models (#1978) 2025-03-10 19:02:17 +08:00
Fangjun Kuang
488a6e687c Add C++ runtime for speech enhancement GTCRN models (#1977)
See also https://github.com/Xiaobin-Rong/gtcrn
2025-03-10 18:11:16 +08:00
Fangjun Kuang
1e2328242d Test using sherpa-onnx as a cmake subproject (#1961) 2025-03-06 12:12:56 +08:00
Fangjun Kuang
ed922e61b5 Fix publishing pre-built windows libraries (#1905) 2025-02-21 11:59:27 +08:00
Fangjun Kuang
b5d89d7bcb Add Dart API for FireRedAsr AED Model (#1877) 2025-02-17 15:17:08 +08:00
Fangjun Kuang
b03f6e6e8c Add Swift API for FireRedAsr AED Model (#1876) 2025-02-17 15:16:23 +08:00
Fangjun Kuang
d95d43112d Add C# API for FireRedAsr Model (#1875) 2025-02-17 13:32:41 +08:00
Fangjun Kuang
050df2a357 Add JavaScript API (node-addon) for FireRedAsr (#1873) 2025-02-17 12:24:52 +08:00
Fangjun Kuang
9559a10bd3 Add C++ support for MatchaTTS models not from icefall. (#1834) 2025-02-10 15:38:29 +08:00
Fangjun Kuang
0610679539 Add JavaScript API (WebAssembly) for Kokoro TTS 1.0 (#1809) 2025-02-07 16:46:03 +08:00
Fangjun Kuang
19513af0ae Add JavaScript API (node-addon) for Kokoro TTS 1.0 (#1808) 2025-02-07 16:26:41 +08:00
Fangjun Kuang
35f5ff3166 Add Dart API for Kokoro TTS 1.0 (#1806) 2025-02-07 15:53:13 +08:00
Fangjun Kuang
ae32dfaa56 Add C# API for Kokoro TTS 1.0 (#1805) 2025-02-07 15:39:03 +08:00
Fangjun Kuang
e2e0f25100 Add Swift API for Kokoro TTS 1.0 (#1803) 2025-02-07 15:06:34 +08:00
Fangjun Kuang
c84a833863 Add C++ and Python API for Kokoro 1.0 multilingual TTS model (#1795) 2025-02-06 22:57:13 +08:00
Fangjun Kuang
8b989a851c Fix keyword spotting. (#1689)
Reset the stream right after detecting a keyword
2025-01-20 16:41:10 +08:00
Fangjun Kuang
3a1de0bfc1 Add JavaScript (WebAssembly) API for Kokoro TTS models. (#1726) 2025-01-17 11:17:18 +08:00
Fangjun Kuang
e8d499d218 Add JavaScript API (node-addon) for Kokoro TTS models (#1725) 2025-01-16 18:33:47 +08:00
Fangjun Kuang
4335e2accd Add Dart API for Kokoro TTS models (#1723) 2025-01-16 17:58:19 +08:00
Fangjun Kuang
ad61ad6ff5 Add Swift API for Kokoro TTS models (#1721) 2025-01-16 16:47:37 +08:00
Fangjun Kuang
cc812e6237 Add C# API for Kokoro TTS models (#1720) 2025-01-16 16:30:10 +08:00
Fangjun Kuang
ffc6b480a0 Add C++ and Python API for Kokoro TTS models. (#1715) 2025-01-16 14:24:51 +08:00