Fangjun Kuang
3bf986d08d
Support non-streaming zipformer CTC ASR models ( #2340 )
This PR adds support for non-streaming Zipformer CTC ASR models across
multiple language bindings, WebAssembly, examples, and CI workflows.
- Introduces a new OfflineZipformerCtcModelConfig in C/C++, Python, Swift, Java, Kotlin, Go, Dart, Pascal, and C# APIs
- Updates initialization, freeing, and recognition logic to include Zipformer CTC in WASM and Node.js
- Adds example scripts and CI steps for downloading, building, and running Zipformer CTC models
Model documentation is available at
https://k2-fsa.github.io/sherpa/onnx/pretrained_models/offline-ctc/icefall/zipformer.html
2025-07-04 15:57:07 +08:00
Fangjun Kuang
bda427f4b2
Add API to get version information ( #2309 )
2025-06-25 00:22:21 +08:00
Fangjun Kuang
d57e4f84de
Add Python API for source separation ( #2283 )
2025-06-05 20:44:26 +08:00
Fangjun Kuang
2b2788332e
Add C++ support for UVR models ( #2269 )
2025-06-01 17:22:08 +08:00
Fangjun Kuang
99defc5b90
Add nodejs example for parakeet-tdt-0.6b-v2. ( #2219 )
2025-05-15 11:27:22 +08:00
Fangjun Kuang
85df96d528
Add Dart API for homophone replacer ( #2167 )
2025-04-30 23:15:28 +08:00
Fangjun Kuang
63d01a9534
Add Swift API for homophone replacer. ( #2164 )
2025-04-29 18:50:41 +08:00
Fangjun Kuang
51f8824219
Add homophone replacer example for Python API. ( #2161 )
2025-04-29 15:59:34 +08:00
Fangjun Kuang
9d25c90a59
Add JavaScript API (node-addon) for homophone replacer ( #2158 )
2025-04-28 20:52:42 +08:00
Fangjun Kuang
a0aef1f6cd
Add JavaScript API (WASM) for homophone replacer ( #2157 )
2025-04-28 20:47:49 +08:00
Fangjun Kuang
f64c58342b
Support replacing homophonic phrases ( #2153 )
2025-04-27 15:31:11 +08:00
Fangjun Kuang
be0f382a54
Support Giga AM transducer V2 ( #2136 )
2025-04-20 10:15:20 +08:00
Fangjun Kuang
07a5701af6
Add Dart API for Dolphin CTC models ( #2095 )
2025-04-03 15:59:38 +08:00
Fangjun Kuang
903e825eba
Add JavaScript (node-addon) API for Dolphin CTC models ( #2094 )
2025-04-03 15:03:33 +08:00
Fangjun Kuang
639ad1744f
Add JavaScript (WebAssembly) API for Dolphin CTC models ( #2093 )
2025-04-03 15:02:06 +08:00
Fangjun Kuang
74f402e490
Add Swift API for Dolphin CTC models ( #2091 )
2025-04-03 00:03:11 +08:00
Fangjun Kuang
2dc0f91904
Add C# API for Dolphin CTC models ( #2089 )
2025-04-02 23:36:22 +08:00
Fangjun Kuang
0de7e1b9f0
Add C++ and Python API for Dolphin CTC models ( #2085 )
2025-04-02 19:09:00 +08:00
Fangjun Kuang
0aacf02dd8
Add C++ runtime for vocos ( #2014 )
2025-03-17 17:05:15 +08:00
Fangjun Kuang
c972554ad1
Add JavaScript API (wasm) for speech enhancement GTCRN models ( #2007 )
2025-03-15 17:41:23 +08:00
Fangjun Kuang
6a97f8adcf
Add JavaScript (node-addon) API for speech enhancement GTCRN models ( #1996 )
2025-03-12 15:52:01 +08:00
Fangjun Kuang
fd78a482df
Add Dart API for speech enhancement GTCRN models ( #1993 )
2025-03-12 12:39:08 +08:00
Fangjun Kuang
d3e27d5e21
Add C# API for speech enhancement GTCRN models ( #1990 )
2025-03-11 18:58:17 +08:00
Fangjun Kuang
c12d1d88c0
Add Swift API for speech enhancement GTCRN models ( #1989 )
2025-03-11 18:03:13 +08:00
Fangjun Kuang
5d2d792b1d
Add Python API for speech enhancement GTCRN models ( #1978 )
2025-03-10 19:02:17 +08:00
Fangjun Kuang
488a6e687c
Add C++ runtime for speech enhancement GTCRN models ( #1977 )
See also https://github.com/Xiaobin-Rong/gtcrn
2025-03-10 18:11:16 +08:00
Fangjun Kuang
1e2328242d
Test using sherpa-onnx as a cmake subproject ( #1961 )
2025-03-06 12:12:56 +08:00
Fangjun Kuang
ed922e61b5
Fix publishing pre-built windows libraries ( #1905 )
2025-02-21 11:59:27 +08:00
Fangjun Kuang
b5d89d7bcb
Add Dart API for FireRedAsr AED Model ( #1877 )
2025-02-17 15:17:08 +08:00
Fangjun Kuang
b03f6e6e8c
Add Swift API for FireRedAsr AED Model ( #1876 )
2025-02-17 15:16:23 +08:00
Fangjun Kuang
d95d43112d
Add C# API for FireRedAsr Model ( #1875 )
2025-02-17 13:32:41 +08:00
Fangjun Kuang
050df2a357
Add JavaScript API (node-addon) for FireRedAsr ( #1873 )
2025-02-17 12:24:52 +08:00
Fangjun Kuang
9559a10bd3
Add C++ support for MatchaTTS models not from icefall. ( #1834 )
2025-02-10 15:38:29 +08:00
Fangjun Kuang
0610679539
Add JavaScript API (WebAssembly) for Kokoro TTS 1.0 ( #1809 )
2025-02-07 16:46:03 +08:00
Fangjun Kuang
19513af0ae
Add JavaScript API (node-addon) for Kokoro TTS 1.0 ( #1808 )
2025-02-07 16:26:41 +08:00
Fangjun Kuang
35f5ff3166
Add Dart API for Kokoro TTS 1.0 ( #1806 )
2025-02-07 15:53:13 +08:00
Fangjun Kuang
ae32dfaa56
Add C# API for Kokoro TTS 1.0 ( #1805 )
2025-02-07 15:39:03 +08:00
Fangjun Kuang
e2e0f25100
Add Swift API for Kokoro TTS 1.0 ( #1803 )
2025-02-07 15:06:34 +08:00
Fangjun Kuang
c84a833863
Add C++ and Python API for Kokoro 1.0 multilingual TTS model ( #1795 )
2025-02-06 22:57:13 +08:00
Fangjun Kuang
8b989a851c
Fix keyword spotting. ( #1689 )
Reset the stream right after detecting a keyword
2025-01-20 16:41:10 +08:00
Fangjun Kuang
3a1de0bfc1
Add JavaScript (WebAssembly) API for Kokoro TTS models. ( #1726 )
2025-01-17 11:17:18 +08:00
Fangjun Kuang
e8d499d218
Add JavaScript API (node-addon) for Kokoro TTS models ( #1725 )
2025-01-16 18:33:47 +08:00
Fangjun Kuang
4335e2accd
Add Dart API for Kokoro TTS models ( #1723 )
2025-01-16 17:58:19 +08:00
Fangjun Kuang
ad61ad6ff5
Add Swift API for Kokoro TTS models ( #1721 )
2025-01-16 16:47:37 +08:00
Fangjun Kuang
cc812e6237
Add C# API for Kokoro TTS models ( #1720 )
2025-01-16 16:30:10 +08:00
Fangjun Kuang
ffc6b480a0
Add C++ and Python API for Kokoro TTS models. ( #1715 )
2025-01-16 14:24:51 +08:00
Fangjun Kuang
0cb2db385a
Add README for android libs. ( #1693 )
2025-01-07 11:04:10 +08:00
Fangjun Kuang
d7c95d33a3
Add Dart API for MatchaTTS models ( #1687 )
2025-01-06 11:03:31 +08:00
Fangjun Kuang
6f085babcc
Add Swift API for MatchaTTS models. ( #1684 )
2025-01-06 07:23:45 +08:00
Fangjun Kuang
3eced3e7ee
Add C# and JavaScript (wasm) API for MatchaTTS models ( #1682 )
2025-01-05 15:08:19 +08:00