Commit Graph

21 Commits

Author SHA1 Message Date
Fangjun Kuang
3bf986d08d Support non-streaming zipformer CTC ASR models (#2340)
This PR adds support for non-streaming Zipformer CTC ASR models across 
multiple language bindings, WebAssembly, examples, and CI workflows.

- Introduces a new OfflineZipformerCtcModelConfig in C/C++, Python, Swift, Java, Kotlin, Go, Dart, Pascal, and C# APIs
- Updates initialization, freeing, and recognition logic to include Zipformer CTC in WASM and Node.js
- Adds example scripts and CI steps for downloading, building, and running Zipformer CTC models

Model doc is available at
https://k2-fsa.github.io/sherpa/onnx/pretrained_models/offline-ctc/icefall/zipformer.html
2025-07-04 15:57:07 +08:00
Fangjun Kuang
bda427f4b2 Add API to get version information (#2309) 2025-06-25 00:22:21 +08:00
Fangjun Kuang
6982b86c66 Support extra languages in multi-lang kokoro tts (#2303) 2025-06-20 11:22:52 +08:00
Fangjun Kuang
028b8f2718 Add C++ example for streaming ASR with SenseVoice. (#2199) 2025-05-11 00:23:32 +08:00
Fangjun Kuang
e51c37eb2f Add C and CXX API for homophone replacer (#2156) 2025-04-27 22:09:13 +08:00
Fangjun Kuang
da4aad1189 Add C and CXX API for Dolphin CTC models (#2088) 2025-04-02 21:54:20 +08:00
Fangjun Kuang
0703bc1b86 Add CXX API for VAD (#2077) 2025-04-01 14:51:43 +08:00
Fangjun Kuang
802119db17 Add CXX API for speech enhancement GTCRN models (#1986) 2025-03-11 17:07:52 +08:00
Fangjun Kuang
1d49dd2fb0 Add CXX API for FireRedAsr (#1872) 2025-02-17 11:46:13 +08:00
Kell
2ac41d3d85 OfflineRecognizer supports create stream with hotwords (#1833)
Co-authored-by: Wangkai <kell.wang@huawei.com>
2025-02-10 16:26:56 +08:00
Fangjun Kuang
69f489f0cd Support scaling the duration of a pause in TTS. (#1820) 2025-02-08 12:47:26 +08:00
Fangjun Kuang
d815204774 Add CXX API for Kokoro TTS 1.0 (#1802) 2025-02-07 14:51:49 +08:00
Fangjun Kuang
8b989a851c Fix keyword spotting. (#1689)
Reset the stream right after detecting a keyword
2025-01-20 16:41:10 +08:00
Fangjun Kuang
af671e2b63 Add C API for Kokoro TTS models (#1717) 2025-01-16 15:07:26 +08:00
Fangjun Kuang
648903834b Add CXX API for MatchaTTS models (#1676) 2025-01-03 14:16:36 +08:00
Fangjun Kuang
298b6b6fda Add non-streaming ASR support for HarmonyOS. (#1564) 2024-11-26 16:38:35 +08:00
Fangjun Kuang
356da3b54c Publish pre-built macos xcframework (#1490) 2024-10-29 12:26:26 +08:00
Fangjun Kuang
2ca2985d04 Add C and C++ API for Moonshine models (#1476) 2024-10-26 23:24:46 +08:00
Fangjun Kuang
b3e05f6dc4 Fix style issues (#1458) 2024-10-24 11:15:08 +08:00
Fangjun Kuang
ceb69ebd94 Add C++ API for non-streaming ASR (#1456) 2024-10-23 16:40:12 +08:00
Fangjun Kuang
effd5ef2be Add C++ API for streaming ASR. (#1455)
It is a wrapper around the C API.
2024-10-23 12:07:43 +08:00