22 Commits

Author SHA1 Message Date
Fangjun Kuang
3bf986d08d Support non-streaming zipformer CTC ASR models (#2340)
This PR adds support for non-streaming Zipformer CTC ASR models across multiple language bindings, WebAssembly, examples, and CI workflows.

- Introduces a new OfflineZipformerCtcModelConfig in C/C++, Python, Swift, Java, Kotlin, Go, Dart, Pascal, and C# APIs
- Updates initialization, freeing, and recognition logic to include Zipformer CTC in WASM and Node.js
- Adds example scripts and CI steps for downloading, building, and running Zipformer CTC models

Model doc is available at
https://k2-fsa.github.io/sherpa/onnx/pretrained_models/offline-ctc/icefall/zipformer.html
2025-07-04 15:57:07 +08:00
Fangjun Kuang
bda427f4b2 Add API to get version information (#2309) 2025-06-25 00:22:21 +08:00
Fangjun Kuang
2dc0f91904 Add C# API for Dolphin CTC models (#2089) 2025-04-02 23:36:22 +08:00
Fangjun Kuang
d3e27d5e21 Add C# API for speech enhancement GTCRN models (#1990) 2025-03-11 18:58:17 +08:00
Fangjun Kuang
d95d43112d Add C# API for FireRedAsr Model (#1875) 2025-02-17 13:32:41 +08:00
Fangjun Kuang
ae32dfaa56 Add C# API for Kokoro TTS 1.0 (#1805) 2025-02-07 15:39:03 +08:00
Fangjun Kuang
cc812e6237 Add C# API for Kokoro TTS models (#1720) 2025-01-16 16:30:10 +08:00
Fangjun Kuang
3eced3e7ee Add C# and JavaScript (wasm) API for MatchaTTS models (#1682) 2025-01-05 15:08:19 +08:00
Fangjun Kuang
3622104133 Add C# API for Moonshine models. (#1483)
* Also, return timestamps for non-streaming ASR.
2024-10-27 13:14:25 +08:00
Fangjun Kuang
a45e5dba99 C# API for speaker diarization (#1407) 2024-10-10 14:29:05 +08:00
Fangjun Kuang
06fd50f536 Add test about whisper large-v3 for .Net (#1187) 2024-07-29 20:49:38 +08:00
Fangjun Kuang
e472180f2c Add C# API for SenseVoice models (#1151) 2024-07-20 17:09:23 +08:00
Fangjun Kuang
08c758520f Add keyword spotting for C# (#1105) 2024-07-10 21:18:46 +08:00
Fangjun Kuang
6789c909d2 Inverse text normalization API of streaming ASR for various programming languages (#1022) 2024-06-18 13:42:17 +08:00
Fangjun Kuang
6e09933d99 Inverse text normalization API for other programming languages (#1019) 2024-06-17 17:02:39 +08:00
Fangjun Kuang
fd5a0d1e00 Add C++ runtime for Tele-AI/TeleSpeech-ASR (#970) 2024-06-05 00:26:40 +08:00
Fangjun Kuang
0196f1d546 Wrap VAD APIs to C# (#946) 2024-05-30 18:29:38 +08:00
Fangjun Kuang
a99c7cb35c Wrap punctuation APIs to C#. (#945) 2024-05-30 16:19:56 +08:00
Fangjun Kuang
677bc1da3e Add Speaker ID demo for C# (#862) 2024-05-11 13:27:33 +08:00
Fangjun Kuang
612002da57 Fix C# to support Chinese tts models using jieba (#815) 2024-04-26 11:50:07 +08:00
Fangjun Kuang
dbff2eaadb Add C API for streaming HLG decoding (#734) 2024-04-05 10:31:20 +08:00
Fangjun Kuang
305c373107 Add C# API for spoken language identification (#697) 2024-03-25 18:45:09 +08:00