Fangjun Kuang
3bf986d08d
Support non-streaming zipformer CTC ASR models ( #2340 )
...
This PR adds support for non-streaming Zipformer CTC ASR models across
multiple language bindings, WebAssembly, examples, and CI workflows.
- Introduces a new OfflineZipformerCtcModelConfig in C/C++, Python, Swift, Java, Kotlin, Go, Dart, Pascal, and C# APIs
- Updates initialization, freeing, and recognition logic to include Zipformer CTC in WASM and Node.js
- Adds example scripts and CI steps for downloading, building, and running Zipformer CTC models
Model doc is available at
https://k2-fsa.github.io/sherpa/onnx/pretrained_models/offline-ctc/icefall/zipformer.html
2025-07-04 15:57:07 +08:00
Fangjun Kuang
bda427f4b2
Add API to get version information ( #2309 )
2025-06-25 00:22:21 +08:00
Fangjun Kuang
2dc0f91904
Add C# API for Dolphin CTC models ( #2089 )
2025-04-02 23:36:22 +08:00
Fangjun Kuang
d3e27d5e21
Add C# API for speech enhancement GTCRN models ( #1990 )
2025-03-11 18:58:17 +08:00
Fangjun Kuang
d95d43112d
Add C# API for FireRedAsr Model ( #1875 )
2025-02-17 13:32:41 +08:00
Fangjun Kuang
ae32dfaa56
Add C# API for Kokoro TTS 1.0 ( #1805 )
2025-02-07 15:39:03 +08:00
Fangjun Kuang
cc812e6237
Add C# API for Kokoro TTS models ( #1720 )
2025-01-16 16:30:10 +08:00
Fangjun Kuang
3eced3e7ee
Add C# and JavaScript (wasm) API for MatchaTTS models ( #1682 )
2025-01-05 15:08:19 +08:00
Fangjun Kuang
3622104133
Add C# API for Moonshine models. ( #1483 )
...
* Also, return timestamps for non-streaming ASR.
2024-10-27 13:14:25 +08:00
Fangjun Kuang
a45e5dba99
C# API for speaker diarization ( #1407 )
2024-10-10 14:29:05 +08:00
Fangjun Kuang
06fd50f536
Add test about whisper large-v3 for .Net ( #1187 )
2024-07-29 20:49:38 +08:00
Fangjun Kuang
e472180f2c
Add C# API for SenseVoice models ( #1151 )
2024-07-20 17:09:23 +08:00
Fangjun Kuang
08c758520f
Add keyword spotting for C# ( #1105 )
2024-07-10 21:18:46 +08:00
Fangjun Kuang
6789c909d2
Inverse text normalization API of streaming ASR for various programming languages ( #1022 )
2024-06-18 13:42:17 +08:00
Fangjun Kuang
6e09933d99
Inverse text normalization API for other programming languages ( #1019 )
2024-06-17 17:02:39 +08:00
Fangjun Kuang
fd5a0d1e00
Add C++ runtime for Tele-AI/TeleSpeech-ASR ( #970 )
2024-06-05 00:26:40 +08:00
Fangjun Kuang
0196f1d546
Wrap VAD APIs to C# ( #946 )
2024-05-30 18:29:38 +08:00
Fangjun Kuang
a99c7cb35c
Wrap punctuation APIs to C#. ( #945 )
2024-05-30 16:19:56 +08:00
Fangjun Kuang
677bc1da3e
Add Speaker ID demo for C# ( #862 )
2024-05-11 13:27:33 +08:00
Fangjun Kuang
612002da57
Fix C# to support Chinese tts models using jieba ( #815 )
2024-04-26 11:50:07 +08:00
Fangjun Kuang
dbff2eaadb
Add C API for streaming HLG decoding ( #734 )
2024-04-05 10:31:20 +08:00
Fangjun Kuang
305c373107
Add C# API for spoken language identification ( #697 )
2024-03-25 18:45:09 +08:00