58 Commits

Author SHA1 Message Date
Fangjun Kuang
fd9a687ec2 Add Pascal/Go/C#/Dart API for NeMo Canary ASR models (#2367)
Add support for the new NeMo Canary ASR model across multiple language bindings by introducing a Canary model configuration and setter method on the offline recognizer.

- Define Canary model config in Pascal, Go, C#, Dart and update converter functions
- Add SetConfig API for offline recognizer (Pascal, Go, C#, Dart)
- Extend CI/workflows and example scripts to test non-streaming Canary decoding
2025-07-10 14:53:33 +08:00
Fangjun Kuang
3bf986d08d Support non-streaming zipformer CTC ASR models (#2340)
This PR adds support for non-streaming Zipformer CTC ASR models across multiple language bindings, WebAssembly, examples, and CI workflows.

- Introduces a new OfflineZipformerCtcModelConfig in C/C++, Python, Swift, Java, Kotlin, Go, Dart, Pascal, and C# APIs
- Updates initialization, freeing, and recognition logic to include Zipformer CTC in WASM and Node.js
- Adds example scripts and CI steps for downloading, building, and running Zipformer CTC models

Model doc is available at
https://k2-fsa.github.io/sherpa/onnx/pretrained_models/offline-ctc/icefall/zipformer.html
2025-07-04 15:57:07 +08:00
Fangjun Kuang
bda427f4b2 Add API to get version information (#2309) 2025-06-25 00:22:21 +08:00
Fangjun Kuang
2dc0f91904 Add C# API for Dolphin CTC models (#2089) 2025-04-02 23:36:22 +08:00
Fangjun Kuang
0aacf02dd8 Add C++ runtime for vocos (#2014) 2025-03-17 17:05:15 +08:00
Fangjun Kuang
d3e27d5e21 Add C# API for speech enhancement GTCRN models (#1990) 2025-03-11 18:58:17 +08:00
Fangjun Kuang
b5d89d7bcb Add Dart API for FireRedAsr AED Model (#1877) 2025-02-17 15:17:08 +08:00
Fangjun Kuang
d95d43112d Add C# API for FireRedAsr Model (#1875) 2025-02-17 13:32:41 +08:00
Fangjun Kuang
ae32dfaa56 Add C# API for Kokoro TTS 1.0 (#1805) 2025-02-07 15:39:03 +08:00
Fangjun Kuang
8b989a851c Fix keyword spotting. (#1689)
Reset the stream right after detecting a keyword
2025-01-20 16:41:10 +08:00
Fangjun Kuang
cc812e6237 Add C# API for Kokoro TTS models (#1720) 2025-01-16 16:30:10 +08:00
Fangjun Kuang
3eced3e7ee Add C# and JavaScript (wasm) API for MatchaTTS models (#1682) 2025-01-05 15:08:19 +08:00
Michael Lamothe
8a60985363 Upgraded to .NET 8 and made code style a little more internally consistent. (#1680) 2025-01-04 16:39:06 +08:00
Fangjun Kuang
d9c586ccf2 Removed unused TTS example code in .Net examples (#1492) 2024-10-29 14:59:12 +08:00
Fangjun Kuang
3622104133 Add C# API for Moonshine models. (#1483)
* Also, return timestamps for non-streaming ASR.
2024-10-27 13:14:25 +08:00
Fangjun Kuang
a45e5dba99 C# API for speaker diarization (#1407) 2024-10-10 14:29:05 +08:00
Michael Twohey
b409b0a958 Fixed the C api calls and created the TTS project file (#1324)
Co-authored-by: Michael Twohey <mtwohey@americanambulance.com>
2024-09-07 23:25:02 +08:00
Fangjun Kuang
06fd50f536 Add test about whisper large-v3 for .Net (#1187) 2024-07-29 20:49:38 +08:00
Fangjun Kuang
e472180f2c Add C# API for SenseVoice models (#1151) 2024-07-20 17:09:23 +08:00
Fangjun Kuang
54e6e962bf Add microphone example for .Net keyword spotting (#1120) 2024-07-13 14:56:39 +08:00
Fangjun Kuang
08c758520f Add keyword spotting for C# (#1105) 2024-07-10 21:18:46 +08:00
Fangjun Kuang
dd0ff2ca06 Support onnxruntime 1.18.0 (#906) 2024-07-10 17:05:26 +08:00
Fangjun Kuang
c2cc9dec58 Add Flush to VAD so that the last segment can be detected. (#1099) 2024-07-09 16:15:56 +08:00
Fangjun Kuang
8c4f576f1b Support .Net framework 2.0 (#1062) 2024-06-28 11:27:19 +08:00
Fangjun Kuang
5cce159cf3 Fix passing C# string to C++ (#1055) 2024-06-25 10:52:59 +08:00
Fangjun Kuang
e7a45108ac Remove unused files from .Net examples (#1051) 2024-06-24 10:25:14 +08:00
东风破
00de2bd00b Refactor .Net example project (#1049)
Co-authored-by: 东风破 <birdfishs@163.com>
2024-06-24 10:10:13 +08:00
Fangjun Kuang
9dd0e03568 Enable to stop TTS generation (#1041) 2024-06-22 18:18:36 +08:00
Fangjun Kuang
36336b31f4 Build Android APK for Thai (#1036) 2024-06-20 18:05:57 +08:00
Fangjun Kuang
6789c909d2 Inverse text normalization API of streaming ASR for various programming languages (#1022) 2024-06-18 13:42:17 +08:00
Fangjun Kuang
6e09933d99 Inverse text normalization API for other programming languages (#1019) 2024-06-17 17:02:39 +08:00
Fangjun Kuang
fd5a0d1e00 Add C++ runtime for Tele-AI/TeleSpeech-ASR (#970) 2024-06-05 00:26:40 +08:00
Fangjun Kuang
0196f1d546 Wrap VAD APIs to C# (#946) 2024-05-30 18:29:38 +08:00
Fangjun Kuang
a99c7cb35c Wrap punctuation APIs to C#. (#945) 2024-05-30 16:19:56 +08:00
Fangjun Kuang
b445956675 Fix CI tests. (#898) 2024-05-21 20:37:29 +08:00
Fangjun Kuang
677bc1da3e Add Speaker ID demo for C# (#862) 2024-05-11 13:27:33 +08:00
Fangjun Kuang
612002da57 Fix C# to support Chinese tts models using jieba (#815) 2024-04-26 11:50:07 +08:00
Fangjun Kuang
f7b3735621 Add CTC HLG decoding for JNI (#810) 2024-04-25 17:20:02 +08:00
Fangjun Kuang
a5f8fbc83f Support heteronyms in Chinese TTS (#738) 2024-04-08 11:01:30 +08:00
Fangjun Kuang
dbff2eaadb Add C API for streaming HLG decoding (#734) 2024-04-05 10:31:20 +08:00
Fangjun Kuang
305c373107 Add C# API for spoken language identification (#697) 2024-03-25 18:45:09 +08:00
Fangjun Kuang
ab7cff2513 Add C API for spoken language identification. (#695) 2024-03-25 15:16:47 +08:00
Fangjun Kuang
1952772654 Add timestamps and tokens for .Net's online models. (#690) 2024-03-23 18:51:56 +08:00
Fangjun Kuang
2fc1201924 Add hotwords support to .Net (#689) 2024-03-22 21:40:42 +08:00
Fangjun Kuang
24f437a6f1 Refactor github actions tests (#688) 2024-03-22 21:22:42 +08:00
Fangjun Kuang
acf0975153 Support whisper language/task in various language bindings. (#679) 2024-03-20 16:43:35 +08:00
Fangjun Kuang
6571fc9552 Add tts play example for .Net. (#676)
It plays the generated audio via a speaker as it is generating.
2024-03-19 17:33:15 +08:00
Fangjun Kuang
be84932f86 Use curl to replace wget for Windows. (#558)
wget is not available on Windows in GitHub actions
2024-01-29 10:46:34 +08:00
Fangjun Kuang
fa2af5dc69 Add TTS demo for C# API (#557) 2024-01-28 23:29:39 +08:00
Fangjun Kuang
e475e750ac Support streaming zipformer CTC (#496)
* Support streaming zipformer CTC

* test online zipformer2 CTC

* Update doc of sherpa-onnx.cc

* Add Python APIs for streaming zipformer2 ctc

* Add Python API examples for streaming zipformer2 ctc

* Swift API for streaming zipformer2 CTC

* NodeJS API for streaming zipformer2 CTC

* Kotlin API for streaming zipformer2 CTC

* Golang API for streaming zipformer2 CTC

* C# API for streaming zipformer2 CTC

* Release v1.9.6
2023-12-22 13:46:33 +08:00