Fangjun Kuang
fd9a687ec2
Add Pascal/Go/C#/Dart API for NeMo Canary ASR models ( #2367 )
...
Add support for the new NeMo Canary ASR model across multiple language bindings by introducing a Canary model configuration and setter method on the offline recognizer.
- Define Canary model config in Pascal, Go, C#, Dart and update converter functions
- Add SetConfig API for offline recognizer (Pascal, Go, C#, Dart)
- Extend CI/workflows and example scripts to test non-streaming Canary decoding
2025-07-10 14:53:33 +08:00
Fangjun Kuang
3bf986d08d
Support non-streaming zipformer CTC ASR models ( #2340 )
...
This PR adds support for non-streaming Zipformer CTC ASR models across
multiple language bindings, WebAssembly, examples, and CI workflows.
- Introduces a new OfflineZipformerCtcModelConfig in C/C++, Python, Swift, Java, Kotlin, Go, Dart, Pascal, and C# APIs
- Updates initialization, freeing, and recognition logic to include Zipformer CTC in WASM and Node.js
- Adds example scripts and CI steps for downloading, building, and running Zipformer CTC models
Model doc is available at
https://k2-fsa.github.io/sherpa/onnx/pretrained_models/offline-ctc/icefall/zipformer.html
2025-07-04 15:57:07 +08:00
Fangjun Kuang
bda427f4b2
Add API to get version information ( #2309 )
2025-06-25 00:22:21 +08:00
Fangjun Kuang
2dc0f91904
Add C# API for Dolphin CTC models ( #2089 )
2025-04-02 23:36:22 +08:00
Fangjun Kuang
0aacf02dd8
Add C++ runtime for vocos ( #2014 )
2025-03-17 17:05:15 +08:00
Fangjun Kuang
d3e27d5e21
Add C# API for speech enhancement GTCRN models ( #1990 )
2025-03-11 18:58:17 +08:00
Fangjun Kuang
b5d89d7bcb
Add Dart API for FireRedAsr AED Model ( #1877 )
2025-02-17 15:17:08 +08:00
Fangjun Kuang
d95d43112d
Add C# API for FireRedAsr Model ( #1875 )
2025-02-17 13:32:41 +08:00
Fangjun Kuang
ae32dfaa56
Add C# API for Kokoro TTS 1.0 ( #1805 )
2025-02-07 15:39:03 +08:00
Fangjun Kuang
8b989a851c
Fix keyword spotting. ( #1689 )
...
Reset the stream right after detecting a keyword
2025-01-20 16:41:10 +08:00
Fangjun Kuang
cc812e6237
Add C# API for Kokoro TTS models ( #1720 )
2025-01-16 16:30:10 +08:00
Fangjun Kuang
3eced3e7ee
Add C# and JavaScript (wasm) API for MatchaTTS models ( #1682 )
2025-01-05 15:08:19 +08:00
Michael Lamothe
8a60985363
Upgraded to .NET 8 and made code style a little more internally consistent. ( #1680 )
2025-01-04 16:39:06 +08:00
Fangjun Kuang
d9c586ccf2
Removed unused TTS example code in .Net examples ( #1492 )
2024-10-29 14:59:12 +08:00
Fangjun Kuang
3622104133
Add C# API for Moonshine models. ( #1483 )
...
* Also, return timestamps for non-streaming ASR.
2024-10-27 13:14:25 +08:00
Fangjun Kuang
a45e5dba99
C# API for speaker diarization ( #1407 )
2024-10-10 14:29:05 +08:00
Michael Twohey
b409b0a958
Fixed the C api calls and created the TTS project file ( #1324 )
...
Co-authored-by: Michael Twohey <mtwohey@americanambulance.com >
2024-09-07 23:25:02 +08:00
Fangjun Kuang
06fd50f536
Add test about whisper large-v3 for .Net ( #1187 )
2024-07-29 20:49:38 +08:00
Fangjun Kuang
e472180f2c
Add C# API for SenseVoice models ( #1151 )
2024-07-20 17:09:23 +08:00
Fangjun Kuang
54e6e962bf
Add microphone example for .Net keyword spotting ( #1120 )
2024-07-13 14:56:39 +08:00
Fangjun Kuang
08c758520f
Add keyword spotting for C# ( #1105 )
2024-07-10 21:18:46 +08:00
Fangjun Kuang
dd0ff2ca06
Support onnxruntime 1.18.0 ( #906 )
2024-07-10 17:05:26 +08:00
Fangjun Kuang
c2cc9dec58
Add Flush to VAD so that the last segment can be detected. ( #1099 )
2024-07-09 16:15:56 +08:00
Fangjun Kuang
8c4f576f1b
Support .Net framework 2.0 ( #1062 )
2024-06-28 11:27:19 +08:00
Fangjun Kuang
5cce159cf3
Fix passing C# string to C++ ( #1055 )
2024-06-25 10:52:59 +08:00
Fangjun Kuang
e7a45108ac
Remove unused files from .Net examples ( #1051 )
2024-06-24 10:25:14 +08:00
东风破
00de2bd00b
Refactor .Net example project ( #1049 )
...
Co-authored-by: 东风破 <birdfishs@163.com >
2024-06-24 10:10:13 +08:00
Fangjun Kuang
9dd0e03568
Enable to stop TTS generation ( #1041 )
2024-06-22 18:18:36 +08:00
Fangjun Kuang
36336b31f4
Build Android APK for Thai ( #1036 )
2024-06-20 18:05:57 +08:00
Fangjun Kuang
6789c909d2
Inverse text normalization API of streaming ASR for various programming languages ( #1022 )
2024-06-18 13:42:17 +08:00
Fangjun Kuang
6e09933d99
Inverse text normalization API for other programming languages ( #1019 )
2024-06-17 17:02:39 +08:00
Fangjun Kuang
fd5a0d1e00
Add C++ runtime for Tele-AI/TeleSpeech-ASR ( #970 )
2024-06-05 00:26:40 +08:00
Fangjun Kuang
0196f1d546
Wrap VAD APIs to C# ( #946 )
2024-05-30 18:29:38 +08:00
Fangjun Kuang
a99c7cb35c
Wrap punctuation APIs to C#. ( #945 )
2024-05-30 16:19:56 +08:00
Fangjun Kuang
b445956675
Fix CI tests. ( #898 )
2024-05-21 20:37:29 +08:00
Fangjun Kuang
677bc1da3e
Add Speaker ID demo for C# ( #862 )
2024-05-11 13:27:33 +08:00
Fangjun Kuang
612002da57
Fix C# to support Chinese tts models using jieba ( #815 )
2024-04-26 11:50:07 +08:00
Fangjun Kuang
f7b3735621
Add CTC HLG decoding for JNI ( #810 )
2024-04-25 17:20:02 +08:00
Fangjun Kuang
a5f8fbc83f
Support heteronyms in Chinese TTS ( #738 )
2024-04-08 11:01:30 +08:00
Fangjun Kuang
dbff2eaadb
Add C API for streaming HLG decoding ( #734 )
2024-04-05 10:31:20 +08:00
Fangjun Kuang
305c373107
Add C# API for spoken language identification ( #697 )
2024-03-25 18:45:09 +08:00
Fangjun Kuang
ab7cff2513
Add C API for spoken language identification. ( #695 )
2024-03-25 15:16:47 +08:00
Fangjun Kuang
1952772654
Add timestamps and tokens for .Net's online models. ( #690 )
2024-03-23 18:51:56 +08:00
Fangjun Kuang
2fc1201924
Add hotwords support to .Net ( #689 )
2024-03-22 21:40:42 +08:00
Fangjun Kuang
24f437a6f1
Refactor github actions tests ( #688 )
2024-03-22 21:22:42 +08:00
Fangjun Kuang
acf0975153
Support whisper language/task in various language bindings. ( #679 )
2024-03-20 16:43:35 +08:00
Fangjun Kuang
6571fc9552
Add tts play example for .Net. ( #676 )
...
It plays the generated audio via a speaker as it is generating.
2024-03-19 17:33:15 +08:00
Fangjun Kuang
be84932f86
Use curl to replace wget for Windows. ( #558 )
...
wget is not available on Windows in GitHub actions
2024-01-29 10:46:34 +08:00
Fangjun Kuang
fa2af5dc69
Add TTS demo for C# API ( #557 )
2024-01-28 23:29:39 +08:00
Fangjun Kuang
e475e750ac
Support streaming zipformer CTC ( #496 )
...
* Support streaming zipformer CTC
* test online zipformer2 CTC
* Update doc of sherpa-onnx.cc
* Add Python APIs for streaming zipformer2 ctc
* Add Python API examples for streaming zipformer2 ctc
* Swift API for streaming zipformer2 CTC
* NodeJS API for streaming zipformer2 CTC
* Kotlin API for streaming zipformer2 CTC
* Golang API for streaming zipformer2 CTC
* C# API for streaming zipformer2 CTC
* Release v1.9.6
2023-12-22 13:46:33 +08:00