Fangjun Kuang
3bf986d08d
Support non-streaming zipformer CTC ASR models ( #2340 )
...
This PR adds support for non-streaming Zipformer CTC ASR models across
multiple language bindings, WebAssembly, examples, and CI workflows.
- Introduces a new OfflineZipformerCtcModelConfig in C/C++, Python, Swift, Java, Kotlin, Go, Dart, Pascal, and C# APIs
- Updates initialization, freeing, and recognition logic to include Zipformer CTC in WASM and Node.js
- Adds example scripts and CI steps for downloading, building, and running Zipformer CTC models
Model doc is available at
https://k2-fsa.github.io/sherpa/onnx/pretrained_models/offline-ctc/icefall/zipformer.html
2025-07-04 15:57:07 +08:00
Fangjun Kuang
bda427f4b2
Add API to get version information ( #2309 )
2025-06-25 00:22:21 +08:00
Fangjun Kuang
6982b86c66
Support extra languages in multi-lang kokoro tts ( #2303 )
2025-06-20 11:22:52 +08:00
Fangjun Kuang
e51c37eb2f
Add C and CXX API for homophone replacer ( #2156 )
2025-04-27 22:09:13 +08:00
Fangjun Kuang
da4aad1189
Add C and CXX API for Dolphin CTC models ( #2088 )
2025-04-02 21:54:20 +08:00
Fangjun Kuang
0703bc1b86
Add CXX API for VAD ( #2077 )
2025-04-01 14:51:43 +08:00
Fangjun Kuang
6a97f8adcf
Add JavaScript (node-addon) API for speech enhancement GTCRN models ( #1996 )
2025-03-12 15:52:01 +08:00
Fangjun Kuang
c5dbf1177c
Add C API for speech enhancement GTCRN models ( #1984 )
2025-03-11 15:50:04 +08:00
cjsdurj
b87fce9a7f
c-api add wave write to buffer. ( #1962 )
...
Co-authored-by: jian.chen03 <jian.chen03@transwarp.io >
2025-03-10 17:21:23 +08:00
Fangjun Kuang
193d31333c
Add C API for FireRedAsr AED model. ( #1871 )
2025-02-17 11:22:17 +08:00
Kell
2ac41d3d85
OfflineRecognizer supports create stream with hotwords ( #1833 )
...
Co-authored-by: Wangkai <kell.wang@huawei.com >
2025-02-10 16:26:56 +08:00
Fangjun Kuang
69f489f0cd
Support scaling the duration of a pause in TTS. ( #1820 )
2025-02-08 12:47:26 +08:00
Fangjun Kuang
7330f7519a
Add C API for Kokoro TTS 1.0 ( #1801 )
2025-02-07 14:30:40 +08:00
Fangjun Kuang
f178e96bf0
Add keyword spotter C API for HarmonyOS ( #1769 )
2025-01-26 14:12:30 +08:00
Fangjun Kuang
8b989a851c
Fix keyword spotting. ( #1689 )
...
Reset the stream right after detecting a keyword
2025-01-20 16:41:10 +08:00
Fangjun Kuang
af671e2b63
Add C API for Kokoro TTS models ( #1717 )
2025-01-16 15:07:26 +08:00
Fangjun Kuang
bf3330c906
Add HarmonyOS examples for MatchaTTS. ( #1678 )
2025-01-03 17:09:29 +08:00
Fangjun Kuang
648903834b
Add CXX API for MatchaTTS models ( #1676 )
2025-01-03 14:16:36 +08:00
Fangjun Kuang
9aa4897a9e
Add C API for MatchaTTS models ( #1675 )
2025-01-03 12:17:26 +08:00
Fangjun Kuang
3422b9388d
Add Kotlin API for Matcha-TTS models. ( #1668 )
2024-12-31 19:20:52 +08:00
yujinqiu
5c2cc48f50
Add swift online punctuation ( #1661 )
2024-12-31 11:26:32 +08:00
Fangjun Kuang
914cbad6a3
Add speaker diarization demo for HarmonyOS ( #1610 )
2024-12-10 20:11:44 +08:00
Fangjun Kuang
1bae4085ca
Add speaker diarization API for HarmonyOS. ( #1609 )
2024-12-10 16:03:03 +08:00
Fangjun Kuang
314545f938
Add speaker identification APIs for HarmonyOS ( #1607 )
...
* Add speaker embedding extractor API for HarmonyOS
* Add ArkTS API for speaker identification
2024-12-09 19:23:18 +08:00
Fangjun Kuang
74a8735f7a
Add on-device tex-to-speech (TTS) demo for HarmonyOS ( #1590 )
2024-12-04 14:27:12 +08:00
Fangjun Kuang
dc3287f3a8
Add HarmonyOS support for text-to-speech. ( #1584 )
2024-12-01 21:43:34 +08:00
Fangjun Kuang
315d8e2a47
Publish sherpa_onnx.har for HarmonyOS ( #1572 )
2024-11-28 17:30:16 +08:00
Fangjun Kuang
2101227269
Add streaming ASR support for HarmonyOS. ( #1565 )
2024-11-26 18:36:56 +08:00
Fangjun Kuang
298b6b6fda
Add non-streaming ASR support for HarmonyOS. ( #1564 )
2024-11-26 16:38:35 +08:00
Fangjun Kuang
31d6206fde
HarmonyOS support for VAD. ( #1561 )
2024-11-24 16:29:24 +08:00
Fangjun Kuang
2ca2985d04
Add C and C++ API for Moonshine models ( #1476 )
2024-10-26 23:24:46 +08:00
Fangjun Kuang
ceb69ebd94
Add C++ API for non-streaming ASR ( #1456 )
2024-10-23 16:40:12 +08:00
Fangjun Kuang
effd5ef2be
Add C++ API for streaming ASR. ( #1455 )
...
It is a wrapper around the C API.
2024-10-23 12:07:43 +08:00
Fangjun Kuang
1ed803adc1
Dart API for speaker diarization ( #1418 )
2024-10-11 21:17:41 +08:00
Fangjun Kuang
1d061df355
WebAssembly exmaple for speaker diarization ( #1411 )
2024-10-10 22:14:45 +08:00
Fangjun Kuang
d468527f62
C API for speaker diarization ( #1402 )
2024-10-09 17:10:03 +08:00
lxiao336
06b61ccad8
Allow more online models to load tokens file from the memory ( #1352 )
...
Co-authored-by: xiao <shawl336@6163.com >
2024-09-20 16:38:41 +08:00
Fangjun Kuang
e7ffcbd677
Add APIs about max speech duration in VAD for various programming languages ( #1349 )
2024-09-14 12:30:13 +08:00
Fangjun Kuang
544857b097
Fix building ( #1343 )
2024-09-13 13:33:52 +08:00
lxiao336
65cfa7548a
re-pull-request allow tokens and hotwords be loaded from buffered string driectly ( #1339 )
...
Co-authored-by: xiao <shawl336@163.com >
2024-09-13 09:58:17 +08:00
Fangjun Kuang
537e163dd0
WebAssembly example for VAD + Non-streaming ASR ( #1284 )
2024-08-24 13:24:52 +08:00
Fangjun Kuang
5a2aa110b8
Text to speech API for Object Pascal. ( #1273 )
2024-08-20 20:52:16 +08:00
Robin Zhong
62c4d4ab62
Add emotion, event of SenseVoice. ( #1257 )
...
* Add emotion, event of SenseVoice.
* Fix tokens size check and update java api.
https://github.com/k2-fsa/sherpa-onnx/pull/1257
2024-08-14 15:50:13 +08:00
Fangjun Kuang
94e256244d
Add blank penalty for various language bindings. ( #1234 )
2024-08-08 10:43:31 +08:00
Parth Khiera
ba4cb6169f
feat: addition of blank_penalty config in online_recognizer ( #1232 )
2024-08-08 09:10:17 +08:00
Fangjun Kuang
4e6aeff07e
Refactor C API to prefix each API with SherpaOnnx. ( #1171 )
2024-07-26 18:47:02 +08:00
Fangjun Kuang
25f0a10468
Add C++ runtime for SenseVoice models ( #1148 )
2024-07-18 22:54:18 +08:00
Fangjun Kuang
960eb7529e
Add C++ runtime for MeloTTS ( #1138 )
2024-07-16 15:55:02 +08:00
ivan provalov
de04b3b9bf
Allow modify model config at decode time for ASR ( #1124 )
2024-07-13 22:30:47 +08:00
thewh1teagle
c0eaf86dbd
feat: find best embedding matches ( #1102 )
2024-07-11 09:38:06 +08:00