Fangjun Kuang
df4615ca1d
Add C/CXX/JavaScript API for NeMo Canary models ( #2357 )
...
This PR introduces support for NeMo Canary models across C, C++, and JavaScript APIs
by adding new Canary configuration structures, updating bindings, extending examples,
and enhancing CI workflows.
- Add OfflineCanaryModelConfig to all language bindings (C, C++, JS, ETS).
- Implement SetConfig methods and NAPI wrappers for updating recognizer config at runtime.
- Update examples and CI scripts to demonstrate and test NeMo Canary model usage.
2025-07-07 23:38:04 +08:00
linsui
33a689dc86
Fix typo CMAKE_EXECUTBLE_LINKER_FLAGS -> CMAKE_EXECUTABLE_LINKER_FLAGS ( #2344 )
2025-07-04 21:13:39 +08:00
Fangjun Kuang
3bf986d08d
Support non-streaming zipformer CTC ASR models ( #2340 )
...
This PR adds support for non-streaming Zipformer CTC ASR models across
multiple language bindings, WebAssembly, examples, and CI workflows.
- Introduces a new OfflineZipformerCtcModelConfig in C/C++, Python, Swift, Java, Kotlin, Go, Dart, Pascal, and C# APIs
- Updates initialization, freeing, and recognition logic to include Zipformer CTC in WASM and Node.js
- Adds example scripts and CI steps for downloading, building, and running Zipformer CTC models
Model doc is available at
https://k2-fsa.github.io/sherpa/onnx/pretrained_models/offline-ctc/icefall/zipformer.html
2025-07-04 15:57:07 +08:00
Fangjun Kuang
bda427f4b2
Add API to get version information ( #2309 )
2025-06-25 00:22:21 +08:00
Fangjun Kuang
6982b86c66
Support extra languages in multi-lang kokoro tts ( #2303 )
2025-06-20 11:22:52 +08:00
guoxiangyang
0c42c06f75
update wasm/vad-asr/assets/README.md for more clear ( #2297 )
...
Co-authored-by: gxy <gxy@conwi.cn >
2025-06-16 15:35:20 +08:00
Fangjun Kuang
716ba8317b
Add C++ runtime for spleeter about source separation ( #2242 )
2025-05-23 22:30:57 +08:00
Fangjun Kuang
a0aef1f6cd
Add JavaScript API (WASM) for homophone replacer ( #2157 )
2025-04-28 20:47:49 +08:00
Fangjun Kuang
639ad1744f
Add Javascript (WebAssembly) API for Dolphin CTC models ( #2093 )
2025-04-03 15:02:06 +08:00
Fangjun Kuang
c972554ad1
Add JavaScript API (wasm) for speech enhancement GTCRN models ( #2007 )
2025-03-15 17:41:23 +08:00
Fangjun Kuang
d320fdf65e
Add WebAssembly (WASM) for speech enhancement GTCRN models ( #2002 )
2025-03-13 18:35:03 +08:00
Fangjun Kuang
7ad44bc43a
Add JavaScript API (WebAssembly) for FireRedAsr model. ( #1874 )
2025-02-17 12:54:18 +08:00
Fangjun Kuang
69f489f0cd
Support scaling the duration of a pause in TTS. ( #1820 )
2025-02-08 12:47:26 +08:00
顾真牛
51b42748e5
Update sherpa-onnx-tts.js VitsModelConfig.model can be none ( #1817 )
...
* Update sherpa-onnx-tts.js VitsModelConfig.model can be none
* Update sherpa-onnx-tts.js
2025-02-08 08:25:19 +08:00
Fangjun Kuang
0610679539
Add JavaScript API (WebAssembly) for Kokoro TTS 1.0 ( #1809 )
2025-02-07 16:46:03 +08:00
Fangjun Kuang
8b989a851c
Fix keyword spotting. ( #1689 )
...
Reset the stream right after detecting a keyword
2025-01-20 16:41:10 +08:00
Fangjun Kuang
bad82f35cf
Update README.md for KWS to not use git lfs. ( #1729 )
2025-01-17 17:48:42 +08:00
Fangjun Kuang
3a1de0bfc1
Add JavaScript (WebAssembly) API for Kokoro TTS models. ( #1726 )
2025-01-17 11:17:18 +08:00
Fangjun Kuang
3eced3e7ee
Add C# and JavaScript (wasm) API for MatchaTTS models ( #1682 )
2025-01-05 15:08:19 +08:00
Fangjun Kuang
8436ba834c
Add WebAssembly example for VAD + Moonshine models. ( #1535 )
2024-11-13 21:06:50 +08:00
Fangjun Kuang
6f261d39f3
Add JavaScript API for Moonshine models ( #1480 )
2024-10-27 11:31:01 +08:00
Fangjun Kuang
eefc172095
JavaScript API with WebAssembly for speaker diarization ( #1414 )
...
#1408 uses [node-addon-api](https://github.com/nodejs/node-addon-api ) to call C API from JavaScript, whereas this pull request uses WebAssembly to call C API from JavaScript.
2024-10-11 11:40:10 +08:00
Fangjun Kuang
1d061df355
WebAssembly exmaple for speaker diarization ( #1411 )
2024-10-10 22:14:45 +08:00
Fangjun Kuang
d8809b520e
Fix CI errors introduced by supporting loading keywords from buffers ( #1366 )
2024-09-20 19:04:21 +08:00
Fangjun Kuang
e7ffcbd677
Add APIs about max speech duration in VAD for various programming languages ( #1349 )
2024-09-14 12:30:13 +08:00
Fangjun Kuang
544857b097
Fix building ( #1343 )
2024-09-13 13:33:52 +08:00
Fangjun Kuang
1977c8d04d
fix wasm app for streaming paraformer ( #1328 )
2024-09-08 17:49:19 +08:00
Fangjun Kuang
5ed8e31868
Add VAD and keyword spotting for the Node package with WebAssembly ( #1286 )
2024-08-24 23:05:54 +08:00
Fangjun Kuang
537e163dd0
WebAssembly example for VAD + Non-streaming ASR ( #1284 )
2024-08-24 13:24:52 +08:00
Fangjun Kuang
1ef8a7a202
Add WebAssembly for VAD ( #1281 )
2024-08-23 17:08:37 +08:00
Fangjun Kuang
94e256244d
Add blank penalty for various language bindings. ( #1234 )
2024-08-08 10:43:31 +08:00
Fangjun Kuang
4e6aeff07e
Refactor C API to prefix each API with SherpaOnnx. ( #1171 )
2024-07-26 18:47:02 +08:00
Fangjun Kuang
70d14353bb
Add WebAssembly for SenseVoice ( #1158 )
2024-07-21 15:39:55 +08:00
Fangjun Kuang
6789c909d2
Inverse text normalization API of streaming ASR for various programming languages ( #1022 )
2024-06-18 13:42:17 +08:00
Fangjun Kuang
6e09933d99
Inverse text normalization API for other programming languages ( #1019 )
2024-06-17 17:02:39 +08:00
Fangjun Kuang
c214d8fb74
fix kws for WebAssembly ( #999 )
2024-06-14 10:22:56 +08:00
Fangjun Kuang
fd5a0d1e00
Add C++ runtime for Tele-AI/TeleSpeech-ASR ( #970 )
2024-06-05 00:26:40 +08:00
Fangjun Kuang
f8dbc10146
Fix CI ( #964 )
2024-06-04 17:05:49 +08:00
Fangjun Kuang
4f21aabd3c
Fix CI for JavaScript and Python APIs. ( #901 )
2024-05-22 13:57:00 +08:00
Fangjun Kuang
8af2af8466
Add tail_paddings to Whisper C API. ( #886 )
2024-05-17 09:20:07 +08:00
Fangjun Kuang
6686c7d3e6
Add dict_dir arg to c api to support Chinese TTS models using jieba ( #809 )
2024-04-25 12:28:31 +08:00
Fangjun Kuang
69440e481f
Add WearOS demo for audio tagging ( #777 )
2024-04-17 12:22:17 +08:00
Fangjun Kuang
0f4705f775
Fix WASM for kws ( #758 )
2024-04-12 18:57:21 +08:00
Fangjun Kuang
a5f8fbc83f
Support heteronyms in Chinese TTS ( #738 )
2024-04-08 11:01:30 +08:00
Fangjun Kuang
c1c0f5bafd
return timestamps for WebAssembly ( #737 )
2024-04-05 20:24:27 +08:00
Fangjun Kuang
dbff2eaadb
Add C API for streaming HLG decoding ( #734 )
2024-04-05 10:31:20 +08:00
Fangjun Kuang
2ededa7e98
Fix building wasm in CI ( #720 )
2024-03-31 20:50:56 +08:00
Fangjun Kuang
acf0975153
Support whisper language/task in various language bindings. ( #679 )
2024-03-20 16:43:35 +08:00
Fangjun Kuang
9d6eb3e834
small fixes to wasm kws. ( #672 )
2024-03-18 15:28:10 +08:00
Lovemefan
009ed2cd30
add WebAssembly for Kws ( #648 )
2024-03-11 21:02:31 +08:00