Commit Graph

50 Commits

Author SHA1 Message Date
Fangjun Kuang
716ba8317b Add C++ runtime for spleeter about source separation (#2242) 2025-05-23 22:30:57 +08:00
Fangjun Kuang
a0aef1f6cd Add JavaScript API (WASM) for homophone replacer (#2157) 2025-04-28 20:47:49 +08:00
Fangjun Kuang
639ad1744f Add Javascript (WebAssembly) API for Dolphin CTC models (#2093) 2025-04-03 15:02:06 +08:00
Fangjun Kuang
c972554ad1 Add JavaScript API (wasm) for speech enhancement GTCRN models (#2007) 2025-03-15 17:41:23 +08:00
Fangjun Kuang
d320fdf65e Add WebAssembly (WASM) for speech enhancement GTCRN models (#2002) 2025-03-13 18:35:03 +08:00
Fangjun Kuang
7ad44bc43a Add JavaScript API (WebAssembly) for FireRedAsr model. (#1874) 2025-02-17 12:54:18 +08:00
Fangjun Kuang
69f489f0cd Support scaling the duration of a pause in TTS. (#1820) 2025-02-08 12:47:26 +08:00
顾真牛
51b42748e5 Update sherpa-onnx-tts.js VitsModelConfig.model can be none (#1817)
* Update sherpa-onnx-tts.js  VitsModelConfig.model can be none

* Update sherpa-onnx-tts.js
2025-02-08 08:25:19 +08:00
Fangjun Kuang
0610679539 Add JavaScript API (WebAssembly) for Kokoro TTS 1.0 (#1809) 2025-02-07 16:46:03 +08:00
Fangjun Kuang
8b989a851c Fix keyword spotting. (#1689)
Reset the stream right after detecting a keyword
2025-01-20 16:41:10 +08:00
Fangjun Kuang
bad82f35cf Update README.md for KWS to not use git lfs. (#1729) 2025-01-17 17:48:42 +08:00
Fangjun Kuang
3a1de0bfc1 Add JavaScript (WebAssembly) API for Kokoro TTS models. (#1726) 2025-01-17 11:17:18 +08:00
Fangjun Kuang
3eced3e7ee Add C# and JavaScript (wasm) API for MatchaTTS models (#1682) 2025-01-05 15:08:19 +08:00
Fangjun Kuang
8436ba834c Add WebAssembly example for VAD + Moonshine models. (#1535) 2024-11-13 21:06:50 +08:00
Fangjun Kuang
6f261d39f3 Add JavaScript API for Moonshine models (#1480) 2024-10-27 11:31:01 +08:00
Fangjun Kuang
eefc172095 JavaScript API with WebAssembly for speaker diarization (#1414)
#1408 uses [node-addon-api](https://github.com/nodejs/node-addon-api) to call C API from JavaScript, whereas this pull request uses WebAssembly to call C API from JavaScript.
2024-10-11 11:40:10 +08:00
Fangjun Kuang
1d061df355 WebAssembly exmaple for speaker diarization (#1411) 2024-10-10 22:14:45 +08:00
Fangjun Kuang
d8809b520e Fix CI errors introduced by supporting loading keywords from buffers (#1366) 2024-09-20 19:04:21 +08:00
Fangjun Kuang
e7ffcbd677 Add APIs about max speech duration in VAD for various programming languages (#1349) 2024-09-14 12:30:13 +08:00
Fangjun Kuang
544857b097 Fix building (#1343) 2024-09-13 13:33:52 +08:00
Fangjun Kuang
1977c8d04d fix wasm app for streaming paraformer (#1328) 2024-09-08 17:49:19 +08:00
Fangjun Kuang
5ed8e31868 Add VAD and keyword spotting for the Node package with WebAssembly (#1286) 2024-08-24 23:05:54 +08:00
Fangjun Kuang
537e163dd0 WebAssembly example for VAD + Non-streaming ASR (#1284) 2024-08-24 13:24:52 +08:00
Fangjun Kuang
1ef8a7a202 Add WebAssembly for VAD (#1281) 2024-08-23 17:08:37 +08:00
Fangjun Kuang
94e256244d Add blank penalty for various language bindings. (#1234) 2024-08-08 10:43:31 +08:00
Fangjun Kuang
4e6aeff07e Refactor C API to prefix each API with SherpaOnnx. (#1171) 2024-07-26 18:47:02 +08:00
Fangjun Kuang
70d14353bb Add WebAssembly for SenseVoice (#1158) 2024-07-21 15:39:55 +08:00
Fangjun Kuang
6789c909d2 Inverse text normalization API of streaming ASR for various programming languages (#1022) 2024-06-18 13:42:17 +08:00
Fangjun Kuang
6e09933d99 Inverse text normalization API for other programming languages (#1019) 2024-06-17 17:02:39 +08:00
Fangjun Kuang
c214d8fb74 fix kws for WebAssembly (#999) 2024-06-14 10:22:56 +08:00
Fangjun Kuang
fd5a0d1e00 Add C++ runtime for Tele-AI/TeleSpeech-ASR (#970) 2024-06-05 00:26:40 +08:00
Fangjun Kuang
f8dbc10146 Fix CI (#964) 2024-06-04 17:05:49 +08:00
Fangjun Kuang
4f21aabd3c Fix CI for JavaScript and Python APIs. (#901) 2024-05-22 13:57:00 +08:00
Fangjun Kuang
8af2af8466 Add tail_paddings to Whisper C API. (#886) 2024-05-17 09:20:07 +08:00
Fangjun Kuang
6686c7d3e6 Add dict_dir arg to c api to support Chinese TTS models using jieba (#809) 2024-04-25 12:28:31 +08:00
Fangjun Kuang
69440e481f Add WearOS demo for audio tagging (#777) 2024-04-17 12:22:17 +08:00
Fangjun Kuang
0f4705f775 Fix WASM for kws (#758) 2024-04-12 18:57:21 +08:00
Fangjun Kuang
a5f8fbc83f Support heteronyms in Chinese TTS (#738) 2024-04-08 11:01:30 +08:00
Fangjun Kuang
c1c0f5bafd return timestamps for WebAssembly (#737) 2024-04-05 20:24:27 +08:00
Fangjun Kuang
dbff2eaadb Add C API for streaming HLG decoding (#734) 2024-04-05 10:31:20 +08:00
Fangjun Kuang
2ededa7e98 Fix building wasm in CI (#720) 2024-03-31 20:50:56 +08:00
Fangjun Kuang
acf0975153 Support whisper language/task in various language bindings. (#679) 2024-03-20 16:43:35 +08:00
Fangjun Kuang
9d6eb3e834 small fixes to wasm kws. (#672) 2024-03-18 15:28:10 +08:00
Lovemefan
009ed2cd30 add WebAssembly for Kws (#648) 2024-03-11 21:02:31 +08:00
Fangjun Kuang
ed06ced16f Add WebAssembly for NodeJS. (#628) 2024-03-03 20:00:36 +08:00
Fangjun Kuang
ac6825ff11 Refactor WebAssembly for nodejs (#626) 2024-03-02 12:31:36 +08:00
Fangjun Kuang
16ba7e274a Add WebAssembly for ASR (#604) 2024-02-23 17:39:11 +08:00
Fangjun Kuang
a2df3535b7 Install wasm tts in a separate directory (#600) 2024-02-22 11:30:08 +08:00
Fangjun Kuang
7c4b59932a Refactor WebAssembly build script. (#598)
Make it easier to build WebAssembly for ASR.
2024-02-21 16:51:15 +08:00
Fangjun Kuang
d771762868 Support WebAssembly for text-to-speech (#577) 2024-02-08 23:39:12 +08:00