Commit Graph

1154 Commits

Author SHA1 Message Date
HaoWang0101
dcaf9dd208 Comment refinement: Add note about vocoder file for matcha TTS config (#2106) 2025-04-08 12:56:41 +08:00
Askars Salimbajevs
664b461d01 Disable strict hotword matching mode for offline transducer (#1837)
* Disable strict hotword matching mode for offline transducer. Also introduces new variable, so that later this mode can be switched on in the runtime.

* remove strict mode variable

---------

Co-authored-by: Askars Salimbajevs <askars.salimbajevs@tilde.lv>
2025-04-03 22:52:19 +08:00
Fangjun Kuang
31ced58f9a Release v1.11.3 (#2097) 2025-04-03 16:19:01 +08:00
Fangjun Kuang
8137ac9f0b Add Pascal API for Dolphin CTC models (#2096) 2025-04-03 16:00:22 +08:00
Fangjun Kuang
07a5701af6 Add Dart API for Dolphin CTC models (#2095) 2025-04-03 15:59:38 +08:00
Fangjun Kuang
903e825eba Add Javascript (node-addon) API for Dolphin CTC models (#2094) 2025-04-03 15:03:33 +08:00
Fangjun Kuang
639ad1744f Add Javascript (WebAssembly) API for Dolphin CTC models (#2093) 2025-04-03 15:02:06 +08:00
Fangjun Kuang
74f402e490 Add Swift API for Dolphin CTC models (#2091) 2025-04-03 00:03:11 +08:00
Fangjun Kuang
ba7d8b63f0 Add Go API for Dolphin CTC models (#2090) 2025-04-03 00:02:09 +08:00
Fangjun Kuang
2dc0f91904 Add C# API for Dolphin CTC models (#2089) 2025-04-02 23:36:22 +08:00
Askars Salimbajevs
18a6ed5ddc Preserve more context after endpointing in transducer (#2061) 2025-04-02 23:33:47 +08:00
Fangjun Kuang
da4aad1189 Add C and CXX API for Dolphin CTC models (#2088) 2025-04-02 21:54:20 +08:00
Fangjun Kuang
eee5575836 Add Kotlin and Java API for Dolphin CTC models (#2086) 2025-04-02 21:16:14 +08:00
Fangjun Kuang
0de7e1b9f0 Add C++ and Python API for Dolphin CTC models (#2085) 2025-04-02 19:09:00 +08:00
Fangjun Kuang
1316719e23 Fix building for android (#2081) 2025-04-01 19:36:40 +08:00
Fangjun Kuang
a11e359c11 Refactor rknn code (#2079) 2025-04-01 16:54:53 +08:00
Fangjun Kuang
8e51a97550 Add C++ runtime for silero_vad with RKNN (#2078) 2025-04-01 15:56:56 +08:00
Fangjun Kuang
0703bc1b86 Add CXX API for VAD (#2077) 2025-04-01 14:51:43 +08:00
Fangjun Kuang
6ef9aeb8d8 Fix building aar to include speech denoiser (#2069) 2025-03-30 14:42:57 +08:00
Anders Xiao
ce196fceae fix dml with preinstall ort (#2066) 2025-03-30 12:07:19 +08:00
Fangjun Kuang
3420c89883 Export silero_vad v4 to RKNN (#2067) 2025-03-30 12:00:52 +08:00
niansa/tuxifan
9d23606ee6 Allow building repository as CMake subdirectory (#2059)
* Use PROJECT_SOURCE_DIR rather than CMAKE_SOURCE_DIR to allow building as subdirectory

* Also use PROJECT_SOURCE_DIR instead of CMAKE_SOURCE_DIR in c/cxx api examples

* Only build examples by default when not building as subdirectory

* Do not suggest building binaries either

---------

Co-authored-by: user <user@mail.tld>
2025-03-29 06:27:59 +08:00
Fangjun Kuang
a5dd0cdfc3 Fix length scale for kokoro tts (#2060) 2025-03-27 10:52:01 +08:00
yourengod
bd61c1d8e5 Change scale factor to 32767 (#2056) 2025-03-26 10:44:49 +08:00
Fangjun Kuang
823e2e6257 Fix building wheels for RKNN (#2041) 2025-03-22 18:33:32 +08:00
Jov
ef759b7b8b fix case (#2037)
v should be V
2025-03-21 16:46:13 +08:00
Jov
572c8d292c fix vits dict dir config (#2036) 2025-03-21 16:30:54 +08:00
Fangjun Kuang
419f7fea0a Release v1.11.2 (#2035) 2025-03-21 14:05:57 +08:00
Sangeet Sagar
31096e43bd fix static linking (#2032) 2025-03-21 12:47:45 +08:00
谢乃闻
e4dff6466e Fix build script: add 'cd build' after 'mkdir build' to ensure the correct working directory for CMake (#2033) 2025-03-21 06:42:19 +08:00
Fangjun Kuang
ee2b8d0a28 Fix crash in Android tts engine demo. (#2029) 2025-03-20 10:41:52 +08:00
Fangjun Kuang
a19e57604e Fix Matcha + vocos for Android (#2024) 2025-03-19 18:39:10 +08:00
Fangjun Kuang
a50901f366 Fix a bug in vad.reset() (#2023)
We also need to clear _last
2025-03-19 17:42:05 +08:00
Fangjun Kuang
83e944d121 Update README to include more projects using sherpa-onnx (#2022) 2025-03-19 12:11:11 +08:00
Fangjun Kuang
982a1f14f8 Support cuda12 and cudnn8 for Linux aarch64. (#2021) 2025-03-19 11:21:06 +08:00
Fangjun Kuang
1f52ac2126 add alsa example for vad+offline asr (#2020) 2025-03-18 20:06:24 +08:00
Fangjun Kuang
0e0afb2cc8 Publish jar for more java versions (#2017) 2025-03-18 11:42:27 +08:00
Fangjun Kuang
406272210f Fix CI (#2016) 2025-03-17 22:31:36 +08:00
Fangjun Kuang
bdf84a7cf0 Release v1.11.1 (#2015) 2025-03-17 17:32:51 +08:00
Fangjun Kuang
0aacf02dd8 Add C++ runtime for vocos (#2014) 2025-03-17 17:05:15 +08:00
Fangjun Kuang
623cdc9eec Export vocos to sherpa-onnx (#2012) 2025-03-17 09:19:50 +08:00
Fangjun Kuang
f110c776ac Release v1.11.0 (#2010) 2025-03-16 15:27:36 +08:00
Fangjun Kuang
71824992a7 Add Java API for speech enhancement GTCRN models (#2009) 2025-03-16 15:13:20 +08:00
Fangjun Kuang
ed8e6c9aed Add Kotlin API for speech enhancement GTCRN models (#2008) 2025-03-16 10:41:01 +08:00
Fangjun Kuang
c972554ad1 Add JavaScript API (wasm) for speech enhancement GTCRN models (#2007) 2025-03-15 17:41:23 +08:00
Fangjun Kuang
d320fdf65e Add WebAssembly (WASM) for speech enhancement GTCRN models (#2002) 2025-03-13 18:35:03 +08:00
Fangjun Kuang
6a97f8adcf Add JavaScript (node-addon) API for speech enhancement GTCRN models (#1996) 2025-03-12 15:52:01 +08:00
Fangjun Kuang
fd78a482df Add Dart API for speech enhancement GTCRN models (#1993) 2025-03-12 12:39:08 +08:00
Fangjun Kuang
c3b009988b Add Pascal API for speech enhancement GTCRN models (#1992) 2025-03-12 10:48:59 +08:00
Fangjun Kuang
d78f408362 Add Go API for speech enhancement GTCRN models (#1991) 2025-03-11 19:33:05 +08:00