Fangjun Kuang
1316719e23
Fix building for android ( #2081 )
2025-04-01 19:36:40 +08:00
Fangjun Kuang
a11e359c11
Refactor rknn code ( #2079 )
2025-04-01 16:54:53 +08:00
Fangjun Kuang
8e51a97550
Add C++ runtime for silero_vad with RKNN ( #2078 )
2025-04-01 15:56:56 +08:00
Fangjun Kuang
0703bc1b86
Add CXX API for VAD ( #2077 )
2025-04-01 14:51:43 +08:00
Fangjun Kuang
6ef9aeb8d8
Fix building aar to include speech denoiser ( #2069 )
2025-03-30 14:42:57 +08:00
Anders Xiao
ce196fceae
fix dml with preinstall ort ( #2066 )
2025-03-30 12:07:19 +08:00
Fangjun Kuang
3420c89883
Export silero_vad v4 to RKNN ( #2067 )
2025-03-30 12:00:52 +08:00
niansa/tuxifan
9d23606ee6
Allow building repository as CMake subdirectory ( #2059 )
...
* Use PROJECT_SOURCE_DIR rather than CMAKE_SOURCE_DIR to allow building as subdirectory
* Also use PROJECT_SOURCE_DIR instead of CMAKE_SOURCE_DIR in c/cxx api examples
* Only build examples by default when not building as subdirectory
* Do not suggest building binaries either
---------
Co-authored-by: user <user@mail.tld >
2025-03-29 06:27:59 +08:00
Fangjun Kuang
a5dd0cdfc3
Fix length scale for kokoro tts ( #2060 )
2025-03-27 10:52:01 +08:00
yourengod
bd61c1d8e5
Change scale factor to 32767 ( #2056 )
2025-03-26 10:44:49 +08:00
Fangjun Kuang
823e2e6257
Fix building wheels for RKNN ( #2041 )
2025-03-22 18:33:32 +08:00
Jov
ef759b7b8b
fix case ( #2037 )
...
v should be V
2025-03-21 16:46:13 +08:00
Jov
572c8d292c
fix vits dict dir config ( #2036 )
2025-03-21 16:30:54 +08:00
Fangjun Kuang
419f7fea0a
Release v1.11.2 ( #2035 )
2025-03-21 14:05:57 +08:00
Sangeet Sagar
31096e43bd
fix static linking ( #2032 )
2025-03-21 12:47:45 +08:00
谢乃闻
e4dff6466e
Fix build script: add 'cd build' after 'mkdir build' to ensure the correct working directory for CMake ( #2033 )
2025-03-21 06:42:19 +08:00
Fangjun Kuang
ee2b8d0a28
Fix crash in Android tts engine demo. ( #2029 )
2025-03-20 10:41:52 +08:00
Fangjun Kuang
a19e57604e
Fix Matcha + vocos for Android ( #2024 )
2025-03-19 18:39:10 +08:00
Fangjun Kuang
a50901f366
Fix a bug in vad.reset() ( #2023 )
...
We also need to clear _last
2025-03-19 17:42:05 +08:00
Fangjun Kuang
83e944d121
Update README to include more projects using sherpa-onnx ( #2022 )
2025-03-19 12:11:11 +08:00
Fangjun Kuang
982a1f14f8
Support cuda12 and cudnn8 for Linux aarch64. ( #2021 )
2025-03-19 11:21:06 +08:00
Fangjun Kuang
1f52ac2126
add alsa example for vad+offline asr ( #2020 )
2025-03-18 20:06:24 +08:00
Fangjun Kuang
0e0afb2cc8
Publish jar for more java versions ( #2017 )
2025-03-18 11:42:27 +08:00
Fangjun Kuang
406272210f
Fix CI ( #2016 )
2025-03-17 22:31:36 +08:00
Fangjun Kuang
bdf84a7cf0
Release v1.11.1 ( #2015 )
2025-03-17 17:32:51 +08:00
Fangjun Kuang
0aacf02dd8
Add C++ runtime for vocos ( #2014 )
2025-03-17 17:05:15 +08:00
Fangjun Kuang
623cdc9eec
Export vocos to sherpa-onnx ( #2012 )
2025-03-17 09:19:50 +08:00
Fangjun Kuang
f110c776ac
Release v1.11.0 ( #2010 )
2025-03-16 15:27:36 +08:00
Fangjun Kuang
71824992a7
Add Java API for speech enhancement GTCRN models ( #2009 )
2025-03-16 15:13:20 +08:00
Fangjun Kuang
ed8e6c9aed
Add Kotlin API for speech enhancement GTCRN models ( #2008 )
2025-03-16 10:41:01 +08:00
Fangjun Kuang
c972554ad1
Add JavaScript API (wasm) for speech enhancement GTCRN models ( #2007 )
2025-03-15 17:41:23 +08:00
Fangjun Kuang
d320fdf65e
Add WebAssembly (WASM) for speech enhancement GTCRN models ( #2002 )
2025-03-13 18:35:03 +08:00
Fangjun Kuang
6a97f8adcf
Add JavaScript (node-addon) API for speech enhancement GTCRN models ( #1996 )
2025-03-12 15:52:01 +08:00
Fangjun Kuang
fd78a482df
Add Dart API for speech enhancement GTCRN models ( #1993 )
2025-03-12 12:39:08 +08:00
Fangjun Kuang
c3b009988b
Add Pascal API for speech enhancement GTCRN models ( #1992 )
2025-03-12 10:48:59 +08:00
Fangjun Kuang
d78f408362
Add Go API for speech enhancement GTCRN models ( #1991 )
2025-03-11 19:33:05 +08:00
Fangjun Kuang
d3e27d5e21
Add C# API for speech enhancement GTCRN models ( #1990 )
2025-03-11 18:58:17 +08:00
Fangjun Kuang
c12d1d88c0
Add Swift API for speech enhancement GTCRN models ( #1989 )
2025-03-11 18:03:13 +08:00
Fangjun Kuang
802119db17
Add CXX API for speech enhancement GTCRN models ( #1986 )
2025-03-11 17:07:52 +08:00
Fangjun Kuang
c5dbf1177c
Add C API for speech enhancement GTCRN models ( #1984 )
2025-03-11 15:50:04 +08:00
Fangjun Kuang
5d2d792b1d
Add Python API for speech enhancement GTCRN models ( #1978 )
2025-03-10 19:02:17 +08:00
Fangjun Kuang
488a6e687c
Add C++ runtime for speech enhancement GTCRN models ( #1977 )
...
See also https://github.com/Xiaobin-Rong/gtcrn
2025-03-10 18:11:16 +08:00
franck-li
8aaae91d4a
add SherpaOnnxOfflineRecognizerSetConfig binding for go, and optimize the new/free for C.struct_SherpaOnnxOfflineRecognizerConfig ptr ( #1976 )
...
Co-authored-by: liyuzhi <liyuzhi@info.easeus.com.cn >
2025-03-10 18:04:12 +08:00
cjsdurj
b87fce9a7f
c-api add wave write to buffer. ( #1962 )
...
Co-authored-by: jian.chen03 <jian.chen03@transwarp.io >
2025-03-10 17:21:23 +08:00
Fangjun Kuang
6e261ed63f
Export gtcrn models to sherpa-onnx ( #1975 )
2025-03-10 11:31:18 +08:00
Fangjun Kuang
362ddf2c07
Add C++ demo for VAD+non-streaming ASR ( #1964 )
2025-03-07 11:49:46 +08:00
Fangjun Kuang
1e2328242d
Test using sherpa-onnx as a cmake subproject ( #1961 )
2025-03-06 12:12:56 +08:00
Karel Vesely
7740dbfb96
Ebranchformer ( #1951 )
...
* adding ebranchformer encoder
* extend surfaced FeatureExtractorConfig
- so ebranchformer feature extraction can be configured from Python
- the GlobCmvn is not needed, as it is a module in the OnnxEncoder
* clean the code
* Integrating remarks from Fangjun
2025-03-04 19:41:09 +08:00
Fangjun Kuang
209eaaae1d
Limit number of tokens per second for whisper. ( #1958 )
...
Otherwise, it spends lots of time in the loop if the EOT token
is not predicted.
2025-03-04 15:45:28 +08:00
Fangjun Kuang
49177530ff
Update README to include projects that is using sherpa-onnx ( #1956 )
2025-03-04 14:45:07 +08:00