Commit Graph

1275 Commits

Author SHA1 Message Date
谢乃闻
e4dff6466e Fix build script: add 'cd build' after 'mkdir build' to ensure the correct working directory for CMake (#2033) 2025-03-21 06:42:19 +08:00
Fangjun Kuang
ee2b8d0a28 Fix crash in Android tts engine demo. (#2029) 2025-03-20 10:41:52 +08:00
Fangjun Kuang
a19e57604e Fix Matcha + vocos for Android (#2024) 2025-03-19 18:39:10 +08:00
Fangjun Kuang
a50901f366 Fix a bug in vad.reset() (#2023)
We also need to clear _last
2025-03-19 17:42:05 +08:00
Fangjun Kuang
83e944d121 Update README to include more projects using sherpa-onnx (#2022) 2025-03-19 12:11:11 +08:00
Fangjun Kuang
982a1f14f8 Support cuda12 and cudnn8 for Linux aarch64. (#2021) 2025-03-19 11:21:06 +08:00
Fangjun Kuang
1f52ac2126 add alsa example for vad+offline asr (#2020) 2025-03-18 20:06:24 +08:00
Fangjun Kuang
0e0afb2cc8 Publish jar for more java versions (#2017) 2025-03-18 11:42:27 +08:00
Fangjun Kuang
406272210f Fix CI (#2016) 2025-03-17 22:31:36 +08:00
Fangjun Kuang
bdf84a7cf0 Release v1.11.1 (#2015) 2025-03-17 17:32:51 +08:00
Fangjun Kuang
0aacf02dd8 Add C++ runtime for vocos (#2014) 2025-03-17 17:05:15 +08:00
Fangjun Kuang
623cdc9eec Export vocos to sherpa-onnx (#2012) 2025-03-17 09:19:50 +08:00
Fangjun Kuang
f110c776ac Release v1.11.0 (#2010) 2025-03-16 15:27:36 +08:00
Fangjun Kuang
71824992a7 Add Java API for speech enhancement GTCRN models (#2009) 2025-03-16 15:13:20 +08:00
Fangjun Kuang
ed8e6c9aed Add Kotlin API for speech enhancement GTCRN models (#2008) 2025-03-16 10:41:01 +08:00
Fangjun Kuang
c972554ad1 Add JavaScript API (wasm) for speech enhancement GTCRN models (#2007) 2025-03-15 17:41:23 +08:00
Fangjun Kuang
d320fdf65e Add WebAssembly (WASM) for speech enhancement GTCRN models (#2002) 2025-03-13 18:35:03 +08:00
Fangjun Kuang
6a97f8adcf Add JavaScript (node-addon) API for speech enhancement GTCRN models (#1996) 2025-03-12 15:52:01 +08:00
Fangjun Kuang
fd78a482df Add Dart API for speech enhancement GTCRN models (#1993) 2025-03-12 12:39:08 +08:00
Fangjun Kuang
c3b009988b Add Pascal API for speech enhancement GTCRN models (#1992) 2025-03-12 10:48:59 +08:00
Fangjun Kuang
d78f408362 Add Go API for speech enhancement GTCRN models (#1991) 2025-03-11 19:33:05 +08:00
Fangjun Kuang
d3e27d5e21 Add C# API for speech enhancement GTCRN models (#1990) 2025-03-11 18:58:17 +08:00
Fangjun Kuang
c12d1d88c0 Add Swift API for speech enhancement GTCRN models (#1989) 2025-03-11 18:03:13 +08:00
Fangjun Kuang
802119db17 Add CXX API for speech enhancement GTCRN models (#1986) 2025-03-11 17:07:52 +08:00
Fangjun Kuang
c5dbf1177c Add C API for speech enhancement GTCRN models (#1984) 2025-03-11 15:50:04 +08:00
Fangjun Kuang
5d2d792b1d Add Python API for speech enhancement GTCRN models (#1978) 2025-03-10 19:02:17 +08:00
Fangjun Kuang
488a6e687c Add C++ runtime for speech enhancement GTCRN models (#1977)
See also https://github.com/Xiaobin-Rong/gtcrn
2025-03-10 18:11:16 +08:00
franck-li
8aaae91d4a add SherpaOnnxOfflineRecognizerSetConfig binding for go, and optimize the new/free for C.struct_SherpaOnnxOfflineRecognizerConfig ptr (#1976)
Co-authored-by: liyuzhi <liyuzhi@info.easeus.com.cn>
2025-03-10 18:04:12 +08:00
cjsdurj
b87fce9a7f c-api add wave write to buffer. (#1962)
Co-authored-by: jian.chen03 <jian.chen03@transwarp.io>
2025-03-10 17:21:23 +08:00
Fangjun Kuang
6e261ed63f Export gtcrn models to sherpa-onnx (#1975) 2025-03-10 11:31:18 +08:00
Fangjun Kuang
362ddf2c07 Add C++ demo for VAD+non-streaming ASR (#1964) 2025-03-07 11:49:46 +08:00
Fangjun Kuang
1e2328242d Test using sherpa-onnx as a cmake subproject (#1961) 2025-03-06 12:12:56 +08:00
Karel Vesely
7740dbfb96 Ebranchformer (#1951)
* adding ebranchformer encoder

* extend surfaced FeatureExtractorConfig

- so ebranchformer feature extraction can be configured from Python
- the GlobCmvn is not needed, as it is a module in the OnnxEncoder

* clean the code

* Integrating remarks from Fangjun
2025-03-04 19:41:09 +08:00
Fangjun Kuang
209eaaae1d Limit number of tokens per second for whisper. (#1958)
Otherwise, it spends lots of time in the loop if the EOT token
is not predicted.
2025-03-04 15:45:28 +08:00
Fangjun Kuang
49177530ff Update README to include projects that is using sherpa-onnx (#1956) 2025-03-04 14:45:07 +08:00
Fangjun Kuang
c9d6859df7 Add transducer modified_beam_search for RKNN. (#1949) 2025-03-03 13:15:25 +08:00
Fangjun Kuang
d5e7b51af5 Support RKNN for Zipformer CTC models. (#1948) 2025-03-02 21:40:13 +08:00
Fangjun Kuang
dfcbc8d40b Add Kokoro v1.1-zh (#1942) 2025-02-28 15:47:59 +08:00
Fangjun Kuang
f5dfcf8d2f Add Kotlin and Java API for online punctuation models (#1936) 2025-02-27 16:52:36 +08:00
Fangjun Kuang
815ebac8f9 Fix building wheels for Python 3.7 (#1933) 2025-02-27 13:02:46 +08:00
Fangjun Kuang
337d5f7a80 Release v1.10.46 (#1929) 2025-02-26 19:19:33 +08:00
Fangjun Kuang
eebe19997d Build wheels for rknn linux aarch64 (#1928) 2025-02-26 18:58:57 +08:00
Fangjun Kuang
82cb8a5dc3 Minor fixes for rknn (#1925) 2025-02-26 16:26:18 +08:00
Fangjun Kuang
2f9a2b20a1 Fix publishing macos pre-built artifacts (#1922) 2025-02-26 11:52:01 +08:00
xcel3011
b042f5e179 fix: AddPunct panic for Go(#1921) 2025-02-25 18:09:28 +08:00
franck-li
0dcaf3a061 go.mod set to use go 1.17, and use unsafe.Slice to optimize the code (#1920)
Co-authored-by: liyuzhi <liyuzhi@info.easeus.com.cn>
2025-02-25 15:31:15 +08:00
Fangjun Kuang
dc2f7e9f9b Fix publishing linux pre-built artifacts (#1919) 2025-02-25 15:22:50 +08:00
Grey Faulkenberry, MD MPH
70742b69ec Flutter Config toJson/fromJson (#1893) 2025-02-25 14:43:48 +08:00
franck-li
808587accd change [1<<28] to [1<<10], to fix build issues on GOARCH=386 that [1<<28] too large (#1916)
Co-authored-by: liyuzhi <liyuzhi@info.easeus.com.cn>
2025-02-25 09:12:55 +08:00
Fangjun Kuang
4d79e6a007 Add C++ API for streaming zipformer ASR on RK NPU (#1908) 2025-02-24 19:07:37 +08:00