Fangjun Kuang
71824992a7
Add Java API for speech enhancement GTCRN models ( #2009 )
2025-03-16 15:13:20 +08:00
Fangjun Kuang
ed8e6c9aed
Add Kotlin API for speech enhancement GTCRN models ( #2008 )
2025-03-16 10:41:01 +08:00
Fangjun Kuang
c972554ad1
Add JavaScript API (wasm) for speech enhancement GTCRN models ( #2007 )
2025-03-15 17:41:23 +08:00
Fangjun Kuang
d320fdf65e
Add WebAssembly (WASM) for speech enhancement GTCRN models ( #2002 )
2025-03-13 18:35:03 +08:00
Fangjun Kuang
6a97f8adcf
Add JavaScript (node-addon) API for speech enhancement GTCRN models ( #1996 )
2025-03-12 15:52:01 +08:00
Fangjun Kuang
fd78a482df
Add Dart API for speech enhancement GTCRN models ( #1993 )
2025-03-12 12:39:08 +08:00
Fangjun Kuang
c3b009988b
Add Pascal API for speech enhancement GTCRN models ( #1992 )
2025-03-12 10:48:59 +08:00
Fangjun Kuang
d78f408362
Add Go API for speech enhancement GTCRN models ( #1991 )
2025-03-11 19:33:05 +08:00
Fangjun Kuang
d3e27d5e21
Add C# API for speech enhancement GTCRN models ( #1990 )
2025-03-11 18:58:17 +08:00
Fangjun Kuang
c12d1d88c0
Add Swift API for speech enhancement GTCRN models ( #1989 )
2025-03-11 18:03:13 +08:00
Fangjun Kuang
802119db17
Add CXX API for speech enhancement GTCRN models ( #1986 )
2025-03-11 17:07:52 +08:00
Fangjun Kuang
c5dbf1177c
Add C API for speech enhancement GTCRN models ( #1984 )
2025-03-11 15:50:04 +08:00
Fangjun Kuang
5d2d792b1d
Add Python API for speech enhancement GTCRN models ( #1978 )
2025-03-10 19:02:17 +08:00
Fangjun Kuang
488a6e687c
Add C++ runtime for speech enhancement GTCRN models ( #1977 )
...
See also https://github.com/Xiaobin-Rong/gtcrn
2025-03-10 18:11:16 +08:00
franck-li
8aaae91d4a
add SherpaOnnxOfflineRecognizerSetConfig binding for go, and optimize the new/free for C.struct_SherpaOnnxOfflineRecognizerConfig ptr ( #1976 )
...
Co-authored-by: liyuzhi <liyuzhi@info.easeus.com.cn >
2025-03-10 18:04:12 +08:00
cjsdurj
b87fce9a7f
c-api add wave write to buffer. ( #1962 )
...
Co-authored-by: jian.chen03 <jian.chen03@transwarp.io >
2025-03-10 17:21:23 +08:00
Fangjun Kuang
6e261ed63f
Export gtcrn models to sherpa-onnx ( #1975 )
2025-03-10 11:31:18 +08:00
Fangjun Kuang
362ddf2c07
Add C++ demo for VAD+non-streaming ASR ( #1964 )
2025-03-07 11:49:46 +08:00
Fangjun Kuang
1e2328242d
Test using sherpa-onnx as a cmake subproject ( #1961 )
2025-03-06 12:12:56 +08:00
Karel Vesely
7740dbfb96
Ebranchformer ( #1951 )
...
* adding ebranchformer encoder
* extend surfaced FeatureExtractorConfig
- so ebranchformer feature extraction can be configured from Python
- the GlobCmvn is not needed, as it is a module in the OnnxEncoder
* clean the code
* Integrating remarks from Fangjun
2025-03-04 19:41:09 +08:00
Fangjun Kuang
209eaaae1d
Limit number of tokens per second for whisper. ( #1958 )
...
Otherwise, it spends lots of time in the loop if the EOT token
is not predicted.
2025-03-04 15:45:28 +08:00
Fangjun Kuang
49177530ff
Update README to include projects that is using sherpa-onnx ( #1956 )
2025-03-04 14:45:07 +08:00
Fangjun Kuang
c9d6859df7
Add transducer modified_beam_search for RKNN. ( #1949 )
2025-03-03 13:15:25 +08:00
Fangjun Kuang
d5e7b51af5
Support RKNN for Zipformer CTC models. ( #1948 )
2025-03-02 21:40:13 +08:00
Fangjun Kuang
dfcbc8d40b
Add Kokoro v1.1-zh ( #1942 )
2025-02-28 15:47:59 +08:00
Fangjun Kuang
f5dfcf8d2f
Add Kotlin and Java API for online punctuation models ( #1936 )
2025-02-27 16:52:36 +08:00
Fangjun Kuang
815ebac8f9
Fix building wheels for Python 3.7 ( #1933 )
2025-02-27 13:02:46 +08:00
Fangjun Kuang
337d5f7a80
Release v1.10.46 ( #1929 )
2025-02-26 19:19:33 +08:00
Fangjun Kuang
eebe19997d
Build wheels for rknn linux aarch64 ( #1928 )
2025-02-26 18:58:57 +08:00
Fangjun Kuang
82cb8a5dc3
Minor fixes for rknn ( #1925 )
2025-02-26 16:26:18 +08:00
Fangjun Kuang
2f9a2b20a1
Fix publishing macos pre-built artifacts ( #1922 )
2025-02-26 11:52:01 +08:00
xcel3011
b042f5e179
fix: AddPunct panic for Go( #1921 )
2025-02-25 18:09:28 +08:00
franck-li
0dcaf3a061
go.mod set to use go 1.17, and use unsafe.Slice to optimize the code ( #1920 )
...
Co-authored-by: liyuzhi <liyuzhi@info.easeus.com.cn >
2025-02-25 15:31:15 +08:00
Fangjun Kuang
dc2f7e9f9b
Fix publishing linux pre-built artifacts ( #1919 )
2025-02-25 15:22:50 +08:00
Grey Faulkenberry, MD MPH
70742b69ec
Flutter Config toJson/fromJson ( #1893 )
2025-02-25 14:43:48 +08:00
franck-li
808587accd
change [1<<28] to [1<<10], to fix build issues on GOARCH=386 that [1<<28] too large ( #1916 )
...
Co-authored-by: liyuzhi <liyuzhi@info.easeus.com.cn >
2025-02-25 09:12:55 +08:00
Fangjun Kuang
4d79e6a007
Add C++ API for streaming zipformer ASR on RK NPU ( #1908 )
2025-02-24 19:07:37 +08:00
Fangjun Kuang
bafd1103d0
Add onnxruntime 1.18.1 for Linux aarch64 GPU ( #1914 )
2025-02-24 16:14:14 +08:00
wanghsinche
7774e35749
feat: add mic example for better compatibility ( #1909 )
...
Co-authored-by: wanghsinche <wanghsinche>
2025-02-21 21:47:21 +08:00
ivan provalov
94728bfbee
Fixing Whisper Model Token Normalization ( #1904 )
2025-02-21 12:58:01 +08:00
Fangjun Kuang
ed922e61b5
Fix publishing pre-built windows libraries ( #1905 )
2025-02-21 11:59:27 +08:00
Sheldon Robinson
9c810ce3fe
Fix #1901 : UnicodeEncodeError running export_bpe_vocab.py ( #1902 )
2025-02-21 06:29:28 +08:00
ivan provalov
4801094133
JNI Exception Handling ( #1452 )
2025-02-19 23:02:28 +08:00
張小凡
654d2285eb
Reduce vad-whisper-c-api example code. ( #1891 )
2025-02-18 16:25:43 +08:00
Fangjun Kuang
774cf662a4
Fix generating Chinese lexicon for Kokoro TTS 1.0 ( #1888 )
2025-02-18 12:51:26 +08:00
luffy
4e83b3473b
speaker-identification-with-vad-non-streaming-asr.py Lack of support for sense_voice. ( #1884 )
2025-02-18 12:34:47 +08:00
Fangjun Kuang
26d5f1fa01
Fix kokoro lexicon. ( #1886 )
...
还钱 is mispronounced.
2025-02-18 11:09:16 +08:00
Fangjun Kuang
9711ab2474
Release v1.10.45 ( #1881 )
2025-02-17 16:20:04 +08:00
Fangjun Kuang
614c51068b
Add Pascal API for FireRedAsr AED Model ( #1877 ) ( #1880 )
2025-02-17 16:06:18 +08:00
Fangjun Kuang
87a968b55d
Add Go API for FireRedAsr AED Model ( #1879 )
2025-02-17 16:04:07 +08:00