Fangjun Kuang
0bb325cecd
Fix building sherpa-onnx ( #2262 )
2025-05-29 16:11:22 +08:00
Fangjun Kuang
8e6826521e
Update kaldi-native-fbank. ( #2259 )
...
Now it supports FFT of an even number, not necessarily a power of 2.
2025-05-29 10:34:22 +08:00
Fangjun Kuang
d8b5a58898
repair rknn wheels ( #2257 )
2025-05-28 17:39:55 +08:00
Fangjun Kuang
16a3449945
Build APK with replace.fst ( #2254 )
2025-05-28 12:19:29 +08:00
Skepller
640ceb5513
JAVA-API: Manual Library Loading Support for Restricted Environments ( #2253 )
...
* feat: Added LibraryLoader that allows loading to be skipped
* feat: Changed static call to new LibraryLoader
* feat: Makefile adjustment
2025-05-28 06:13:39 +08:00
yegyu
2107afdbd4
Add include headers for __ANDROID_API__,__OHOS__ ( #2251 )
2025-05-27 14:44:06 +08:00
Fangjun Kuang
716ba8317b
Add C++ runtime for spleeter about source separation ( #2242 )
2025-05-23 22:30:57 +08:00
Fangjun Kuang
55a44793e6
Export spleeter model to onnx for source separation ( #2237 )
2025-05-22 15:09:38 +08:00
Fangjun Kuang
901b3f0150
Fix publishing binaries for RKNN ( #2234 )
2025-05-21 11:59:41 +08:00
Fangjun Kuang
5113094352
Fix building RKNN wheels ( #2233 )
2025-05-21 11:15:18 +08:00
Fangjun Kuang
ff6f3b17ac
Use jlong explicitly in jni. ( #2229 )
2025-05-20 15:29:47 +08:00
Fangjun Kuang
02c902a079
Release v1.12.0 ( #2221 )
2025-05-15 16:03:17 +08:00
Fangjun Kuang
d8bb20710d
Add script to build APK for simulated-streaming-asr. ( #2220 )
2025-05-15 15:40:22 +08:00
Fangjun Kuang
99defc5b90
Add nodejs example for parakeet-tdt-0.6b-v2. ( #2219 )
2025-05-15 11:27:22 +08:00
esavin
aeb311db50
Expose dither for JNI ( #2215 )
2025-05-14 23:38:25 +08:00
Fangjun Kuang
2e9e0b4e9e
Add Android demo for real-time ASR with non-streaming ASR models. ( #2214 )
2025-05-14 19:10:44 +08:00
愚者自愚
116977b5d4
Add Go implementation of the TTS generation callback ( #2213 )
2025-05-14 16:09:31 +08:00
Fangjun Kuang
0dfafed7d0
Support homophone replacer in Android asr demo. ( #2210 )
2025-05-14 10:58:35 +08:00
Fangjun Kuang
fdda292d5a
Add alsa-based streaming ASR example for sense voice. ( #2207 )
2025-05-13 19:08:09 +08:00
Fangjun Kuang
9a0e16f092
Support sending is_eof for online websocket server. ( #2204 )
...
is_final=true means an endpoint is detected.
is_eof=true means all received samples have been processed
by the server.
2025-05-13 14:49:22 +08:00
Fangjun Kuang
cf67d72c22
Add a link to YouTube video including sherpa-onnx. ( #2202 )
2025-05-12 12:09:46 +08:00
Fangjun Kuang
b269e5cccc
Add C++ example for real-time ASR with nvidia/parakeet-tdt-0.6b-v2. ( #2201 )
2025-05-11 16:30:38 +08:00
Fangjun Kuang
028b8f2718
Add C++ example for streaming ASR with SenseVoice. ( #2199 )
2025-05-11 00:23:32 +08:00
Del
fc2121c307
docs: add Open-XiaoAI KWS project ( #2198 )
2025-05-10 14:03:56 +08:00
Fangjun Kuang
53518efd2f
Add real-time speech recognition example for SenseVoice. ( #2197 )
2025-05-10 00:50:40 +08:00
Fangjun Kuang
4a833a7547
Fix displaying streaming speech recognition results for Python. ( #2196 )
2025-05-09 21:48:49 +08:00
Fangjun Kuang
a6834f6556
Show verbose logs in homophone replacer ( #2194 )
2025-05-09 10:48:30 +08:00
Fangjun Kuang
562a5f7d9b
Fix building wheels for macOS ( #2192 )
2025-05-08 19:15:33 +08:00
Fangjun Kuang
baec2da745
Release v1.11.5 ( #2187 )
2025-05-08 11:39:16 +08:00
Fangjun Kuang
f9c99032c3
Avoid NaN in feature normalization. ( #2186 )
2025-05-08 11:22:47 +08:00
Fangjun Kuang
f00066db88
Add C++ runtime for parakeet-tdt-0.6b-v2. ( #2181 )
2025-05-06 16:59:01 +08:00
Fangjun Kuang
d660d5adea
export parakeet-tdt-0.6b-v2 to sherpa-onnx ( #2180 )
2025-05-06 16:32:59 +08:00
Fangjun Kuang
abc4daa49a
Release v1.11.4 ( #2169 )
2025-05-01 11:36:44 +08:00
Fangjun Kuang
fcb4c4eb2c
Add Go API for homophone replacer ( #2168 )
2025-04-30 23:47:38 +08:00
Fangjun Kuang
85df96d528
Add Dart API for homophone replacer ( #2167 )
2025-04-30 23:15:28 +08:00
Fangjun Kuang
e537094b07
Add Kotlin and Java API for homophone replacer ( #2166 )
...
* Add Kotlin API for homonphone replacer
* Add Java API for homonphone replacer
2025-04-29 22:55:21 +08:00
Fangjun Kuang
50b5329572
Add C# API for homophone replacer ( #2165 )
2025-04-29 21:36:38 +08:00
Fangjun Kuang
63d01a9534
Add Swift API for homophone replacer. ( #2164 )
2025-04-29 18:50:41 +08:00
Fangjun Kuang
4a7a974a04
More fix for building without tts ( #2162 )
2025-04-29 16:31:31 +08:00
Fangjun Kuang
51f8824219
Add homonphone replacer example for Python API. ( #2161 )
2025-04-29 15:59:34 +08:00
Fangjun Kuang
73dba27145
Fix building without TTS ( #2159 )
2025-04-29 15:14:50 +08:00
Fangjun Kuang
9d25c90a59
Add JavaScript API (node-addon) for homophone replacer ( #2158 )
2025-04-28 20:52:42 +08:00
Fangjun Kuang
a0aef1f6cd
Add JavaScript API (WASM) for homophone replacer ( #2157 )
2025-04-28 20:47:49 +08:00
Fangjun Kuang
e51c37eb2f
Add C and CXX API for homophone replacer ( #2156 )
2025-04-27 22:09:13 +08:00
Fangjun Kuang
f64c58342b
Support replacing homonphonic phrases ( #2153 )
2025-04-27 15:31:11 +08:00
Fangjun Kuang
e3280027f9
Support decoding multiple streams in Java API. ( #2149 )
2025-04-25 11:18:57 +08:00
Fangjun Kuang
48ab90aadc
Fix setting OnlineModelConfig in Java API ( #2147 )
2025-04-24 16:25:36 +08:00
Fangjun Kuang
72742d5472
Fix punctuations for kokoro tts 1.1-zh. ( #2146 )
2025-04-24 15:08:47 +08:00
Karel Vesely
6a1efd8ac2
online-transducer: reset the encoder toghter with 2 previous output symbols (non-blank) ( #2129 )
...
* online-transducer: reset the encoder toghter with 2 previous output symbols (non-blank)
- added `reset_encoder` boolean member into the OnlineRecognizerConfig class
- by default the encoder is not reset
* pybind11, adding empty symbols for disabled modules (tts, diarization)
* reset_encoder, add default value (false) [pybind11]
2025-04-24 08:18:11 +08:00
Fangjun Kuang
921c4370e6
Fix building for open harmonyOS ( #2142 )
2025-04-21 23:07:16 +08:00