Fangjun Kuang
9fe25cc06f
Fix VAD+ASR C++ example. ( #2335 )
...
It was not able to handle short audios., e.g., 2.1 seconds.
2025-07-02 15:52:49 +08:00
Fangjun Kuang
ea3e583ac9
Fix static link without tts ( #2328 )
2025-06-30 14:21:01 +08:00
Fangjun Kuang
046ce01203
Add TTS engline APKs for more models ( #2327 )
2025-06-30 13:36:29 +08:00
Fangjun Kuang
f725cb3306
Refactor release scripts. ( #2323 )
...
It refactors the release scripts to centralize and simplify version updates across
multiple files. Key changes include:
- Introducing variables (old_version, new_version, replace_str) for version substitution.
- Replacing hard-coded sed expressions with dynamic ones in various files.
- Ensuring backup files generated by sed are cleaned up after execution.
2025-06-27 11:22:31 +08:00
Fangjun Kuang
e25634ac39
Release v1.12.3 ( #2322 )
2025-06-27 10:55:46 +08:00
Fangjun Kuang
f835642b1c
Support Zipformer transducer ASR with whisper features. ( #2321 )
...
Adds support for Zipformer transducer ASR models that use Whisper-style
features by introducing a new feature flag, parsing metadata,
and integrating per-chunk normalization.
- Introduce UseWhisperFeature in the model interface and Zipformer implementation
- Parse "feature" metadata to set the whisper flag and wire it into the recognizer
- Update feature extraction logic to handle Whisper filterbanks with early returns
2025-06-27 10:40:41 +08:00
Fangjun Kuang
54bf3732d9
Support zipformer CTC ASR with whisper features. ( #2319 )
2025-06-27 00:15:11 +08:00
Fangjun Kuang
282211c01f
Remove portaudio-go in Go API examples. ( #2317 )
...
Replace the deprecated portaudio-go integration with malgo in the Go real-time
speech recognition example and correct version string typos in the Node.js examples.
- Fixed “verison” typo in Node.js console logs.
- Swapped out portaudio-go for malgo in the Go microphone example,
introducing initRecognizer, callback-driven streaming, and sample conversion.
- Removed portaudio-go from go.mod.
2025-06-26 11:33:50 +08:00
Fangjun Kuang
074236ae80
Show cmake debug information. ( #2316 )
2025-06-25 17:44:51 +08:00
Fangjun Kuang
056da0528d
Release v1.12.2 ( #2314 )
2025-06-25 00:37:55 +08:00
Fangjun Kuang
bda427f4b2
Add API to get version information ( #2309 )
2025-06-25 00:22:21 +08:00
Fangjun Kuang
7f2145539d
Update readme to include BreezeApp from MediaTek Research. ( #2313 )
...
See also https://github.com/mtkresearch/BreezeApp
2025-06-24 18:05:50 +08:00
Fangjun Kuang
6982b86c66
Support extra languages in multi-lang kokoro tts ( #2303 )
2025-06-20 11:22:52 +08:00
Fangjun Kuang
a6095f5f64
Fix building for Pascal ( #2305 )
2025-06-20 11:10:07 +08:00
Fangjun Kuang
59d118c256
Refactor kokoro export ( #2302 )
...
- generate samples for https://k2-fsa.github.io/sherpa/onnx/tts/all/
- provide int8 model for kokoro v0.19 kokoro-int8-en-v0_19.tar.bz2
2025-06-18 20:30:10 +08:00
Fangjun Kuang
3878170991
Fixes #2172 ( #2301 )
...
Handle the case when the input audio contains no speeches.
2025-06-18 16:48:48 +08:00
DSOE1024
b4716e29a6
Update sherpa-onnx-shared.pc.in ( #2300 )
...
Fix linking with C++ examples.
2025-06-17 16:55:42 +08:00
Fangjun Kuang
2913cce77c
Add scripts for exporting Piper TTS models to sherpa-onnx ( #2299 )
2025-06-17 14:23:39 +08:00
Fangjun Kuang
4ae9382bae
Update TTS Engine APK to support multi-lang ( #2294 )
2025-06-17 14:16:48 +08:00
guoxiangyang
0c42c06f75
update wasm/vad-asr/assets/README.md for more clear ( #2297 )
...
Co-authored-by: gxy <gxy@conwi.cn >
2025-06-16 15:35:20 +08:00
GlocKieHuan
a135324c8c
Fix isspace on windows in debug build ( #2042 )
2025-06-09 10:27:16 +08:00
Fangjun Kuang
e13f7dbdd2
Add link to huggingface space for source separation. ( #2284 )
2025-06-06 13:38:16 +08:00
Fangjun Kuang
d57e4f84de
Add Python API for source separation ( #2283 )
2025-06-05 20:44:26 +08:00
Fangjun Kuang
6f0fac2064
Add jar for Java 24. ( #2280 )
2025-06-04 11:08:45 +08:00
Fangjun Kuang
db632dacf3
Fix CI for windows ( #2279 )
2025-06-04 10:35:48 +08:00
Fangjun Kuang
749dc9a239
Release v1.12.1 ( #2277 )
2025-06-03 21:55:49 +08:00
Fangjun Kuang
9539af5f5c
Fix 32-bit arm CI ( #2276 )
2025-06-03 21:02:33 +08:00
Fangjun Kuang
1fabc6c79a
Fix rknn for multi-threads ( #2274 )
2025-06-03 20:28:57 +08:00
flying-forever
818b3f6d6c
Update utils.dart ( #2275 )
...
Fix normalizer for int16_t samples in flutter_examples.
2025-06-03 20:28:33 +08:00
Fangjun Kuang
6cb44d44e9
Export nvidia/canary-180m-flash to sherpa-onnx ( #2272 )
2025-06-02 22:28:15 +08:00
Fangjun Kuang
2b2788332e
Add C++ support for UVR models ( #2269 )
2025-06-01 17:22:08 +08:00
mtdxc
e0ca224b76
fixed mfc build error ( #2267 )
...
Co-authored-by: cqm <cqm@97kid.com >
2025-05-31 23:32:35 +08:00
mtdxc
613e8084c2
move portaudio common record code to microphone ( #2264 )
...
Co-authored-by: cqm <cqm@97kid.com >
2025-05-31 21:48:41 +08:00
Fangjun Kuang
921f0f40cb
Add UVR models for source separation. ( #2266 )
2025-05-31 13:31:31 +08:00
Fangjun Kuang
93e2819c18
Fix building MFC examples ( #2263 )
2025-05-29 17:41:45 +08:00
Fangjun Kuang
0bb325cecd
Fix building sherpa-onnx ( #2262 )
2025-05-29 16:11:22 +08:00
Fangjun Kuang
8e6826521e
Update kaldi-native-fbank. ( #2259 )
...
Now it supports FFT of an even number, not necessarily a power of 2.
2025-05-29 10:34:22 +08:00
Fangjun Kuang
d8b5a58898
repair rknn wheels ( #2257 )
2025-05-28 17:39:55 +08:00
Fangjun Kuang
16a3449945
Build APK with replace.fst ( #2254 )
2025-05-28 12:19:29 +08:00
Skepller
640ceb5513
JAVA-API: Manual Library Loading Support for Restricted Environments ( #2253 )
...
* feat: Added LibraryLoader that allows loading to be skipped
* feat: Changed static call to new LibraryLoader
* feat: Makefile adjustment
2025-05-28 06:13:39 +08:00
yegyu
2107afdbd4
Add include headers for __ANDROID_API__,__OHOS__ ( #2251 )
2025-05-27 14:44:06 +08:00
Fangjun Kuang
716ba8317b
Add C++ runtime for spleeter about source separation ( #2242 )
2025-05-23 22:30:57 +08:00
Fangjun Kuang
55a44793e6
Export spleeter model to onnx for source separation ( #2237 )
2025-05-22 15:09:38 +08:00
Fangjun Kuang
901b3f0150
Fix publishing binaries for RKNN ( #2234 )
2025-05-21 11:59:41 +08:00
Fangjun Kuang
5113094352
Fix building RKNN wheels ( #2233 )
2025-05-21 11:15:18 +08:00
Fangjun Kuang
ff6f3b17ac
Use jlong explicitly in jni. ( #2229 )
2025-05-20 15:29:47 +08:00
Fangjun Kuang
02c902a079
Release v1.12.0 ( #2221 )
2025-05-15 16:03:17 +08:00
Fangjun Kuang
d8bb20710d
Add script to build APK for simulated-streaming-asr. ( #2220 )
2025-05-15 15:40:22 +08:00
Fangjun Kuang
99defc5b90
Add nodejs example for parakeet-tdt-0.6b-v2. ( #2219 )
2025-05-15 11:27:22 +08:00
esavin
aeb311db50
Expose dither for JNI ( #2215 )
2025-05-14 23:38:25 +08:00