enginex_bi_series-sherpa-onnx

EngineX-Iluvatar/enginex_bi_series-sherpa-onnx

Archived

Author	SHA1	Message	Date
Fangjun Kuang	f835642b1c	Support Zipformer transducer ASR with whisper features. (#2321) Adds support for Zipformer transducer ASR models that use Whisper-style features by introducing a new feature flag, parsing metadata, and integrating per-chunk normalization. - Introduce UseWhisperFeature in the model interface and Zipformer implementation - Parse "feature" metadata to set the whisper flag and wire it into the recognizer - Update feature extraction logic to handle Whisper filterbanks with early returns	2025-06-27 10:40:41 +08:00
Fangjun Kuang	54bf3732d9	Support zipformer CTC ASR with whisper features. (#2319)	2025-06-27 00:15:11 +08:00
Fangjun Kuang	282211c01f	Remove portaudio-go in Go API examples. (#2317) Replace the deprecated portaudio-go integration with malgo in the Go real-time speech recognition example and correct version string typos in the Node.js examples. - Fixed “verison” typo in Node.js console logs. - Swapped out portaudio-go for malgo in the Go microphone example, introducing initRecognizer, callback-driven streaming, and sample conversion. - Removed portaudio-go from go.mod.	2025-06-26 11:33:50 +08:00
Fangjun Kuang	074236ae80	Show cmake debug information. (#2316)	2025-06-25 17:44:51 +08:00
Fangjun Kuang	056da0528d	Release v1.12.2 (#2314)	2025-06-25 00:37:55 +08:00
Fangjun Kuang	bda427f4b2	Add API to get version information (#2309)	2025-06-25 00:22:21 +08:00
Fangjun Kuang	7f2145539d	Update readme to include BreezeApp from MediaTek Research. (#2313) See also https://github.com/mtkresearch/BreezeApp	2025-06-24 18:05:50 +08:00
Fangjun Kuang	6982b86c66	Support extra languages in multi-lang kokoro tts (#2303)	2025-06-20 11:22:52 +08:00
Fangjun Kuang	a6095f5f64	Fix building for Pascal (#2305)	2025-06-20 11:10:07 +08:00
Fangjun Kuang	59d118c256	Refactor kokoro export (#2302) - generate samples for https://k2-fsa.github.io/sherpa/onnx/tts/all/ - provide int8 model for kokoro v0.19 kokoro-int8-en-v0_19.tar.bz2	2025-06-18 20:30:10 +08:00
Fangjun Kuang	3878170991	Fixes #2172 (#2301) Handle the case when the input audio contains no speeches.	2025-06-18 16:48:48 +08:00
DSOE1024	b4716e29a6	Update sherpa-onnx-shared.pc.in (#2300) Fix linking with C++ examples.	2025-06-17 16:55:42 +08:00
Fangjun Kuang	2913cce77c	Add scripts for exporting Piper TTS models to sherpa-onnx (#2299)	2025-06-17 14:23:39 +08:00
Fangjun Kuang	4ae9382bae	Update TTS Engine APK to support multi-lang (#2294)	2025-06-17 14:16:48 +08:00
guoxiangyang	0c42c06f75	update wasm/vad-asr/assets/README.md for more clear (#2297) Co-authored-by: gxy <gxy@conwi.cn>	2025-06-16 15:35:20 +08:00
GlocKieHuan	a135324c8c	Fix isspace on windows in debug build (#2042)	2025-06-09 10:27:16 +08:00
Fangjun Kuang	e13f7dbdd2	Add link to huggingface space for source separation. (#2284)	2025-06-06 13:38:16 +08:00
Fangjun Kuang	d57e4f84de	Add Python API for source separation (#2283)	2025-06-05 20:44:26 +08:00
Fangjun Kuang	6f0fac2064	Add jar for Java 24. (#2280)	2025-06-04 11:08:45 +08:00
Fangjun Kuang	db632dacf3	Fix CI for windows (#2279)	2025-06-04 10:35:48 +08:00
Fangjun Kuang	749dc9a239	Release v1.12.1 (#2277)	2025-06-03 21:55:49 +08:00
Fangjun Kuang	9539af5f5c	Fix 32-bit arm CI (#2276)	2025-06-03 21:02:33 +08:00
Fangjun Kuang	1fabc6c79a	Fix rknn for multi-threads (#2274)	2025-06-03 20:28:57 +08:00
flying-forever	818b3f6d6c	Update utils.dart (#2275) Fix normalizer for int16_t samples in flutter_examples.	2025-06-03 20:28:33 +08:00
Fangjun Kuang	6cb44d44e9	Export nvidia/canary-180m-flash to sherpa-onnx (#2272)	2025-06-02 22:28:15 +08:00
Fangjun Kuang	2b2788332e	Add C++ support for UVR models (#2269)	2025-06-01 17:22:08 +08:00
mtdxc	e0ca224b76	fixed mfc build error (#2267) Co-authored-by: cqm <cqm@97kid.com>	2025-05-31 23:32:35 +08:00
mtdxc	613e8084c2	move portaudio common record code to microphone (#2264) Co-authored-by: cqm <cqm@97kid.com>	2025-05-31 21:48:41 +08:00
Fangjun Kuang	921f0f40cb	Add UVR models for source separation. (#2266)	2025-05-31 13:31:31 +08:00
Fangjun Kuang	93e2819c18	Fix building MFC examples (#2263)	2025-05-29 17:41:45 +08:00
Fangjun Kuang	0bb325cecd	Fix building sherpa-onnx (#2262)	2025-05-29 16:11:22 +08:00
Fangjun Kuang	8e6826521e	Update kaldi-native-fbank. (#2259) Now it supports FFT of an even number, not necessarily a power of 2.	2025-05-29 10:34:22 +08:00
Fangjun Kuang	d8b5a58898	repair rknn wheels (#2257)	2025-05-28 17:39:55 +08:00
Fangjun Kuang	16a3449945	Build APK with replace.fst (#2254)	2025-05-28 12:19:29 +08:00
Skepller	640ceb5513	JAVA-API: Manual Library Loading Support for Restricted Environments (#2253) * feat: Added LibraryLoader that allows loading to be skipped * feat: Changed static call to new LibraryLoader * feat: Makefile adjustment	2025-05-28 06:13:39 +08:00
yegyu	2107afdbd4	Add include headers for __ANDROID_API__,__OHOS__ (#2251)	2025-05-27 14:44:06 +08:00
Fangjun Kuang	716ba8317b	Add C++ runtime for spleeter about source separation (#2242)	2025-05-23 22:30:57 +08:00
Fangjun Kuang	55a44793e6	Export spleeter model to onnx for source separation (#2237)	2025-05-22 15:09:38 +08:00
Fangjun Kuang	901b3f0150	Fix publishing binaries for RKNN (#2234)	2025-05-21 11:59:41 +08:00
Fangjun Kuang	5113094352	Fix building RKNN wheels (#2233)	2025-05-21 11:15:18 +08:00
Fangjun Kuang	ff6f3b17ac	Use jlong explicitly in jni. (#2229)	2025-05-20 15:29:47 +08:00
Fangjun Kuang	02c902a079	Release v1.12.0 (#2221)	2025-05-15 16:03:17 +08:00
Fangjun Kuang	d8bb20710d	Add script to build APK for simulated-streaming-asr. (#2220)	2025-05-15 15:40:22 +08:00
Fangjun Kuang	99defc5b90	Add nodejs example for parakeet-tdt-0.6b-v2. (#2219)	2025-05-15 11:27:22 +08:00
esavin	aeb311db50	Expose dither for JNI (#2215)	2025-05-14 23:38:25 +08:00
Fangjun Kuang	2e9e0b4e9e	Add Android demo for real-time ASR with non-streaming ASR models. (#2214)	2025-05-14 19:10:44 +08:00
愚者自愚	116977b5d4	Add Go implementation of the TTS generation callback (#2213)	2025-05-14 16:09:31 +08:00
Fangjun Kuang	0dfafed7d0	Support homophone replacer in Android asr demo. (#2210)	2025-05-14 10:58:35 +08:00
Fangjun Kuang	fdda292d5a	Add alsa-based streaming ASR example for sense voice. (#2207)	2025-05-13 19:08:09 +08:00
Fangjun Kuang	9a0e16f092	Support sending is_eof for online websocket server. (#2204) is_final=true means an endpoint is detected. is_eof=true means all received samples have been processed by the server.	2025-05-13 14:49:22 +08:00

1245 Commits