EngineX-Iluvatar/enginex_bi_series-sherpa-onnx

Archived

Author	SHA1	Message	Date
Fangjun Kuang	df4615ca1d	Add C/CXX/JavaScript API for NeMo Canary models (#2357 ) This PR introduces support for NeMo Canary models across C, C++, and JavaScript APIs by adding new Canary configuration structures, updating bindings, extending examples, and enhancing CI workflows. - Add OfflineCanaryModelConfig to all language bindings (C, C++, JS, ETS). - Implement SetConfig methods and NAPI wrappers for updating recognizer config at runtime. - Update examples and CI scripts to demonstrate and test NeMo Canary model usage.	2025-07-07 23:38:04 +08:00
Fangjun Kuang	0e738c356c	Add C++ runtime and Python API for NeMo Canary models (#2352 )	2025-07-07 17:03:49 +08:00
Fangjun Kuang	f8d957a24b	Update README to include https://github.com/bbeyondllove/asr_server (#2353 )	2025-07-07 10:17:20 +08:00
Fangjun Kuang	fce481c125	Add meta data to NeMo canary ONNX models (#2351 )	2025-07-07 00:12:20 +08:00
Fangjun Kuang	25f9cec072	Update readme to include https://github.com/mawwalker/stt-server (#2350 )	2025-07-07 00:02:09 +08:00
Fangjun Kuang	c1e9e5c87f	Fix TTS for Unreal Engine (#2349 ) Unreal Engine has its own memory management, so we cannot return a struct containing a std::vector object.	2025-07-06 19:20:26 +08:00
lucaelin	5ebb71909b	fix(canary): use dynamo export, single input_ids and avoid 0/1 specialization (#2348 )	2025-07-06 18:24:06 +08:00
Fangjun Kuang	d70b789582	Fix testing dart packages (#2345 )	2025-07-04 22:27:24 +08:00
linsui	33a689dc86	Fix typo CMAKE_EXECUTBLE_LINKER_FLAGS -> CMAKE_EXECUTABLE_LINKER_FLAGS (#2344 )	2025-07-04 21:13:39 +08:00
Fangjun Kuang	e6b388067d	Release v1.12.4 (#2343 )	2025-07-04 19:41:02 +08:00
Fangjun Kuang	53a3ad366b	Support linux aarch64 for Dart and Flutter (#2342 ) Adds support for building and packaging Linux AArch64 (arm64) artifacts alongside x64 for Dart/Flutter plugins. - Detects host architecture in CMake and adjusts library paths - Extends test workflows to run on an ARM runner and handle linux-aarch64 paths - Splits release pipeline into separate x64 and aarch64 build/package jobs	2025-07-04 19:33:48 +08:00
Fangjun Kuang	3bf986d08d	Support non-streaming zipformer CTC ASR models (#2340 ) This PR adds support for non-streaming Zipformer CTC ASR models across multiple language bindings, WebAssembly, examples, and CI workflows. - Introduces a new OfflineZipformerCtcModelConfig in C/C++, Python, Swift, Java, Kotlin, Go, Dart, Pascal, and C# APIs - Updates initialization, freeing, and recognition logic to include Zipformer CTC in WASM and Node.js - Adds example scripts and CI steps for downloading, building, and running Zipformer CTC models Model doc is available at https://k2-fsa.github.io/sherpa/onnx/pretrained_models/offline-ctc/icefall/zipformer.html	2025-07-04 15:57:07 +08:00
wenjie.Li	ef16455cb5	Add sherpa-onnx-streaming-zipformer-zh-int8-2025-06-30 to android ASR apk (#2336 )	2025-07-03 11:31:13 +08:00
Fangjun Kuang	9fe25cc06f	Fix VAD+ASR C++ example. (#2335 ) It was not able to handle short audio, e.g., 2.1 seconds.	2025-07-02 15:52:49 +08:00
Fangjun Kuang	ea3e583ac9	Fix static link without tts (#2328 )	2025-06-30 14:21:01 +08:00
Fangjun Kuang	046ce01203	Add TTS engine APKs for more models (#2327 )	2025-06-30 13:36:29 +08:00
Fangjun Kuang	f725cb3306	Refactor release scripts. (#2323 ) It refactors the release scripts to centralize and simplify version updates across multiple files. Key changes include: - Introducing variables (old_version, new_version, replace_str) for version substitution. - Replacing hard-coded sed expressions with dynamic ones in various files. - Ensuring backup files generated by sed are cleaned up after execution.	2025-06-27 11:22:31 +08:00
Fangjun Kuang	e25634ac39	Release v1.12.3 (#2322 )	2025-06-27 10:55:46 +08:00
Fangjun Kuang	f835642b1c	Support Zipformer transducer ASR with whisper features. (#2321 ) Adds support for Zipformer transducer ASR models that use Whisper-style features by introducing a new feature flag, parsing metadata, and integrating per-chunk normalization. - Introduce UseWhisperFeature in the model interface and Zipformer implementation - Parse "feature" metadata to set the whisper flag and wire it into the recognizer - Update feature extraction logic to handle Whisper filterbanks with early returns	2025-06-27 10:40:41 +08:00
Fangjun Kuang	54bf3732d9	Support zipformer CTC ASR with whisper features. (#2319 )	2025-06-27 00:15:11 +08:00
Fangjun Kuang	282211c01f	Remove portaudio-go in Go API examples. (#2317 ) Replace the deprecated portaudio-go integration with malgo in the Go real-time speech recognition example and correct version string typos in the Node.js examples. - Fixed “verison” typo in Node.js console logs. - Swapped out portaudio-go for malgo in the Go microphone example, introducing initRecognizer, callback-driven streaming, and sample conversion. - Removed portaudio-go from go.mod.	2025-06-26 11:33:50 +08:00
Fangjun Kuang	074236ae80	Show cmake debug information. (#2316 )	2025-06-25 17:44:51 +08:00
Fangjun Kuang	056da0528d	Release v1.12.2 (#2314 )	2025-06-25 00:37:55 +08:00
Fangjun Kuang	bda427f4b2	Add API to get version information (#2309 )	2025-06-25 00:22:21 +08:00
Fangjun Kuang	7f2145539d	Update readme to include BreezeApp from MediaTek Research. (#2313 ) See also https://github.com/mtkresearch/BreezeApp	2025-06-24 18:05:50 +08:00
Fangjun Kuang	6982b86c66	Support extra languages in multi-lang kokoro tts (#2303 )	2025-06-20 11:22:52 +08:00
Fangjun Kuang	a6095f5f64	Fix building for Pascal (#2305 )	2025-06-20 11:10:07 +08:00
Fangjun Kuang	59d118c256	Refactor kokoro export (#2302 ) - generate samples for https://k2-fsa.github.io/sherpa/onnx/tts/all/ - provide int8 model for kokoro v0.19 kokoro-int8-en-v0_19.tar.bz2	2025-06-18 20:30:10 +08:00
Fangjun Kuang	3878170991	Fixes #2172 (#2301 ) Handle the case when the input audio contains no speech.	2025-06-18 16:48:48 +08:00
DSOE1024	b4716e29a6	Update sherpa-onnx-shared.pc.in (#2300 ) Fix linking with C++ examples.	2025-06-17 16:55:42 +08:00
Fangjun Kuang	2913cce77c	Add scripts for exporting Piper TTS models to sherpa-onnx (#2299 )	2025-06-17 14:23:39 +08:00
Fangjun Kuang	4ae9382bae	Update TTS Engine APK to support multi-lang (#2294 )	2025-06-17 14:16:48 +08:00
guoxiangyang	0c42c06f75	Update wasm/vad-asr/assets/README.md for clarity (#2297 ) Co-authored-by: gxy <gxy@conwi.cn>	2025-06-16 15:35:20 +08:00
GlocKieHuan	a135324c8c	Fix isspace on windows in debug build (#2042 )	2025-06-09 10:27:16 +08:00
Fangjun Kuang	e13f7dbdd2	Add link to huggingface space for source separation. (#2284 )	2025-06-06 13:38:16 +08:00
Fangjun Kuang	d57e4f84de	Add Python API for source separation (#2283 )	2025-06-05 20:44:26 +08:00
Fangjun Kuang	6f0fac2064	Add jar for Java 24. (#2280 )	2025-06-04 11:08:45 +08:00
Fangjun Kuang	db632dacf3	Fix CI for windows (#2279 )	2025-06-04 10:35:48 +08:00
Fangjun Kuang	749dc9a239	Release v1.12.1 (#2277 )	2025-06-03 21:55:49 +08:00
Fangjun Kuang	9539af5f5c	Fix 32-bit arm CI (#2276 )	2025-06-03 21:02:33 +08:00
Fangjun Kuang	1fabc6c79a	Fix rknn for multi-threads (#2274 )	2025-06-03 20:28:57 +08:00
flying-forever	818b3f6d6c	Update utils.dart (#2275 ) Fix normalizer for int16_t samples in flutter_examples.	2025-06-03 20:28:33 +08:00
Fangjun Kuang	6cb44d44e9	Export nvidia/canary-180m-flash to sherpa-onnx (#2272 )	2025-06-02 22:28:15 +08:00
Fangjun Kuang	2b2788332e	Add C++ support for UVR models (#2269 )	2025-06-01 17:22:08 +08:00
mtdxc	e0ca224b76	fixed mfc build error (#2267 ) Co-authored-by: cqm <cqm@97kid.com>	2025-05-31 23:32:35 +08:00
mtdxc	613e8084c2	move portaudio common record code to microphone (#2264 ) Co-authored-by: cqm <cqm@97kid.com>	2025-05-31 21:48:41 +08:00
Fangjun Kuang	921f0f40cb	Add UVR models for source separation. (#2266 )	2025-05-31 13:31:31 +08:00
Fangjun Kuang	93e2819c18	Fix building MFC examples (#2263 )	2025-05-29 17:41:45 +08:00
Fangjun Kuang	0bb325cecd	Fix building sherpa-onnx (#2262 )	2025-05-29 16:11:22 +08:00
Fangjun Kuang	8e6826521e	Update kaldi-native-fbank. (#2259 ) Now it supports FFT of an even number, not necessarily a power of 2.	2025-05-29 10:34:22 +08:00

1263 Commits