enginex-mr_series-sherpa-onnx

EngineX-Iluvatar/enginex-mr_series-sherpa-onnx

Archived

Author	SHA1	Message	Date
Fangjun Kuang	028b8f2718	Add C++ example for streaming ASR with SenseVoice. (#2199 )	2025-05-11 00:23:32 +08:00
Fangjun Kuang	53518efd2f	Add real-time speech recognition example for SenseVoice. (#2197 )	2025-05-10 00:50:40 +08:00
Fangjun Kuang	4a833a7547	Fix displaying streaming speech recognition results for Python. (#2196 )	2025-05-09 21:48:49 +08:00
Fangjun Kuang	a6834f6556	Show verbose logs in homophone replacer (#2194 )	2025-05-09 10:48:30 +08:00
Fangjun Kuang	562a5f7d9b	Fix building wheels for macOS (#2192 )	2025-05-08 19:15:33 +08:00
Fangjun Kuang	f9c99032c3	Avoid NaN in feature normalization. (#2186 )	2025-05-08 11:22:47 +08:00
Fangjun Kuang	f00066db88	Add C++ runtime for parakeet-tdt-0.6b-v2. (#2181 )	2025-05-06 16:59:01 +08:00
Fangjun Kuang	e537094b07	Add Kotlin and Java API for homophone replacer (#2166 ) * Add Kotlin API for homophone replacer * Add Java API for homophone replacer	2025-04-29 22:55:21 +08:00
Fangjun Kuang	4a7a974a04	More fix for building without tts (#2162 )	2025-04-29 16:31:31 +08:00
Fangjun Kuang	e51c37eb2f	Add C and CXX API for homophone replacer (#2156 )	2025-04-27 22:09:13 +08:00
Fangjun Kuang	f64c58342b	Support replacing homophonic phrases (#2153 )	2025-04-27 15:31:11 +08:00
Fangjun Kuang	e3280027f9	Support decoding multiple streams in Java API. (#2149 )	2025-04-25 11:18:57 +08:00
Fangjun Kuang	48ab90aadc	Fix setting OnlineModelConfig in Java API (#2147 )	2025-04-24 16:25:36 +08:00
Fangjun Kuang	72742d5472	Fix punctuations for kokoro tts 1.1-zh. (#2146 )	2025-04-24 15:08:47 +08:00
Karel Vesely	6a1efd8ac2	online-transducer: reset the encoder together with 2 previous output symbols (non-blank) (#2129 ) * online-transducer: reset the encoder together with 2 previous output symbols (non-blank) - added `reset_encoder` boolean member into the OnlineRecognizerConfig class - by default the encoder is not reset * pybind11, adding empty symbols for disabled modules (tts, diarization) * reset_encoder, add default value (false) [pybind11]	2025-04-24 08:18:11 +08:00
Fangjun Kuang	7cbb1bc433	Upload more onnx ASR models (#2141 )	2025-04-21 18:57:41 +08:00
Nickolay V. Shmyrev	84ed5d4288	Expose dither in python API (#2127 )	2025-04-17 16:47:48 +08:00
Karel Vesely	f3d23aa170	cmake build, configurable from env (#2115 ) - make sure the defaults in `cmake/cmake_extension.py` variable `extra_cmake_args` can be overridden by `cmake_args` from `SHERPA_ONNX_CMAKE_ARGS` env variable - fix a bug in `sherpa-onnx/csrc/parse-options.cc` which appears when using `-DSHERPA_ONNX_ENABLE_CHECK=ON` - avoid copying binaries when these are disabled	2025-04-16 21:26:54 +08:00
Fangjun Kuang	7a78f2eb7a	Fix building for HarmonyOS (#2125 )	2025-04-15 18:00:07 +08:00
Fangjun Kuang	e3bce847c0	Support running sherpa-onnx with RK NPU on Android (#2124 )	2025-04-15 16:42:28 +08:00
Fangjun Kuang	1c3a383002	Fix a typo in the JNI for Android. (#2108 )	2025-04-09 09:02:41 +08:00
Askars Salimbajevs	664b461d01	Disable strict hotword matching mode for offline transducer (#1837 ) * Disable strict hotword matching mode for offline transducer. Also introduces new variable, so that later this mode can be switched on in the runtime. * remove strict mode variable --------- Co-authored-by: Askars Salimbajevs <askars.salimbajevs@tilde.lv>	2025-04-03 22:52:19 +08:00
Fangjun Kuang	8137ac9f0b	Add Pascal API for Dolphin CTC models (#2096 )	2025-04-03 16:00:22 +08:00
Askars Salimbajevs	18a6ed5ddc	Preserve more context after endpointing in transducer (#2061 )	2025-04-02 23:33:47 +08:00
Fangjun Kuang	da4aad1189	Add C and CXX API for Dolphin CTC models (#2088 )	2025-04-02 21:54:20 +08:00
Fangjun Kuang	eee5575836	Add Kotlin and Java API for Dolphin CTC models (#2086 )	2025-04-02 21:16:14 +08:00
Fangjun Kuang	0de7e1b9f0	Add C++ and Python API for Dolphin CTC models (#2085 )	2025-04-02 19:09:00 +08:00
Fangjun Kuang	1316719e23	Fix building for android (#2081 )	2025-04-01 19:36:40 +08:00
Fangjun Kuang	a11e359c11	Refactor rknn code (#2079 )	2025-04-01 16:54:53 +08:00
Fangjun Kuang	8e51a97550	Add C++ runtime for silero_vad with RKNN (#2078 )	2025-04-01 15:56:56 +08:00
Fangjun Kuang	0703bc1b86	Add CXX API for VAD (#2077 )	2025-04-01 14:51:43 +08:00
Anders Xiao	ce196fceae	fix dml with preinstall ort (#2066 )	2025-03-30 12:07:19 +08:00
niansa/tuxifan	9d23606ee6	Allow building repository as CMake subdirectory (#2059 ) * Use PROJECT_SOURCE_DIR rather than CMAKE_SOURCE_DIR to allow building as subdirectory * Also use PROJECT_SOURCE_DIR instead of CMAKE_SOURCE_DIR in c/cxx api examples * Only build examples by default when not building as subdirectory * Do not suggest building binaries either --------- Co-authored-by: user <user@mail.tld>	2025-03-29 06:27:59 +08:00
Fangjun Kuang	a5dd0cdfc3	Fix length scale for kokoro tts (#2060 )	2025-03-27 10:52:01 +08:00
yourengod	bd61c1d8e5	Change scale factor to 32767 (#2056 )	2025-03-26 10:44:49 +08:00
Fangjun Kuang	823e2e6257	Fix building wheels for RKNN (#2041 )	2025-03-22 18:33:32 +08:00
Sangeet Sagar	31096e43bd	fix static linking (#2032 )	2025-03-21 12:47:45 +08:00
Fangjun Kuang	a19e57604e	Fix Matcha + vocos for Android (#2024 )	2025-03-19 18:39:10 +08:00
Fangjun Kuang	a50901f366	Fix a bug in vad.reset() (#2023 ) We also need to clear _last	2025-03-19 17:42:05 +08:00
Fangjun Kuang	1f52ac2126	add alsa example for vad+offline asr (#2020 )	2025-03-18 20:06:24 +08:00
Fangjun Kuang	406272210f	Fix CI (#2016 )	2025-03-17 22:31:36 +08:00
Fangjun Kuang	0aacf02dd8	Add C++ runtime for vocos (#2014 )	2025-03-17 17:05:15 +08:00
Fangjun Kuang	71824992a7	Add Java API for speech enhancement GTCRN models (#2009 )	2025-03-16 15:13:20 +08:00
Fangjun Kuang	ed8e6c9aed	Add Kotlin API for speech enhancement GTCRN models (#2008 )	2025-03-16 10:41:01 +08:00
Fangjun Kuang	6a97f8adcf	Add JavaScript (node-addon) API for speech enhancement GTCRN models (#1996 )	2025-03-12 15:52:01 +08:00
Fangjun Kuang	c3b009988b	Add Pascal API for speech enhancement GTCRN models (#1992 )	2025-03-12 10:48:59 +08:00
Fangjun Kuang	802119db17	Add CXX API for speech enhancement GTCRN models (#1986 )	2025-03-11 17:07:52 +08:00
Fangjun Kuang	c5dbf1177c	Add C API for speech enhancement GTCRN models (#1984 )	2025-03-11 15:50:04 +08:00
Fangjun Kuang	5d2d792b1d	Add Python API for speech enhancement GTCRN models (#1978 )	2025-03-10 19:02:17 +08:00
Fangjun Kuang	488a6e687c	Add C++ runtime for speech enhancement GTCRN models (#1977 ) See also https://github.com/Xiaobin-Rong/gtcrn	2025-03-10 18:11:16 +08:00


602 Commits