enginex_bi_series-sherpa-onnx

EngineX-Iluvatar/enginex_bi_series-sherpa-onnx

Archived

Author	SHA1	Message	Date
Fangjun Kuang	802119db17	Add CXX API for speech enhancement GTCRN models (#1986 )	2025-03-11 17:07:52 +08:00
Fangjun Kuang	c5dbf1177c	Add C API for speech enhancement GTCRN models (#1984 )	2025-03-11 15:50:04 +08:00
Fangjun Kuang	5d2d792b1d	Add Python API for speech enhancement GTCRN models (#1978 )	2025-03-10 19:02:17 +08:00
Fangjun Kuang	488a6e687c	Add C++ runtime for speech enhancement GTCRN models (#1977 ) See also https://github.com/Xiaobin-Rong/gtcrn	2025-03-10 18:11:16 +08:00
cjsdurj	b87fce9a7f	c-api add wave write to buffer. (#1962 ) Co-authored-by: jian.chen03 <jian.chen03@transwarp.io>	2025-03-10 17:21:23 +08:00
Fangjun Kuang	362ddf2c07	Add C++ demo for VAD+non-streaming ASR (#1964 )	2025-03-07 11:49:46 +08:00
Karel Vesely	7740dbfb96	Ebranchformer (#1951 ) * adding ebranchformer encoder * extend surfaced FeatureExtractorConfig - so ebranchformer feature extraction can be configured from Python - the GlobCmvn is not needed, as it is a module in the OnnxEncoder * clean the code * Integrating remarks from Fangjun	2025-03-04 19:41:09 +08:00
Fangjun Kuang	209eaaae1d	Limit number of tokens per second for whisper. (#1958 ) Otherwise, it spends lots of time in the loop if the EOT token is not predicted.	2025-03-04 15:45:28 +08:00
Fangjun Kuang	c9d6859df7	Add transducer modified_beam_search for RKNN. (#1949 )	2025-03-03 13:15:25 +08:00
Fangjun Kuang	d5e7b51af5	Support RKNN for Zipformer CTC models. (#1948 )	2025-03-02 21:40:13 +08:00
Fangjun Kuang	dfcbc8d40b	Add Kokoro v1.1-zh (#1942 )	2025-02-28 15:47:59 +08:00
Fangjun Kuang	f5dfcf8d2f	Add Kotlin and Java API for online punctuation models (#1936 )	2025-02-27 16:52:36 +08:00
Fangjun Kuang	337d5f7a80	Release v1.10.46 (#1929 )	2025-02-26 19:19:33 +08:00
Fangjun Kuang	eebe19997d	Build wheels for rknn linux aarch64 (#1928 )	2025-02-26 18:58:57 +08:00
Fangjun Kuang	82cb8a5dc3	Minor fixes for rknn (#1925 )	2025-02-26 16:26:18 +08:00
Fangjun Kuang	4d79e6a007	Add C++ API for streaming zipformer ASR on RK NPU (#1908 )	2025-02-24 19:07:37 +08:00
ivan provalov	94728bfbee	Fixing Whisper Model Token Normalization (#1904 )	2025-02-21 12:58:01 +08:00
Fangjun Kuang	ed922e61b5	Fix publishing pre-built windows libraries (#1905 )	2025-02-21 11:59:27 +08:00
ivan provalov	4801094133	JNI Exception Handling (#1452 )	2025-02-19 23:02:28 +08:00
Fangjun Kuang	614c51068b	Add Pascal API for FireRedAsr AED Model (#1877 ) (#1880 )	2025-02-17 16:06:18 +08:00
Fangjun Kuang	1d49dd2fb0	Add CXX API for FireRedAsr (#1872 )	2025-02-17 11:46:13 +08:00
Fangjun Kuang	193d31333c	Add C API for FireRedAsr AED model. (#1871 )	2025-02-17 11:22:17 +08:00
Fangjun Kuang	d148860d2c	Add Kotlin and Java API for FireRedAsr AED model (#1870 )	2025-02-17 10:50:25 +08:00
Fangjun Kuang	316424b382	Add C++ and Python API for FireRedASR AED models (#1867 )	2025-02-16 22:45:24 +08:00
Fangjun Kuang	944400e399	Fix spliting text by languages for kokoro tts. (#1849 )	2025-02-13 18:19:34 +08:00
ahadjawaid	73d7c25233	Fix: made print sherpa_onnx_loge when it is in debug mode (#1838 ) Currently, during normal use you may get a lot of print statements such as: `Use espeak-ng to handle the OOV: 'ipsum'` which may not be relevant unless you are debugging.	2025-02-11 00:22:50 +08:00
Fangjun Kuang	ad883d44fe	Support specifying voice in espeak-ng for kokoro tts models. (#1836 )	2025-02-10 19:05:53 +08:00
Fangjun Kuang	d5da9430e8	Add PengChengStarling models to sherpa-onnx (#1835 )	2025-02-10 18:23:40 +08:00
Kell	2ac41d3d85	OfflineRecognizer supports create stream with hotwords (#1833 ) Co-authored-by: Wangkai <kell.wang@huawei.com>	2025-02-10 16:26:56 +08:00
Fangjun Kuang	9559a10bd3	Add C++ support for MatchaTTS models not from icefall. (#1834 )	2025-02-10 15:38:29 +08:00
Fangjun Kuang	69f489f0cd	Support scaling the duration of a pause in TTS. (#1820 )	2025-02-08 12:47:26 +08:00
Fangjun Kuang	d38cb81014	Fix passing gb2312 encoded strings to tts on Windows (#1819 )	2025-02-08 09:48:58 +08:00
Fangjun Kuang	c254504921	Add Pascal API for Kokoro TTS 1.0 (#1807 )	2025-02-07 16:06:11 +08:00
Fangjun Kuang	d815204774	Add CXX API for Kokoro TTS 1.0 (#1802 )	2025-02-07 14:51:49 +08:00
Fangjun Kuang	7330f7519a	Add C API for Kokoro TTS 1.0 (#1801 )	2025-02-07 14:30:40 +08:00
Fangjun Kuang	a52b819fb5	Add Android demo for Kokoro TTS 1.0 (#1799 )	2025-02-07 13:07:30 +08:00
Fangjun Kuang	4372a7a7b0	Add Java and Koltin API for Kokoro TTS 1.0 (#1798 )	2025-02-07 09:59:27 +08:00
Fangjun Kuang	c84a833863	Add C++ and Python API for Kokoro 1.0 multilingual TTS model (#1795 )	2025-02-06 22:57:13 +08:00
ahadjawaid	8677d83efc	Fix: Prepend 0 to tokenization to prevent word skipping for Kokoro. (#1787 ) Addressed issue Skipping words #1777	2025-02-03 13:49:42 +08:00
Fangjun Kuang	f178e96bf0	Add keyword spotter C API for HarmonyOS (#1769 )	2025-01-26 14:12:30 +08:00
Fangjun Kuang	8b989a851c	Fix keyword spotting. (#1689 ) Reset the stream right after detecting a keyword	2025-01-20 16:41:10 +08:00
Jacklyn	b943341fb1	Fix `dither` binding in Pybind11 to ensure independence from `high_freq` in `FeatureExtractorConfig` (#1739 )	2025-01-20 16:29:36 +08:00
Fangjun Kuang	99cef4198b	Add Koltin and Java API for Kokoro TTS models (#1728 )	2025-01-17 17:36:13 +08:00
Fangjun Kuang	46f2e32e8a	Add Pascal API for Kokoro TTS models (#1724 )	2025-01-16 18:20:21 +08:00
Fangjun Kuang	2d0869c709	Fix style issues (#1718 )	2025-01-16 15:43:51 +08:00
Fangjun Kuang	af671e2b63	Add C API for Kokoro TTS models (#1717 )	2025-01-16 15:07:26 +08:00
Fangjun Kuang	ffc6b480a0	Add C++ and Python API for Kokoro TTS models. (#1715 )	2025-01-16 14:24:51 +08:00
Fangjun Kuang	cbe07ac1b6	Release v1.10.39 (#1702 )	2025-01-13 10:28:05 +08:00
Fangjun Kuang	6d18430dbf	Fix building without TTS (#1691 )	2025-01-07 09:59:20 +08:00
Fangjun Kuang	c6fcd32552	Add Pascal API for MatchaTTS models. (#1686 )	2025-01-06 10:04:35 +08:00

1 2 3 4 5 ...

556 Commits