enginex-mr_series-sherpa-onnx

EngineX-Iluvatar/enginex-mr_series-sherpa-onnx

Archived

Author	SHA1	Message	Date
Fangjun Kuang	316424b382	Add C++ and Python API for FireRedASR AED models (#1867 )	2025-02-16 22:45:24 +08:00
Fangjun Kuang	944400e399	Fix spliting text by languages for kokoro tts. (#1849 )	2025-02-13 18:19:34 +08:00
ahadjawaid	73d7c25233	Fix: made print sherpa_onnx_loge when it is in debug mode (#1838 ) Currently, during normal use you may get a lot of print statements such as: `Use espeak-ng to handle the OOV: 'ipsum'` which may not be relevant unless you are debugging.	2025-02-11 00:22:50 +08:00
Fangjun Kuang	ad883d44fe	Support specifying voice in espeak-ng for kokoro tts models. (#1836 )	2025-02-10 19:05:53 +08:00
Fangjun Kuang	d5da9430e8	Add PengChengStarling models to sherpa-onnx (#1835 )	2025-02-10 18:23:40 +08:00
Fangjun Kuang	9559a10bd3	Add C++ support for MatchaTTS models not from icefall. (#1834 )	2025-02-10 15:38:29 +08:00
Fangjun Kuang	69f489f0cd	Support scaling the duration of a pause in TTS. (#1820 )	2025-02-08 12:47:26 +08:00
Fangjun Kuang	d38cb81014	Fix passing gb2312 encoded strings to tts on Windows (#1819 )	2025-02-08 09:48:58 +08:00
Fangjun Kuang	7330f7519a	Add C API for Kokoro TTS 1.0 (#1801 )	2025-02-07 14:30:40 +08:00
Fangjun Kuang	c84a833863	Add C++ and Python API for Kokoro 1.0 multilingual TTS model (#1795 )	2025-02-06 22:57:13 +08:00
ahadjawaid	8677d83efc	Fix: Prepend 0 to tokenization to prevent word skipping for Kokoro. (#1787 ) Addressed issue Skipping words #1777	2025-02-03 13:49:42 +08:00
Fangjun Kuang	f178e96bf0	Add keyword spotter C API for HarmonyOS (#1769 )	2025-01-26 14:12:30 +08:00
Fangjun Kuang	8b989a851c	Fix keyword spotting. (#1689 ) Reset the stream right after detecting a keyword	2025-01-20 16:41:10 +08:00
Fangjun Kuang	2d0869c709	Fix style issues (#1718 )	2025-01-16 15:43:51 +08:00
Fangjun Kuang	ffc6b480a0	Add C++ and Python API for Kokoro TTS models. (#1715 )	2025-01-16 14:24:51 +08:00
Fangjun Kuang	cbe07ac1b6	Release v1.10.39 (#1702 )	2025-01-13 10:28:05 +08:00
Fangjun Kuang	1fe5fe495f	Add Android demo for MatchaTTS models. (#1683 )	2025-01-06 06:44:09 +08:00
Fangjun Kuang	bf3330c906	Add HarmonyOS examples for MatchaTTS. (#1678 )	2025-01-03 17:09:29 +08:00
Fangjun Kuang	9aa4897a9e	Add C API for MatchaTTS models (#1675 )	2025-01-03 12:17:26 +08:00
Fangjun Kuang	a00d3b4821	Add Java API for Matcha-TTS models. (#1673 )	2025-01-02 15:15:30 +08:00
Fangjun Kuang	f457baea42	Support Matcha-TTS models using espeak-ng (#1672 )	2025-01-02 13:46:43 +08:00
Fangjun Kuang	3422b9388d	Add Kotlin API for Matcha-TTS models. (#1668 )	2024-12-31 19:20:52 +08:00
Fangjun Kuang	ebe92e523d	Remove spaces after punctuations for TTS (#1666 )	2024-12-31 16:06:27 +08:00
Fangjun Kuang	2c2926af7d	Add C++ runtime for Matcha-TTS (#1627 )	2024-12-31 12:44:14 +08:00
Fangjun Kuang	b6f0f5fc2e	Support removing invalid utf-8 sequences. (#1648 )	2024-12-25 19:32:13 +08:00
Fangjun Kuang	d00d1c6298	Fix GitHub actions. (#1642 )	2024-12-24 11:34:35 +08:00
Fangjun Kuang	b76cd9033a	Support decoding with byte-level BPE (bbpe) models. (#1633 )	2024-12-20 19:21:32 +08:00
Fangjun Kuang	1bae4085ca	Add speaker diarization API for HarmonyOS. (#1609 )	2024-12-10 16:03:03 +08:00
Fangjun Kuang	314545f938	Add speaker identification APIs for HarmonyOS (#1607 ) * Add speaker embedding extractor API for HarmonyOS * Add ArkTS API for speaker identification	2024-12-09 19:23:18 +08:00
Fangjun Kuang	a743a4400f	Add on-device real-time ASR demo for HarmonyOS (#1606 )	2024-12-09 16:40:15 +08:00
Fangjun Kuang	74a8735f7a	Add on-device tex-to-speech (TTS) demo for HarmonyOS (#1590 )	2024-12-04 14:27:12 +08:00
Fangjun Kuang	dc3287f3a8	Add HarmonyOS support for text-to-speech. (#1584 )	2024-12-01 21:43:34 +08:00
Fangjun Kuang	109fb799ca	fix building for Android (#1568 )	2024-11-27 10:36:16 +08:00
Fangjun Kuang	2101227269	Add streaming ASR support for HarmonyOS. (#1565 )	2024-11-26 18:36:56 +08:00
Fangjun Kuang	298b6b6fda	Add non-streaming ASR support for HarmonyOS. (#1564 )	2024-11-26 16:38:35 +08:00
Fangjun Kuang	31d6206fde	HarmonyOS support for VAD. (#1561 )	2024-11-24 16:29:24 +08:00
Fangjun Kuang	f97daed408	Fixes #1512 (#1522 )	2024-11-08 21:07:36 +08:00
Fangjun Kuang	4eeb336f59	Export the English TTS model from MeloTTS (#1509 )	2024-11-04 07:54:19 +08:00
Fangjun Kuang	6ee8c99c5d	Fix building (#1508 )	2024-11-03 19:47:04 +08:00
Fangjun Kuang	9ab89c33bc	Support building GPU-capable sherpa-onnx on Linux aarch64. (#1500 ) Thanks to @Peakyxh for providing pre-built onnxruntime libraries with CUDA support for Linux aarch64. Tested on Jetson nano b01	2024-11-01 11:16:28 +08:00
Fangjun Kuang	9fa3bc40d7	Fix reading tokens.txt on Windows. (#1497 )	2024-10-30 12:13:11 +08:00
Fangjun Kuang	669f5ef441	Add C++ runtime and Python APIs for Moonshine models (#1473 )	2024-10-26 14:34:07 +08:00
Fangjun Kuang	707cf792c5	Add GigaAM NeMo transducer model for Russian ASR (#1467 )	2024-10-25 15:20:13 +08:00
Fangjun Kuang	b41f6d2c94	Support GigaAM CTC models for Russian ASR (#1464 ) See also https://github.com/salute-developers/GigaAM	2024-10-25 10:55:16 +08:00
Fangjun Kuang	a5295aad10	Handle NaN embeddings in speaker diarization. (#1461 ) See also https://github.com/thewh1teagle/sherpa-rs/issues/33	2024-10-24 14:03:09 +08:00
Fangjun Kuang	b3e05f6dc4	Fix style issues (#1458 )	2024-10-24 11:15:08 +08:00
Fangjun Kuang	ceb69ebd94	Add C++ API for non-streaming ASR (#1456 )	2024-10-23 16:40:12 +08:00
Zazzle516	4783c8f590	fix "log10" compile error by import CMATH lib (#1438 )	2024-10-17 14:50:04 +08:00
Fangjun Kuang	94b26ff07c	Android JNI support for speaker diarization (#1421 )	2024-10-12 13:03:48 +08:00
Fangjun Kuang	1ed803adc1	Dart API for speaker diarization (#1418 )	2024-10-11 21:17:41 +08:00

1 2 3 4 5 ...

378 Commits