EngineX-Iluvatar/enginex-mr_series-sherpa-onnx

Archived

Author	SHA1	Message	Date
Fangjun Kuang	be4a2488a8	Use batch size 1 in generating subtitles. (#756)	2024-04-11 15:58:11 +08:00
Manix	399d920b47	[feature] Configurable padding length in online websocket server (#755) Signed-off-by: manickavela29 <manickavela1998@gmail.com>	2024-04-11 14:57:11 +08:00
Fangjun Kuang	f204e62b44	Add C API for audio tagging (#754)	2024-04-11 14:18:43 +08:00
Fangjun Kuang	34d70a259f	Add Python API and Python examples for audio tagging (#753)	2024-04-11 11:12:48 +08:00
AHN Sung Hwan	904a3cc8a9	Fix a bug in mean calculation of 'ys_probs' (#748)	2024-04-11 10:34:44 +08:00
布宝	d21c45d0ea	Add --continue to wget (#750) Also, switch to github mirror	2024-04-11 09:07:31 +08:00
Fangjun Kuang	042976ea6e	Add C++ microphone examples for audio tagging (#749)	2024-04-10 21:00:35 +08:00
Fangjun Kuang	f20291cadc	Support audio tagging using zipformer (#747)	2024-04-10 14:47:06 +08:00
Fangjun Kuang	c9ae7595d5	Fix go API examples with portaudio on Windows. (#746)	2024-04-10 09:56:35 +08:00
Fangjun Kuang	db1b3ab1f3	Fix building OpenFst on Windows. (#744)	2024-04-09 11:17:46 +08:00
Fangjun Kuang	0d90b34e4a	Support Chinese heteronyms on Android for TTS. (#742)	2024-04-08 21:36:47 +08:00
Fangjun Kuang	6b3d2b87f9	Fix releasing GIL (#741)	2024-04-08 17:22:48 +08:00
Fangjun Kuang	6fb8ceda57	Add VAD examples using ALSA for recording (#739)	2024-04-08 16:41:01 +08:00
Fangjun Kuang	a5f8fbc83f	Support heteronyms in Chinese TTS (#738)	2024-04-08 11:01:30 +08:00
Fangjun Kuang	c1c0f5bafd	return timestamps for WebAssembly (#737)	2024-04-05 20:24:27 +08:00
Fangjun Kuang	dbff2eaadb	Add C API for streaming HLG decoding (#734)	2024-04-05 10:31:20 +08:00
Fangjun Kuang	db67e00c77	Add HLG decoding for streaming CTC models (#731)	2024-04-03 21:31:42 +08:00
yujinqiu	f8832cb5f2	Add language identification swiftui demo (#729)	2024-04-01 20:34:14 +08:00
yujinqiu	fabd30e3bb	Fix microphone privacy config (#727)	2024-04-01 14:59:40 +08:00
Fangjun Kuang	3acf373b07	add more piper models (#725)	2024-04-01 11:39:52 +08:00
Fangjun Kuang	2ededa7e98	Fix building wasm in CI (#720)	2024-03-31 20:50:56 +08:00
Fangjun Kuang	43af1e6951	Release v1.9.15 (#719)	2024-03-29 19:58:04 +08:00
Fangjun Kuang	6da4a1c12f	Add Go API for speaker identification (#718)	2024-03-29 19:25:55 +08:00
Fangjun Kuang	2e0bccad36	Add C API for speaker embedding extractor. (#711)	2024-03-28 18:05:40 +08:00
Leo Huang	638f48f47a	Added progress for callback of tts generator (#712) Co-authored-by: leohwang <leohwang@360converter.com>	2024-03-28 17:12:20 +08:00
longshiming	de655e838e	delete incorrect logs (#714) Co-authored-by: longshiming <longshiming@greesoft.com>	2024-03-28 10:49:45 +08:00
Fangjun Kuang	559744ac60	Fix ios-swift to remove invalid references (#713)	2024-03-28 09:39:43 +08:00
Fangjun Kuang	a042f44076	Add Golang API for spoken language identification. (#709)	2024-03-27 19:40:25 +08:00
Fangjun Kuang	12efbf7397	Sign released TTS APKs (#710)	2024-03-27 19:34:37 +08:00
Fangjun Kuang	69c7880c4d	Add Golang API for VAD (#708)	2024-03-27 12:09:39 +08:00
hantengc	ccb2d435ec	add openfst.cmake file (#707) 1. When compiling locally, openfst is missing.so add this file to the sherpa-onnx/cmake folder	2024-03-27 11:31:26 +08:00
Fangjun Kuang	4e040c596e	Support including TTS conditionally. (#699)	2024-03-26 17:21:35 +08:00
Fangjun Kuang	bd66f7a7d0	Build Android TTS APKs for coqui-ai/TTS models (#704)	2024-03-26 14:05:26 +08:00
Fangjun Kuang	d364610605	Use a single thread when loading models (#703)	2024-03-26 13:35:33 +08:00
Fangjun Kuang	305c373107	Add C# API for spoken language identification (#697)	2024-03-25 18:45:09 +08:00
Fangjun Kuang	83a10a55a5	Add Swift API for spoken language identification. (#696)	2024-03-25 16:22:25 +08:00
Fangjun Kuang	ab7cff2513	Add C API for spoken language identification. (#695)	2024-03-25 15:16:47 +08:00
Fangjun Kuang	0d258dd150	Support spoken language identification with whisper (#694)	2024-03-24 22:57:00 +08:00
Fangjun Kuang	3cdad9b5d1	Use manylinux in CI test (#692)	2024-03-24 07:54:32 +08:00
Masoud	e60c897ce7	Update MainActivity.kt (#693) fix read-only test text box	2024-03-24 07:29:14 +08:00
Fangjun Kuang	1952772654	Add timestamps and tokens for .Net's online models. (#690)	2024-03-23 18:51:56 +08:00
Fangjun Kuang	e6da2c5556	Fix build c api examples with alsa (#691)	2024-03-23 16:16:24 +08:00
Karel Vesely	eaec4c83c2	Configurable low_freq high_freq, dithering (#664)	2024-03-22 21:41:44 +08:00
Fangjun Kuang	2fc1201924	Add hotwords support to .Net (#689)	2024-03-22 21:40:42 +08:00
Fangjun Kuang	24f437a6f1	Refactor github actions tests (#688)	2024-03-22 21:22:42 +08:00
Masoud	1c77457d61	Update MainActivity.kt (#687) Appending a default text to test field. To faster check the voices	2024-03-22 19:04:14 +08:00
Fangjun Kuang	c8770aec20	Add nuget package for Windows x86 (#683)	2024-03-21 14:57:01 +08:00
Fangjun Kuang	acf0975153	Support whisper language/task in various language bindings. (#679)	2024-03-20 16:43:35 +08:00
Viggo	842d04d7ae	support whisper language (#678)	2024-03-20 10:16:22 +08:00
Fangjun Kuang	6571fc9552	Add tts play example for .Net. (#676) It plays the generated audio via a speaker as it is generating.	2024-03-19 17:33:15 +08:00

496 Commits