enginex-mr_series-sherpa-onnx

EngineX-Iluvatar/enginex-mr_series-sherpa-onnx

Archived

Author	SHA1	Message	Date
Fangjun Kuang	299f2392e2	Add CI to build HAPs for HarmonyOS (#1578)	2024-11-29 21:13:01 +08:00
Fangjun Kuang	c34ab35591	Add Android APK for streaming Paraformer ASR (#1538)	2024-11-14 20:57:35 +08:00
Fangjun Kuang	a3c89aa0d8	Add two-pass ASR Android APKs for Moonshine models. (#1499)	2024-10-31 17:54:16 +08:00
Fangjun Kuang	bd4b223920	Add Kotlin and Java API for Moonshine models (#1474)	2024-10-26 22:30:29 +08:00
Fangjun Kuang	707cf792c5	Add GigaAM NeMo transducer model for Russian ASR (#1467)	2024-10-25 15:20:13 +08:00
Fangjun Kuang	b41f6d2c94	Support GigaAM CTC models for Russian ASR (#1464). See also https://github.com/salute-developers/GigaAM	2024-10-25 10:55:16 +08:00
Fangjun Kuang	e0586f1876	add more models for speaker diarization (#1440)	2024-10-17 20:03:09 +08:00
Fangjun Kuang	620597f501	Support https://huggingface.co/Revai/reverb-diarization-v1 (#1437)	2024-10-17 11:58:14 +08:00
Fangjun Kuang	5a22f74b2b	Android demo for speaker diarization (#1423)	2024-10-13 14:02:57 +08:00
Fangjun Kuang	b965f14cf0	Add Python API for clustering (#1385)	2024-09-30 11:33:15 +08:00
Fangjun Kuang	576a3aa90d	Add non-streaming ONNX models for Russian ASR (#1358)	2024-09-18 13:43:49 +08:00
Fangjun Kuang	c38634dfcf	two-pass Android APK for SenseVoice (#1302)	2024-08-29 12:08:49 +08:00
Fangjun Kuang	fb09f8fae3	Set batch size to 1 for more streaming ASR models (#1280)	2024-08-23 11:06:55 +08:00
Fangjun Kuang	5a2aa110b8	Text to speech API for Object Pascal. (#1273)	2024-08-20 20:52:16 +08:00
Fangjun Kuang	fbe35ba736	Add Lazarus example for generating subtitles using Silero VAD with non-streaming ASR (#1251)	2024-08-15 22:19:45 +08:00
Fangjun Kuang	35c1b4a7a9	Add ReazonSpeech Japanese pre-trained model (#1203)	2024-08-02 10:21:24 +08:00
Fangjun Kuang	dd300b1de5	Add Java and Kotlin API for sense voice (#1164)	2024-07-22 14:08:40 +08:00
Fangjun Kuang	960eb7529e	Add C++ runtime for MeloTTS (#1138)	2024-07-16 15:55:02 +08:00
Fangjun Kuang	fa07bbc176	Add APK for small paraformer (#1133)	2024-07-15 19:44:36 +08:00
Fangjun Kuang	3951a12f8d	Add pre-trained models for the Libriheavy dataset (#1122)	2024-07-13 19:21:13 +08:00
Fangjun Kuang	b5093e27f9	Fix publishing apks to huggingface (#1121). Save APKs for each release in a separate directory. Huggingface requires that each directory cannot contain more than 1000 files. Since we have so many tts models and for each model we need to build APKs of 4 different ABIs, it is a workaround for the huggingface's constraint by placing them into separate directories for different releases.	2024-07-13 16:14:00 +08:00
Fangjun Kuang	dd0ff2ca06	Support onnxruntime 1.18.0 (#906)	2024-07-10 17:05:26 +08:00
Fangjun Kuang	9e446b8501	Fix typos (#1101)	2024-07-09 20:08:47 +08:00
Fangjun Kuang	1f95bff719	Add non-streaming zipformer Android APK (#1052)	2024-06-24 16:22:19 +08:00
Fangjun Kuang	36336b31f4	Build Android APK for Thai (#1036)	2024-06-20 18:05:57 +08:00
Fangjun Kuang	6789c909d2	Inverse text normalization API of streaming ASR for various programming languages (#1022)	2024-06-18 13:42:17 +08:00
Fangjun Kuang	e1201225f2	Add Android APK for Korean (#1015)	2024-06-16 19:17:15 +08:00
Fangjun Kuang	09efe54808	add more text-to-speech models from piper (#988)	2024-06-11 15:22:48 +08:00
Fangjun Kuang	fd5a0d1e00	Add C++ runtime for Tele-AI/TeleSpeech-ASR (#970)	2024-06-05 00:26:40 +08:00
Fangjun Kuang	cd65e7627d	add a new tts piper model (#927)	2024-05-28 10:43:13 +08:00
Fangjun Kuang	384f96c40f	Add streaming CTC ASR APIs for node-addon-api (#867)	2024-05-13 11:58:25 +08:00
Fangjun Kuang	db85b2c1d8	Add Android APKs for NeMo CTC models. (#866)	2024-05-12 14:58:36 +08:00
Fangjun Kuang	7322f4e0a3	Fix node addon tests (#865). Install naudiodon2 manually. It is needed only when using a microphone. The CI tests don't need it.	2024-05-12 12:03:43 +08:00
Fangjun Kuang	d2e86b0415	Add links to pre-built APKs and pre-trained models to README. (#840)	2024-05-07 12:28:42 +08:00
Fangjun Kuang	9b67a476e6	Refactor the JNI interface to make it more modular and maintainable (#802)	2024-04-24 09:48:42 +08:00
Fangjun Kuang	7f3b9ffe5d	Refactor TTS Android code to support jieba for Chinese TTS models (#800)	2024-04-22 17:21:05 +08:00
Fangjun Kuang	2e0ee0e8c8	fix a typo in building language ID apk (#795)	2024-04-19 20:16:48 +08:00
Fangjun Kuang	c1608b3524	Support CED models (#792)	2024-04-19 15:20:37 +08:00
Fangjun Kuang	d97a283dbb	Add Android demo for spoken language identification using Whisper multilingual models (#783)	2024-04-18 14:33:59 +08:00
Fangjun Kuang	69440e481f	Add WearOS demo for audio tagging (#777)	2024-04-17 12:22:17 +08:00
Fangjun Kuang	bcd9e48150	Add Android demo for audio tagging (#776). See https://k2-fsa.github.io/sherpa/onnx/audio-tagging/apk.html	2024-04-16 20:47:16 +08:00
Fangjun Kuang	a5f8fbc83f	Support heteronyms in Chinese TTS (#738)	2024-04-08 11:01:30 +08:00
Fangjun Kuang	3acf373b07	add more piper models (#725)	2024-04-01 11:39:52 +08:00
Fangjun Kuang	12efbf7397	Sign released TTS APKs (#710)	2024-03-27 19:34:37 +08:00
Fangjun Kuang	69c7880c4d	Add Golang API for VAD (#708)	2024-03-27 12:09:39 +08:00
Fangjun Kuang	bd66f7a7d0	Build Android TTS APKs for coqui-ai/TTS models (#704)	2024-03-26 14:05:26 +08:00
Fangjun Kuang	a628002d8f	Release v1.9.12 (#661)	2024-03-11 18:52:34 +08:00
Fangjun Kuang	d2cc48ded5	Add more Chinese TTS models (Mandarin and Cantonese) (#589)	2024-02-20 15:05:35 +08:00
Fangjun Kuang	035a82df33	Add a new Persian tts model (#555)	2024-01-27 20:47:54 +08:00
Fangjun Kuang	bbd7c7fc18	Add Android demo for speaker recognition (#536). See pre-built Android APKs at https://k2-fsa.github.io/sherpa/onnx/speaker-identification/apk.html	2024-01-23 16:50:52 +08:00

61 Commits