EngineX-Iluvatar/enginex-mr_series-sherpa-onnx

Archived

Author	SHA1	Message	Date
Fangjun Kuang	bc08160820	Export Pyannote speaker segmentation models to onnx (#1382)	2024-09-29 14:23:56 +08:00
Fangjun Kuang	11f0cb7e1c	Support Parakeet models from NeMo (#1381)	2024-09-27 17:12:00 +08:00
Fangjun Kuang	12d04ce8ed	Fix running MeloTTS models on GPU. (#1379) We need to use opset 18 to export the model to onnx.	2024-09-26 16:51:43 +08:00
Fangjun Kuang	d8809b520e	Fix CI errors introduced by supporting loading keywords from buffers (#1366)	2024-09-20 19:04:21 +08:00
Fangjun Kuang	576a3aa90d	Add non-streaming ONNX models for Russian ASR (#1358)	2024-09-18 13:43:49 +08:00
Fangjun Kuang	cddac52780	Support passing utf-8 strings from JavaScript to C++. (#1355) We first convert utf-16 strings to Uint8Array and then we pass the array to C++.	2024-09-18 11:03:42 +08:00
lllwan	bf06b268d0	Fix sherpa_onnx.go (#1353)	2024-09-17 13:39:56 +08:00
Fangjun Kuang	e7ffcbd677	Add APIs about max speech duration in VAD for various programming languages (#1349)	2024-09-14 12:30:13 +08:00
Fangjun Kuang	544857b097	Fix building (#1343)	2024-09-13 13:33:52 +08:00
Fangjun Kuang	6b8877f185	Downgrade flutter sdk versions. (#1305)	2024-08-30 11:47:27 +08:00
Fangjun Kuang	c38634dfcf	two-pass Android APK for SenseVoice (#1302)	2024-08-29 12:08:49 +08:00
Fangjun Kuang	9064430c3e	Fix releasing wasm app for vad+asr (#1300)	2024-08-29 08:47:38 +08:00
Emmanuel Schmidbauer	a8556e31ba	add Tokens []string, Timestamps []float32, Lang string, Emotion string, Event string (#1277)	2024-08-27 06:35:59 +08:00
Fangjun Kuang	17c8237ee4	Fix releasing npm package and fix building Android VAD+ASR example (#1288)	2024-08-26 10:18:48 +08:00
Fangjun Kuang	5ed8e31868	Add VAD and keyword spotting for the Node package with WebAssembly (#1286)	2024-08-24 23:05:54 +08:00
Fangjun Kuang	537e163dd0	WebAssembly example for VAD + Non-streaming ASR (#1284)	2024-08-24 13:24:52 +08:00
Fangjun Kuang	1ef8a7a202	Add WebAssembly for VAD (#1281)	2024-08-23 17:08:37 +08:00
Fangjun Kuang	fb09f8fae3	Set batch size to 1 for more streaming ASR models (#1280)	2024-08-23 11:06:55 +08:00
Fangjun Kuang	0e0d04a97a	Provide models for mobile-only platforms by fixing batch size to 1 (#1276)	2024-08-22 19:36:24 +08:00
Fangjun Kuang	5a2aa110b8	Text to speech API for Object Pascal. (#1273)	2024-08-20 20:52:16 +08:00
Fangjun Kuang	63713ecbf0	Build generating subtitles APPs for more models (#1265)	2024-08-16 20:11:24 +08:00
Fangjun Kuang	fbe35ba736	Add Lazarus example for generating subtitles using Silero VAD with non-streaming ASR (#1251)	2024-08-15 22:19:45 +08:00
Fangjun Kuang	94e256244d	Add blank penalty for various language bindings. (#1234)	2024-08-08 10:43:31 +08:00
Parth Khiera	ba4cb6169f	feat: addition of blank_penalty config in online_recognizer (#1232)	2024-08-08 09:10:17 +08:00
Fangjun Kuang	561d04dd92	describe how to add new words for MeloTTS models (#1209)	2024-08-03 11:19:02 +08:00
Fangjun Kuang	35c1b4a7a9	Add ReazonSpeech Japanese pre-trained model (#1203)	2024-08-02 10:21:24 +08:00
Fangjun Kuang	ec98110e11	Add speaker identification and verification example for Dart API (#1194)	2024-07-31 13:53:52 +08:00
Fangjun Kuang	646f99c870	Dart API for adding punctuations to text (#1182)	2024-07-29 12:41:52 +08:00
Fangjun Kuang	cd1fedaa49	Add Dart API for audio tagging (#1181)	2024-07-29 11:15:14 +08:00
Fangjun Kuang	69b6b47d91	Add vad with non-streaming ASR examples for Dart API (#1180)	2024-07-28 23:01:03 +08:00
Fangjun Kuang	4e6aeff07e	Refactor C API to prefix each API with SherpaOnnx. (#1171)	2024-07-26 18:47:02 +08:00
Fangjun Kuang	994c3e7c96	Add VAD + Non-streaming ASR example for JavaScript API. (#1170)	2024-07-26 12:42:08 +08:00
Fangjun Kuang	dd300b1de5	Add Java and Kotlin API for sense voice (#1164)	2024-07-22 14:08:40 +08:00
Fangjun Kuang	ac8223bd8a	Add Dart API for keyword spotter (#1162)	2024-07-22 10:53:34 +08:00
Fangjun Kuang	ffdb23a8ec	Add dart API for SenseVoice (#1159)	2024-07-21 21:48:12 +08:00
Fangjun Kuang	c3260ef842	Add JavaScript API for SenseVoice (#1157)	2024-07-21 10:14:14 +08:00
Fangjun Kuang	8f4d332aab	Add Go API for SenseVoice (#1154)	2024-07-20 23:41:53 +08:00
Fangjun Kuang	e472180f2c	Add C# API for SenseVoice models (#1151)	2024-07-20 17:09:23 +08:00
Fangjun Kuang	25f0a10468	Add C++ runtime for SenseVoice models (#1148)	2024-07-18 22:54:18 +08:00
Fangjun Kuang	3bae5c3fe5	test exported sense voice models (#1147)	2024-07-18 12:12:44 +08:00
Fangjun Kuang	346f419f39	export sense-voice to onnx (#1144)	2024-07-18 00:18:38 +08:00
Fangjun Kuang	9e448d03bc	Provide npm package for 32-bit Windows x86 (#1141)	2024-07-17 12:33:15 +08:00
Fangjun Kuang	960eb7529e	Add C++ runtime for MeloTTS (#1138)	2024-07-16 15:55:02 +08:00
Fangjun Kuang	95485411fa	Support English for MeloTTS models. (#1134)	2024-07-15 19:49:22 +08:00
Fangjun Kuang	fa07bbc176	Add APK for small paraformer (#1133)	2024-07-15 19:44:36 +08:00
Fangjun Kuang	c35200dccf	Revert to onnxruntime 1.17.1 (#1131)	2024-07-15 14:24:08 +08:00
Fangjun Kuang	04c2319c2c	Export MeloTTS to ONNX (#1129)	2024-07-15 10:47:19 +08:00
Fangjun Kuang	ab71c3976d	Add int8 quantized whisper large models (#1126)	2024-07-13 22:30:06 +08:00
Fangjun Kuang	3951a12f8d	Add pre-trained models for the Libriheavy dataset (#1122)	2024-07-13 19:21:13 +08:00
Fangjun Kuang	b5093e27f9	Fix publishing apks to huggingface (#1121) Save APKs for each release in a separate directory. Huggingface requires that each directory cannot contain more than 1000 files. Since we have so many tts models and for each model we need to build APKs of 4 different ABIs, it is a workaround for the huggingface's constraint by placing them into separate directories for different releases.	2024-07-13 16:14:00 +08:00

220 Commits