enginex-mr_series-sherpa-onnx

EngineX-Iluvatar/enginex-mr_series-sherpa-onnx

Archived

Author	SHA1	Message	Date
Fangjun Kuang	e7ffcbd677	Add APIs about max speech duration in VAD for various programming languages (#1349 )	2024-09-14 12:30:13 +08:00
Fangjun Kuang	544857b097	Fix building (#1343 )	2024-09-13 13:33:52 +08:00
lxiao336	65cfa7548a	re-pull-request allow tokens and hotwords be loaded from buffered string directly (#1339 ) Co-authored-by: xiao <shawl336@163.com>	2024-09-13 09:58:17 +08:00
Fangjun Kuang	e66d4c414a	Fix releasing dart packages. (#1317 )	2024-09-04 12:12:13 +08:00
Fangjun Kuang	d60a4d418e	Provide prebuilt .jar files for different java versions. (#1307 )	2024-08-30 14:16:31 +08:00
Fangjun Kuang	6b8877f185	Downgrade flutter sdk versions. (#1305 )	2024-08-30 11:47:27 +08:00
Fangjun Kuang	0ccd3a4c3f	remove extra files from linux/macos/windows jni libs (#1301 )	2024-08-29 10:45:38 +08:00
Fangjun Kuang	9064430c3e	Fix releasing wasm app for vad+asr (#1300 )	2024-08-29 08:47:38 +08:00
Fangjun Kuang	6ec57327ce	add vad+sense voice example for C API (#1291 )	2024-08-27 16:11:24 +08:00
Fangjun Kuang	5ed8e31868	Add VAD and keyword spotting for the Node package with WebAssembly (#1286 )	2024-08-24 23:05:54 +08:00
Fangjun Kuang	537e163dd0	WebAssembly example for VAD + Non-streaming ASR (#1284 )	2024-08-24 13:24:52 +08:00
Fangjun Kuang	1ef8a7a202	Add WebAssembly for VAD (#1281 )	2024-08-23 17:08:37 +08:00
Fangjun Kuang	fb09f8fae3	Set batch size to 1 for more streaming ASR models (#1280 )	2024-08-23 11:06:55 +08:00
Fangjun Kuang	0e0d04a97a	Provide models for mobile-only platforms by fixing batch size to 1 (#1276 )	2024-08-22 19:36:24 +08:00
Fangjun Kuang	5a2aa110b8	Text to speech API for Object Pascal. (#1273 )	2024-08-20 20:52:16 +08:00
Fangjun Kuang	f93f0ca94d	Use a separate thread to initialize models for lazarus examples. (#1270 ) So that the main thread is not blocked and the user interface is responsive.	2024-08-18 14:59:48 +08:00
Fangjun Kuang	63713ecbf0	Build generating subtitles APPs for more models (#1265 )	2024-08-16 20:11:24 +08:00
Fangjun Kuang	fbe35ba736	Add Lazarus example for generating subtitles using Silero VAD with non-streaming ASR (#1251 )	2024-08-15 22:19:45 +08:00
Fangjun Kuang	ca729faebf	Support reading multi-channel wave files with 8/16/32-bit encoded samples (#1258 )	2024-08-15 14:54:43 +08:00
Han Zhu	f300ec0f98	Add more C API examples (#1255 ) C API examples for zipformer, paraformer, and TeleSpeech-ASR CTC models.	2024-08-14 10:52:47 +08:00
Fangjun Kuang	619279b162	Pascal API for VAD (#1249 )	2024-08-13 16:16:51 +08:00
Fangjun Kuang	a7dc6c2c16	Pascal API for non-streaming ASR (#1247 )	2024-08-12 23:33:35 +08:00
Fangjun Kuang	5791b695ea	Pascal API for streaming ASR (#1246 )	2024-08-12 19:55:51 +08:00
Fangjun Kuang	65f1c0fab2	Add Pascal API for reading wave files (#1243 )	2024-08-11 22:43:42 +08:00
Fangjun Kuang	9ee2943ed4	Add CI tests for online punctuation models (#1226 )	2024-08-06 18:10:30 +08:00
Fangjun Kuang	561d04dd92	describe how to add new words for MeloTTS models (#1209 )	2024-08-03 11:19:02 +08:00
Fangjun Kuang	35c1b4a7a9	Add ReazonSpeech Japanese pre-trained model (#1203 )	2024-08-02 10:21:24 +08:00
Fangjun Kuang	ec98110e11	Add speaker identification and verification example for Dart API (#1194 )	2024-07-31 13:53:52 +08:00
Fangjun Kuang	06fd50f536	Add test about whisper large-v3 for .Net (#1187 )	2024-07-29 20:49:38 +08:00
Fangjun Kuang	b1711ecaa1	Fix ffmpeg c api example (#1185 )	2024-07-29 14:27:55 +08:00
Fangjun Kuang	646f99c870	Dart API for adding punctuations to text (#1182 )	2024-07-29 12:41:52 +08:00
Fangjun Kuang	cd1fedaa49	Add Dart API for audio tagging (#1181 )	2024-07-29 11:15:14 +08:00
Fangjun Kuang	69b6b47d91	Add vad with non-streaming ASR examples for Dart API (#1180 )	2024-07-28 23:01:03 +08:00
Fangjun Kuang	d279c8d20e	Add more Python examples for SenseVoice (#1179 )	2024-07-28 21:54:38 +08:00
Fangjun Kuang	ea1d81bdfe	C api example for sense voice (#1165 )	2024-07-22 16:54:00 +08:00
Fangjun Kuang	dd300b1de5	Add Java and Kotlin API for sense voice (#1164 )	2024-07-22 14:08:40 +08:00
Fangjun Kuang	ac8223bd8a	Add Dart API for keyword spotter (#1162 )	2024-07-22 10:53:34 +08:00
Fangjun Kuang	70d14353bb	Add WebAssembly for SenseVoice (#1158 )	2024-07-21 15:39:55 +08:00
Fangjun Kuang	8f4d332aab	Add Go API for SenseVoice (#1154 )	2024-07-20 23:41:53 +08:00
Fangjun Kuang	25f0a10468	Add C++ runtime for SenseVoice models (#1148 )	2024-07-18 22:54:18 +08:00
Fangjun Kuang	346f419f39	export sense-voice to onnx (#1144 )	2024-07-18 00:18:38 +08:00
Fangjun Kuang	4198d9a166	Provide pre-built wheels with CUDA support. (#1143 )	2024-07-17 22:59:13 +08:00
Fangjun Kuang	803c02db0a	publish all pre-built wheels to huggingface (#1142 ) pypi.org provides only 10GB of free space for open-source projects. Each new release of sherpa-onnx occupies about 800MB, so we have to delete previous releases; otherwise pypi.org refuses to accept new releases due to limited space. To let users install previous versions, we also publish wheels to huggingface and users can find them at https://k2-fsa.github.io/sherpa/onnx/cpu.html and https://k2-fsa.github.io/sherpa/onnx/cpu-cn.html (for users without access to huggingface.co)	2024-07-17 14:41:27 +08:00
Fangjun Kuang	9e448d03bc	Provide npm package for 32-bit Windows x86 (#1141 )	2024-07-17 12:33:15 +08:00
Fangjun Kuang	960eb7529e	Add C++ runtime for MeloTTS (#1138 )	2024-07-16 15:55:02 +08:00
Fangjun Kuang	95485411fa	Support English for MeloTTS models. (#1134 )	2024-07-15 19:49:22 +08:00
Fangjun Kuang	c35200dccf	Revert to onnxruntime 1.17.1 (#1131 )	2024-07-15 14:24:08 +08:00
Fangjun Kuang	04c2319c2c	Export MeloTTS to ONNX (#1129 )	2024-07-15 10:47:19 +08:00
Fangjun Kuang	ab71c3976d	Add int8 quantized whisper large models (#1126 )	2024-07-13 22:30:06 +08:00
Fangjun Kuang	3951a12f8d	Add pre-trained models for the Libriheavy dataset (#1122 )	2024-07-13 19:21:13 +08:00

350 Commits