enginex_bi_series-sherpa-onnx

EngineX-Iluvatar/enginex_bi_series-sherpa-onnx

Archived

Author	SHA1	Message	Date
Fangjun Kuang	17c8237ee4	Fix releasing npm package and fix building Android VAD+ASR example (#1288 )	2024-08-26 10:18:48 +08:00
Hán Trung Kiên	452555b218	update generate-asset-list.py (#1287 )	2024-08-25 08:57:16 +08:00
Fangjun Kuang	5ed8e31868	Add VAD and keyword spotting for the Node package with WebAssembly (#1286 )	2024-08-24 23:05:54 +08:00
Fangjun Kuang	537e163dd0	WebAssembly example for VAD + Non-streaming ASR (#1284 )	2024-08-24 13:24:52 +08:00
Fangjun Kuang	1ef8a7a202	Add WebAssembly for VAD (#1281 )	2024-08-23 17:08:37 +08:00
Fangjun Kuang	fb09f8fae3	Set batch size to 1 for more streaming ASR models (#1280 )	2024-08-23 11:06:55 +08:00
Malcolm Ke Win	c61423ec5a	Update wave-reader.cc (#1278 ) * Update wave-reader.cc missing "#include <cstdint>"	2024-08-22 23:22:45 +08:00
Fangjun Kuang	0e0d04a97a	Provide models for mobile-only platforms by fixing batch size to 1 (#1276 )	2024-08-22 19:36:24 +08:00
Robin Zhong	d8001d6edc	update kotlin api for better release native object and add user-friendly apis. (#1275 )	2024-08-22 19:18:11 +08:00
Fangjun Kuang	5a2aa110b8	Text to speech API for Object Pascal. (#1273 )	2024-08-20 20:52:16 +08:00
Fangjun Kuang	e34a1a2aa3	Object pascal examples for recording and playing audio with portaudio. (#1271 ) The recording example can be used for speech recognition while the playing example can be used for text to speech. The portaudio wrapper for object pascal is copied from https://github.com/UltraStar-Deluxe/USDX/blob/master/src/lib/portaudio/portaudio.pas	2024-08-18 19:51:08 +08:00
Fangjun Kuang	f93f0ca94d	Use a separate thread to initialize models for lazarus examples. (#1270 ) So that the main thread is not blocked and the user interface is responsive.	2024-08-18 14:59:48 +08:00
Emmanuel Schmidbauer	8c087d9110	flutter: add lang, emotion, event to OfflineRecognizerResult (#1268 )	2024-08-17 07:21:59 +08:00
Fangjun Kuang	88809753ab	Release v1.10.22 (#1267 )	2024-08-16 22:40:49 +08:00
Fangjun Kuang	9dcea49dba	Fix looking up OOVs in lexicon.txt for MeloTTS models. (#1266 ) If an English word does not exist in the lexicon, we split it into characters. For instance, if the word TTS does not exist in lexicon.txt, we split it into 3 characters T, T, and S.	2024-08-16 22:10:03 +08:00
Fangjun Kuang	63713ecbf0	Build generating subtitles APPs for more models (#1265 )	2024-08-16 20:11:24 +08:00
Ikko Eltociear Ashimine	a3e98750e9	chore: update online-stream.h (#1264 ) Fix typos.	2024-08-16 15:17:15 +08:00
Fangjun Kuang	fbe35ba736	Add Lazarus example for generating subtitles using Silero VAD with non-streaming ASR (#1251 )	2024-08-15 22:19:45 +08:00
Fangjun Kuang	97a6a2a16a	Enable IPO only for Release build. (#1261 )	2024-08-15 18:16:42 +08:00
Fangjun Kuang	ca729faebf	Support reading multi-channel wave files with 8/16/32-bit encoded samples (#1258 )	2024-08-15 14:54:43 +08:00
Robin Zhong	62c4d4ab62	Add emotion, event of SenseVoice. (#1257 ) * Add emotion, event of SenseVoice. * Fix tokens size check and update java api. https://github.com/k2-fsa/sherpa-onnx/pull/1257	2024-08-14 15:50:13 +08:00
Han Zhu	f300ec0f98	Add more C API examples (#1255 ) C API examples for zipformer, paraformer, and TeleSpeech-ASR CTC models.	2024-08-14 10:52:47 +08:00
ivan provalov	9f06b059d7	Update offline-recognizer.cc (#1253 ) Adding setConfig method to JNI to support setting a config on the previously initialized offline-recognizer.	2024-08-13 23:04:51 +08:00
Fangjun Kuang	619279b162	Pascal API for VAD (#1249 )	2024-08-13 16:16:51 +08:00
Fangjun Kuang	a7dc6c2c16	Pascal API for non-streaming ASR (#1247 )	2024-08-12 23:33:35 +08:00
Fangjun Kuang	5791b695ea	Pascal API for streaming ASR (#1246 )	2024-08-12 19:55:51 +08:00
Fangjun Kuang	65f1c0fab2	Add Pascal API for reading wave files (#1243 )	2024-08-11 22:43:42 +08:00
Fangjun Kuang	968623a477	Exclude .DS_Store files from flutter tts assets (#1238 )	2024-08-09 13:19:27 +08:00
Fangjun Kuang	94e256244d	Add blank penalty for various language bindings. (#1234 )	2024-08-08 10:43:31 +08:00
Parth Khiera	ba4cb6169f	feat: addition of blank_penalty config in online_recognizer (#1232 )	2024-08-08 09:10:17 +08:00
Fangjun Kuang	8a5f5c1999	Fix python two pass ASR examples (#1230 )	2024-08-07 18:35:38 +08:00
xsjk	1da75ee3c0	Fix typo in offline-lm-config.cc (#1229 )	2024-08-07 15:38:34 +08:00
Fangjun Kuang	9ee2943ed4	Add CI tests for online punctuation models (#1226 )	2024-08-06 18:10:30 +08:00
Fangjun Kuang	375c055ff8	Fix style issues for online punctuation source files (#1225 )	2024-08-06 17:43:24 +08:00
jianyou	1414e4dc61	Add online punctuation and casing prediction model for English language (#1224 )	2024-08-06 17:33:38 +08:00
Fangjun Kuang	52830cc910	Add MeloTTS example for ios (#1223 )	2024-08-06 14:48:54 +08:00
Fangjun Kuang	6422966a7f	Support passing TTS callback in Swift API (#1218 )	2024-08-05 14:06:21 +08:00
Fangjun Kuang	9caa488019	Fix setting SenseVoice language. (#1214 )	2024-08-04 19:02:23 +08:00
Fangjun Kuang	c2dce19140	Update README to include Rust. (#1212 )	2024-08-04 12:20:05 +08:00
Fangjun Kuang	d5f486878d	Remove libonnxruntime_providers_cuda.so as a dependency. (#1210 )	2024-08-03 16:25:23 +08:00
Fangjun Kuang	561d04dd92	describe how to add new words for MeloTTS models (#1209 )	2024-08-03 11:19:02 +08:00
Fangjun Kuang	35c1b4a7a9	Add ReazonSpeech Japanese pre-trained model (#1203 )	2024-08-02 10:21:24 +08:00
Fangjun Kuang	53484fcd9b	Fix reading non-standard wav files. (#1199 )	2024-08-01 17:48:04 +08:00
Fangjun Kuang	ec98110e11	Add speaker identification and verification exmaple for Dart API (#1194 )	2024-07-31 13:53:52 +08:00
Fangjun Kuang	963aaba82b	Add Chinese+English tts example for flutter (#1192 )	2024-07-30 18:38:43 +08:00
Fangjun Kuang	c1b5fce01b	Fix copying asset files for flutter examples. (#1191 ) If the target file exists but has a different file size, we need to copy the source file to the target file.	2024-07-30 18:24:56 +08:00
Fangjun Kuang	9e02f88dbb	Non-streaming WebSocket client for Java. (#1190 )	2024-07-30 17:21:33 +08:00
Fangjun Kuang	06fd50f536	Add test about whisper large-v3 for .Net (#1187 )	2024-07-29 20:49:38 +08:00
Fangjun Kuang	86b4c9f535	Fix splitting sentences for MeloTTS (#1186 )	2024-07-29 17:04:45 +08:00
Fangjun Kuang	b1711ecaa1	Fix ffmpeg c api example (#1185 )	2024-07-29 14:27:55 +08:00

1 2 3 4 5 ...

779 Commits