enginex-mr_series-sherpa-onnx

EngineX-Iluvatar/enginex-mr_series-sherpa-onnx

Archived

Author	SHA1	Message	Date
Fangjun Kuang	df4150dc5d	Upload speaker embedding models to huggingface (#1428 ) See also https://huggingface.co/spaces/k2-fsa/speaker-diarization	2024-10-14 16:20:00 +08:00
Fangjun Kuang	5a22f74b2b	Android demo for speaker diarization (#1423 )	2024-10-13 14:02:57 +08:00
Fangjun Kuang	5e273c5be4	Pascal API for speaker diarization (#1420 )	2024-10-12 12:28:38 +08:00
Fangjun Kuang	1ed803adc1	Dart API for speaker diarization (#1418 )	2024-10-11 21:17:41 +08:00
Fangjun Kuang	1851ff6337	Java API for speaker diarization (#1416 )	2024-10-11 16:51:40 +08:00
Fangjun Kuang	eefc172095	JavaScript API with WebAssembly for speaker diarization (#1414 ) #1408 uses [node-addon-api](https://github.com/nodejs/node-addon-api) to call C API from JavaScript, whereas this pull request uses WebAssembly to call C API from JavaScript.	2024-10-11 11:40:10 +08:00
Fangjun Kuang	1d061df355	WebAssembly example for speaker diarization (#1411 )	2024-10-10 22:14:45 +08:00
Fangjun Kuang	a45e5dba99	C# API for speaker diarization (#1407 )	2024-10-10 14:29:05 +08:00
Fangjun Kuang	df681e9807	Go API for speaker diarization (#1403 )	2024-10-09 20:10:44 +08:00
Fangjun Kuang	8535b1d3bb	Python API for speaker diarization. (#1400 )	2024-10-09 14:13:26 +08:00
Fangjun Kuang	59407edcad	C++ API for speaker diarization (#1396 )	2024-10-09 12:01:20 +08:00
Fangjun Kuang	70165cb42d	Speaker diarization example with onnxruntime Python API (#1395 )	2024-10-06 16:37:29 +08:00
Fangjun Kuang	66feecb2b5	support whisper turbo (#1390 )	2024-10-02 18:13:34 +08:00
Fangjun Kuang	b965f14cf0	Add Python API for clustering (#1385 )	2024-09-30 11:33:15 +08:00
Fangjun Kuang	bc08160820	Export Pyannote speaker segmentation models to onnx (#1382 )	2024-09-29 14:23:56 +08:00
Fangjun Kuang	11f0cb7e1c	Support Parakeet models from NeMo (#1381 )	2024-09-27 17:12:00 +08:00
lxiao336	06b61ccad8	Allow more online models to load tokens file from the memory (#1352 ) Co-authored-by: xiao <shawl336@6163.com>	2024-09-20 16:38:41 +08:00
Fangjun Kuang	647b63ea44	Release v1.10.27 (#1359 )	2024-09-19 10:49:29 +08:00
Fangjun Kuang	576a3aa90d	Add non-streaming ONNX models for Russian ASR (#1358 )	2024-09-18 13:43:49 +08:00
Fangjun Kuang	e7ffcbd677	Add APIs about max speech duration in VAD for various programming languages (#1349 )	2024-09-14 12:30:13 +08:00
Fangjun Kuang	544857b097	Fix building (#1343 )	2024-09-13 13:33:52 +08:00
lxiao336	65cfa7548a	re-pull-request allow tokens and hotwords be loaded from buffered string directly (#1339 ) Co-authored-by: xiao <shawl336@163.com>	2024-09-13 09:58:17 +08:00
Fangjun Kuang	e66d4c414a	Fix releasing dart packages. (#1317 )	2024-09-04 12:12:13 +08:00
Fangjun Kuang	d60a4d418e	Provide prebuilt .jar files for different java versions. (#1307 )	2024-08-30 14:16:31 +08:00
Fangjun Kuang	6b8877f185	Downgrade flutter sdk versions. (#1305 )	2024-08-30 11:47:27 +08:00
Fangjun Kuang	0ccd3a4c3f	remove extra files from linux/macos/windows jni libs (#1301 )	2024-08-29 10:45:38 +08:00
Fangjun Kuang	9064430c3e	Fix releasing wasm app for vad+asr (#1300 )	2024-08-29 08:47:38 +08:00
Fangjun Kuang	6ec57327ce	add vad+sense voice example for C API (#1291 )	2024-08-27 16:11:24 +08:00
Fangjun Kuang	5ed8e31868	Add VAD and keyword spotting for the Node package with WebAssembly (#1286 )	2024-08-24 23:05:54 +08:00
Fangjun Kuang	537e163dd0	WebAssembly example for VAD + Non-streaming ASR (#1284 )	2024-08-24 13:24:52 +08:00
Fangjun Kuang	1ef8a7a202	Add WebAssembly for VAD (#1281 )	2024-08-23 17:08:37 +08:00
Fangjun Kuang	fb09f8fae3	Set batch size to 1 for more streaming ASR models (#1280 )	2024-08-23 11:06:55 +08:00
Fangjun Kuang	0e0d04a97a	Provide models for mobile-only platforms by fixing batch size to 1 (#1276 )	2024-08-22 19:36:24 +08:00
Fangjun Kuang	5a2aa110b8	Text to speech API for Object Pascal. (#1273 )	2024-08-20 20:52:16 +08:00
Fangjun Kuang	f93f0ca94d	Use a separate thread to initialize models for lazarus examples. (#1270 ) So that the main thread is not blocked and the user interface is responsive.	2024-08-18 14:59:48 +08:00
Fangjun Kuang	63713ecbf0	Build generating subtitles APPs for more models (#1265 )	2024-08-16 20:11:24 +08:00
Fangjun Kuang	fbe35ba736	Add Lazarus example for generating subtitles using Silero VAD with non-streaming ASR (#1251 )	2024-08-15 22:19:45 +08:00
Fangjun Kuang	ca729faebf	Support reading multi-channel wave files with 8/16/32-bit encoded samples (#1258 )	2024-08-15 14:54:43 +08:00
Han Zhu	f300ec0f98	Add more C API examples (#1255 ) C API examples for zipformer, paraformer, and TeleSpeech-ASR CTC models.	2024-08-14 10:52:47 +08:00
Fangjun Kuang	619279b162	Pascal API for VAD (#1249 )	2024-08-13 16:16:51 +08:00
Fangjun Kuang	a7dc6c2c16	Pascal API for non-streaming ASR (#1247 )	2024-08-12 23:33:35 +08:00
Fangjun Kuang	5791b695ea	Pascal API for streaming ASR (#1246 )	2024-08-12 19:55:51 +08:00
Fangjun Kuang	65f1c0fab2	Add Pascal API for reading wave files (#1243 )	2024-08-11 22:43:42 +08:00
Fangjun Kuang	9ee2943ed4	Add CI tests for online punctuation models (#1226 )	2024-08-06 18:10:30 +08:00
Fangjun Kuang	561d04dd92	describe how to add new words for MeloTTS models (#1209 )	2024-08-03 11:19:02 +08:00
Fangjun Kuang	35c1b4a7a9	Add ReazonSpeech Japanese pre-trained model (#1203 )	2024-08-02 10:21:24 +08:00
Fangjun Kuang	ec98110e11	Add speaker identification and verification example for Dart API (#1194 )	2024-07-31 13:53:52 +08:00
Fangjun Kuang	06fd50f536	Add test about whisper large-v3 for .Net (#1187 )	2024-07-29 20:49:38 +08:00
Fangjun Kuang	b1711ecaa1	Fix ffmpeg c api example (#1185 )	2024-07-29 14:27:55 +08:00
Fangjun Kuang	646f99c870	Dart API for adding punctuations to text (#1182 )	2024-07-29 12:41:52 +08:00

319 Commits