enginex-mr_series-sherpa-onnx

EngineX-Iluvatar/enginex-mr_series-sherpa-onnx

Archived

Author	SHA1	Message	Date
Fangjun Kuang	df4150dc5d	Upload speaker embedding models to huggingface (#1428 ) See also https://huggingface.co/spaces/k2-fsa/speaker-diarization	2024-10-14 16:20:00 +08:00
Fangjun Kuang	99f320b893	Release v1.10.28 (#1424 )	2024-10-13 15:27:38 +08:00
Fangjun Kuang	5a22f74b2b	Android demo for speaker diarization (#1423 )	2024-10-13 14:02:57 +08:00
Fangjun Kuang	94b26ff07c	Android JNI support for speaker diarization (#1421 )	2024-10-12 13:03:48 +08:00
Fangjun Kuang	5e273c5be4	Pascal API for speaker diarization (#1420 )	2024-10-12 12:28:38 +08:00
Fangjun Kuang	1ed803adc1	Dart API for speaker diarization (#1418 )	2024-10-11 21:17:41 +08:00
Fangjun Kuang	1851ff6337	Java API for speaker diarization (#1416 )	2024-10-11 16:51:40 +08:00
Fangjun Kuang	2d412b1190	Kotlin API for speaker diarization (#1415 )	2024-10-11 14:41:53 +08:00
Fangjun Kuang	eefc172095	JavaScript API with WebAssembly for speaker diarization (#1414 ) #1408 uses [node-addon-api](https://github.com/nodejs/node-addon-api) to call C API from JavaScript, whereas this pull request uses WebAssembly to call C API from JavaScript.	2024-10-11 11:40:10 +08:00
Fangjun Kuang	f1b311ee4f	Handle audio files less than 10s long for speaker diarization. (#1412 ) If the input audio file is less than 10 seconds long, there is only one chunk, and there is no need to compute embeddings or do clustering. We can use the segmentation result from the speaker segmentation model directly.	2024-10-11 10:27:16 +08:00
Fangjun Kuang	1d061df355	WebAssembly exmaple for speaker diarization (#1411 )	2024-10-10 22:14:45 +08:00
Fangjun Kuang	67349b52f2	JavaScript API (node-addon) for speaker diarization (#1408 )	2024-10-10 15:51:31 +08:00
Fangjun Kuang	a45e5dba99	C# API for speaker diarization (#1407 )	2024-10-10 14:29:05 +08:00
Fangjun Kuang	bd50e79590	Update readme to include more external projects using sherpa-onnx (#1405 )	2024-10-10 10:27:14 +08:00
Fangjun Kuang	1571344509	Swift API for speaker diarization (#1404 )	2024-10-09 23:25:39 +08:00
Fangjun Kuang	df681e9807	Go API for speaker diarization (#1403 )	2024-10-09 20:10:44 +08:00
Yongzeng Liu	97654122fa	docs(nodejs-addon-examples): add guide for pnpm user (#1401 )	2024-10-09 18:12:41 +08:00
Fangjun Kuang	d468527f62	C API for speaker diarization (#1402 )	2024-10-09 17:10:03 +08:00
Fangjun Kuang	8535b1d3bb	Python API for speaker diarization. (#1400 )	2024-10-09 14:13:26 +08:00
Fangjun Kuang	59407edcad	C++ API for speaker diarization (#1396 )	2024-10-09 12:01:20 +08:00
Fangjun Kuang	70165cb42d	Speaker diarization example with onnxruntime Python API (#1395 )	2024-10-06 16:37:29 +08:00
Askars	5f50cbf65a	context_state is not set correctly when previous context is passed after reset (#1393 ) Co-authored-by: vsd-vector <askars.salimbajevs@tilde.lv>	2024-10-03 16:42:09 +08:00
Fangjun Kuang	66feecb2b5	support whisper turbo (#1390 )	2024-10-02 18:13:34 +08:00
Fangjun Kuang	b965f14cf0	Add Python API for clustering (#1385 )	2024-09-30 11:33:15 +08:00
Fangjun Kuang	70568c2df7	Support Agglomerative clustering. (#1384 ) We use the open-source implementation from https://github.com/cdalitz/hclust-cpp	2024-09-29 23:44:29 +08:00
Fangjun Kuang	bc08160820	Export Pyannote speaker segmentation models to onnx (#1382 )	2024-09-29 14:23:56 +08:00
Fangjun Kuang	11f0cb7e1c	Support Parakeet models from NeMo (#1381 )	2024-09-27 17:12:00 +08:00
Fangjun Kuang	12d04ce8ed	Fix running MeloTTS models on GPU. (#1379 ) We need to use opset 18 to export the model to onnx.	2024-09-26 16:51:43 +08:00
Flutter Painter	69c8e7b9d0	Update online_model.dart (#1375 ) fix typo on french joiner file	2024-09-25 06:40:22 +08:00
Fangjun Kuang	d8809b520e	Fix CI errors introduced by supporting loading keywords from buffers (#1366 )	2024-09-20 19:04:21 +08:00
lxiao336	06b61ccad8	Allow more online models to load tokens file from the memory (#1352 ) Co-authored-by: xiao <shawl336@6163.com>	2024-09-20 16:38:41 +08:00
Fangjun Kuang	73c90ec871	Fix swift example for generating subtitles. (#1362 ) We need to invoke vad.flush() at the end.	2024-09-20 11:44:25 +08:00
Fangjun Kuang	647b63ea44	Release v1.10.27 (#1359 )	2024-09-19 10:49:29 +08:00
Fangjun Kuang	576a3aa90d	Add non-streaming ONNX models for Russian ASR (#1358 )	2024-09-18 13:43:49 +08:00
Fangjun Kuang	7e642325f3	Fix building Flutter TTS examples for Linux (#1356 )	2024-09-18 12:04:02 +08:00
Fangjun Kuang	cddac52780	Support passing utf-8 strings from JavaScript to C++. (#1355 ) We first convert utf-16 strings to Uint8Array and then we pass the array to C++.	2024-09-18 11:03:42 +08:00
lllwan	bf06b268d0	Fix sherpa_onnx.go (#1353 )	2024-09-17 13:39:56 +08:00
Fangjun Kuang	9dade25d3e	Release v1.10.26 (#1350 )	2024-09-14 14:37:42 +08:00
Fangjun Kuang	e7ffcbd677	Add APIs about max speech duration in VAD for various programming languages (#1349 )	2024-09-14 12:30:13 +08:00
Fangjun Kuang	1423ddb1f0	Support specifying max speech duration for VAD. (#1348 )	2024-09-14 10:57:46 +08:00
Fangjun Kuang	5d761712db	Support lang/emotion/event results from SenseVoice in Swift API. (#1346 )	2024-09-13 19:43:46 +08:00
Fangjun Kuang	6bf9310cb4	Add links to projects using sherpa-onnx. (#1345 )	2024-09-13 19:17:08 +08:00
Fangjun Kuang	211786e798	Release v1.10.25 (#1344 )	2024-09-13 14:58:38 +08:00
Fangjun Kuang	544857b097	Fix building (#1343 )	2024-09-13 13:33:52 +08:00
lxiao336	65cfa7548a	re-pull-request allow tokens and hotwords be loaded from buffered string driectly (#1339 ) Co-authored-by: xiao <shawl336@163.com>	2024-09-13 09:58:17 +08:00
Fangjun Kuang	6b6e7635ed	Fix computing features for CED audio tagging models. (#1341 ) See also https://github.com/RicherMans/CED/blob/main/onnx_inference_with_kaldi.py	2024-09-12 19:38:18 +08:00
Askars	fa20ae1552	Preserve previous result as context for next segment (#1335 ) Co-authored-by: vsd-vector <askars.salimbajevs@tilde.lv>	2024-09-11 10:44:13 +08:00
Fangjun Kuang	ba7f1a7439	Fix building (#1331 )	2024-09-09 10:29:31 +08:00
Lim Yao Chong	3bffc24d64	Add Python binding for online punctuation models (#1312 )	2024-09-09 10:26:53 +08:00
Fangjun Kuang	857cb5075c	Fix typos (#1330 )	2024-09-09 10:22:42 +08:00

1 2 3 4 5 ...

848 Commits