enginex-mr_series-sherpa-onnx

EngineX-Iluvatar/enginex-mr_series-sherpa-onnx

Archived

Author	SHA1	Message	Date
Wei Kang	734bbd91dc	Add Python API for keyword spotting (#576 ) * Add alsa & microphone support for keyword spotting * Add python wrapper	2024-03-01 09:31:11 +08:00
Wei Kang	2ff1049079	change modelscope link to github for build-kws-apki (#540 )	2024-01-24 16:40:14 +08:00
Wei Kang	626775e5e2	Change model url from modelscope to github (#538 )	2024-01-23 10:15:58 +08:00
Wei Kang	b6c020901a	decoder for open vocabulary keyword spotting (#505 ) * various fixes to ContextGraph to support open vocabulary keywords decoder * Add keyword spotter runtime * Add binary * First version works * Minor fixes * update text2token * default values * Add jni for kws * add kws android project * Minor fixes * Remove unused interface * Minor fixes * Add workflow * handle extra info in texts * Minor fixes * Add more comments * Fix ci * fix cpp style * Add input box in android demo so that users can specify their keywords * Fix cpp style * Fix comments * Minor fixes * Minor fixes * minor fixes * Minor fixes * Minor fixes * Add CI * Fix code style * cpplint * Fix comments * Fix error	2024-01-20 22:52:41 +08:00
Fangjun Kuang	2024e96639	Add C++ runtime for speaker verification models from NeMo (#527 )	2024-01-13 21:42:09 +08:00
Fangjun Kuang	68a525a024	Export speaker verification models from NeMo to ONNX (#526 )	2024-01-13 19:49:45 +08:00
Fangjun Kuang	afc81ec122	Add C++ runtime for models from 3d-speaker (#523 )	2024-01-11 19:10:30 +08:00
Fangjun Kuang	e475e750ac	Support streaming zipformer CTC (#496 ) * Support streaming zipformer CTC * test online zipformer2 CTC * Update doc of sherpa-onnx.cc * Add Python APIs for streaming zipformer2 ctc * Add Python API examples for streaming zipformer2 ctc * Swift API for streaming zipformer2 CTC * NodeJS API for streaming zipformer2 CTC * Kotlin API for streaming zipformer2 CTC * Golang API for streaming zipformer2 CTC * C# API for streaming zipformer2 CTC * Release v1.9.6	2023-12-22 13:46:33 +08:00
Fangjun Kuang	868c339e5e	Support distil-small.en whisper (#472 )	2023-12-08 11:59:20 +08:00
Fangjun Kuang	3ae984f148	Remove the 30-second constraint from whisper. (#471 )	2023-12-07 17:47:08 +08:00
Fangjun Kuang	62dc3c3e46	Use piper-phonemize to convert text to token IDs (#453 )	2023-11-30 23:57:43 +08:00
Fangjun Kuang	db41778e99	Support piper-phonemize (#452 )	2023-11-28 19:12:58 +08:00
Fangjun Kuang	8dc08a9b97	Fix nodejs on Windows (#450 )	2023-11-25 21:23:15 +08:00
Fangjun Kuang	2f22e6ed63	Add Swift API for TTS (#439 )	2023-11-22 16:04:26 +08:00
Fangjun Kuang	fe977b8e8e	support nodejs (#438 )	2023-11-21 23:20:08 +08:00
Fangjun Kuang	049fb9f451	Add Python APIs for WeNet CTC models (#428 )	2023-11-16 14:20:41 +08:00
Fangjun Kuang	fac4f6bc7c	Support streaming conformer CTC models from wenet (#427 )	2023-11-16 10:35:23 +08:00
Fangjun Kuang	b83b3e3cd1	Support non-streaming WeNet CTC models. (#426 )	2023-11-15 14:23:20 +08:00
Fangjun Kuang	ea7c45b60c	Add C API for offline TTS. (#373 )	2023-10-19 17:38:23 +08:00
Fangjun Kuang	1ee79e3ff5	Support Chinese vits models (#368 )	2023-10-18 10:19:10 +08:00
Fangjun Kuang	655e0fa836	add python API and examples for TTS (#364 )	2023-10-14 14:21:53 +08:00
Fangjun Kuang	407602445d	Add CTC HLG decoding using OpenFst (#349 )	2023-10-08 11:32:39 +08:00
Fangjun Kuang	e2be532b32	Add timestamps for offline paraformer (#310 )	2023-09-14 19:33:41 +08:00
Wei Kang	47184f9db7	Refactor hotwords，support loading hotwords from file (#296 )	2023-09-14 19:33:17 +08:00
Fangjun Kuang	6038e2aa62	Support streaming paraformer (#263 )	2023-08-14 10:32:14 +08:00
Fangjun Kuang	a4bff28e21	Support TDNN models from the yesno recipe from icefall (#262 )	2023-08-12 19:50:22 +08:00
Fangjun Kuang	f7c05b1570	Add CI test for Whisper models (#239 )	2023-08-07 19:24:52 +08:00
Jingzhao Ou	0ed501b8f1	Added provider option to sherpa-onnx and decode-file-c-api (#162 )	2023-06-03 04:57:48 +08:00
Jingzhao Ou	de1880948b	Added streaming conformer test (#150 )	2023-05-12 08:37:24 +08:00
Fangjun Kuang	4024bfab32	Add examples for Kotlin API (#124 )	2023-04-19 17:29:35 +08:00
Fangjun Kuang	80060c276d	Begin to support CTC models (#119 ) Please see https://k2-fsa.github.io/sherpa/onnx/pretrained_models/offline-ctc/nemo/index.html for a list of pre-trained CTC models from NeMo.	2023-04-07 23:11:34 +08:00
Fangjun Kuang	5d3c8edbc9	add python tests (#111 )	2023-04-02 23:05:30 +08:00
Fangjun Kuang	f9f9f20d37	Test int8 models (#107 ) * Test int8 models * Fix displaying help messages * small fixes * Fix jni test	2023-03-31 22:34:10 +08:00
Fangjun Kuang	f69b4e9396	Add tests for zipformer (#106 )	2023-03-31 04:47:20 +08:00
Fangjun Kuang	6707ec4124	add offline websocket server/client (#98 )	2023-03-29 21:48:45 +08:00
Fangjun Kuang	423d89e9a5	Support paraformer. (#95 )	2023-03-28 17:59:54 +08:00
Fangjun Kuang	5572246253	Add non-streaming ASR (#92 )	2023-03-26 08:53:42 +08:00
Fangjun Kuang	5f31b22c12	Fix modified beam search for iOS and android (#76 ) * Use Int type for sampling rate * Fix swift * Fix iOS	2023-03-03 15:18:31 +08:00
Fangjun Kuang	343e732ccb	Refactor python examples (#67 )	2023-02-26 20:33:16 +08:00
Fangjun Kuang	ce4dd176e4	Add ffmpeg example (#61 )	2023-02-24 17:44:36 +08:00
Fangjun Kuang	9064b3f016	Support Android (#59 )	2023-02-24 13:57:03 +08:00
Fangjun Kuang	ef93dcd733	Add JNI (#57 )	2023-02-22 21:14:57 +08:00
Fangjun Kuang	f9010adc3c	Test streaming Zipformer transducer in CI (#56 )	2023-02-22 17:31:08 +08:00
Fangjun Kuang	ea09d5fbc5	Add Python API (#31 )	2023-02-19 19:36:03 +08:00
Fangjun Kuang	cb8f85ff83	Add online LSTM transducer model (#25 )	2023-02-18 21:35:15 +08:00

45 Commits