enginex-mr_series-sherpa-onnx

EngineX-Iluvatar/enginex-mr_series-sherpa-onnx

Archived

Author	SHA1	Message	Date
Fangjun Kuang	1ac2232e14	Support writing generated audio samples to wave files (#363 )	2023-10-13 23:36:03 +08:00
Fangjun Kuang	536d5804ba	Add TTS with VITS (#360 )	2023-10-13 19:30:38 +08:00
Fangjun Kuang	407602445d	Add CTC HLG decoding using OpenFst (#349 )	2023-10-08 11:32:39 +08:00
poor1017	c2518a5826	Supports cmake compilation compatible with v3.13. (#340 ) Co-authored-by: chenyu <cheny65@chinatelecom.cn>	2023-09-25 11:48:55 +08:00
Fangjun Kuang	532ed142d2	Support linking onnxruntime lib statically on Linux (#326 )	2023-09-21 10:15:42 +08:00
keanu	bd173b27cc	Offline decode support multi threads (#306 ) Co-authored-by: cuidongcai1035 <cuidongcai1035@wezhuiyi.com>	2023-09-19 21:04:13 +08:00
Fangjun Kuang	c471423125	Add Silero VAD (#313 )	2023-09-17 14:54:38 +08:00
Wei Kang	47184f9db7	Refactor hotwords，support loading hotwords from file (#296 )	2023-09-14 19:33:17 +08:00
Fangjun Kuang	6038e2aa62	Support streaming paraformer (#263 )	2023-08-14 10:32:14 +08:00
Fangjun Kuang	a4bff28e21	Support TDNN models from the yesno recipe from icefall (#262 )	2023-08-12 19:50:22 +08:00
Fangjun Kuang	79c2ce5dd4	Refactor online recognizer (#250 ) * Refactor online recognizer. Make it easier to support other streaming models. Note that it is a breaking change for the Python API. `sherpa_onnx.OnlineRecognizer()` used before should be replaced by `sherpa_onnx.OnlineRecognizer.from_transducer()`.	2023-08-09 20:27:31 +08:00
Fangjun Kuang	6061318e3f	fix building on linux with GPU (#249 )	2023-08-09 20:21:28 +08:00
Fangjun Kuang	45b9d4ab37	Support whisper models (#238 )	2023-08-07 12:34:18 +08:00
Fangjun Kuang	6125d9e063	Refactor onnxruntime.cmake (#220 )	2023-07-18 15:44:54 +08:00
Fangjun Kuang	bebc1f1398	Use static libraries for MFC examples (#210 )	2023-07-13 14:52:43 +08:00
danfu	1c3dac9001	support streaming zipformer2 (#185 ) Co-authored-by: danfu <danfu@tencent.com>	2023-06-26 11:09:43 +08:00
Wei Kang	8562711252	Implement context biasing with a Aho Corasick automata (#145 ) * Implement context graph * Modify the interface to support context biasing * Support context biasing in modified beam search; add python wrapper * Support context biasing in python api example * Minor fixes * Fix context graph * Minor fixes * Fix tests * Fix style * Fix style * Fix comments * Minor fixes * Add missing header * Replace std::shared_ptr with std::unique_ptr for effciency * Build graph in constructor * Fix comments * Minor fixes * Fix docs	2023-06-16 14:26:36 +08:00
Yuekai Zhang	b8fbf8e5ce	Add onnxruntime gpu for cmake (#153 ) * add onnxruntime gpu for cmake * fix clang * fix typo * cpplint	2023-05-12 22:30:47 +08:00
Fangjun Kuang	cea718e3d8	Support CoreML for macOS (#151 )	2023-05-12 15:57:44 +08:00
Jingzhao Ou	0992063de8	Stack and streaming conformer support (#141 ) * added csrc/stack.cc * stack: added checks * added copyright info * passed cpp style checks * formatted code * added some support for streaming conformer model support (not verified) * code lint * made more progress with streaming conformer support (not working yet) * passed style check * changes as suggested by @csukuangfj * added some debug info * fixed style check * Use Cat to replace Stack * remove debug statements --------- Co-authored-by: Jingzhao Ou (jou2019) <jou2019@cisco.com> Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>	2023-05-11 14:30:39 +08:00
PF Luo	8c6a6768d5	Add lm rescore to online-modified-beam-search (#133 )	2023-05-05 21:23:54 +08:00
Fangjun Kuang	86017f9833	Add RNN LM rescore for offline ASR with modified_beam_search (#125 )	2023-04-23 17:15:18 +08:00
Fangjun Kuang	80060c276d	Begin to support CTC models (#119 ) Please see https://k2-fsa.github.io/sherpa/onnx/pretrained_models/offline-ctc/nemo/index.html for a list of pre-trained CTC models from NeMo.	2023-04-07 23:11:34 +08:00
Fangjun Kuang	726680c5e0	Install binaries via pip install (#112 ) When pepole use pip install sherpa-onnx, they also get the following binaries: (py38) fangjuns-MacBook-Pro:bin fangjun$ ls -lh sherpa-onnx* -rwxr-xr-x 1 fangjun staff 36K Apr 4 13:48 sherpa-onnx -rwxr-xr-x 1 fangjun staff 52K Apr 4 13:48 sherpa-onnx-microphone -rwxr-xr-x 1 fangjun staff 54K Apr 4 13:48 sherpa-onnx-microphone-offline -rwxr-xr-x 1 fangjun staff 37K Apr 4 13:48 sherpa-onnx-offline -rwxr-xr-x 1 fangjun staff 634K Apr 4 13:48 sherpa-onnx-offline-websocket-server -rwxr-xr-x 1 fangjun staff 710K Apr 4 13:48 sherpa-onnx-online-websocket-client -rwxr-xr-x 1 fangjun staff 651K Apr 4 13:48 sherpa-onnx-online-websocket-server (py38) fangjuns-MacBook-Pro:bin fangjun$ pwd /Users/fangjun/py38/bin	2023-04-04 15:45:59 +08:00
Fangjun Kuang	b911915a32	Add microphone support for offline recognizer (#104 )	2023-03-30 19:43:05 +08:00
Fangjun Kuang	6707ec4124	add offline websocket server/client (#98 )	2023-03-29 21:48:45 +08:00
Fangjun Kuang	423d89e9a5	Support paraformer. (#95 )	2023-03-28 17:59:54 +08:00
Fangjun Kuang	dffb0fd43c	Refactor offline recognizer. (#94 ) * Refactor offline recognizer. The purpose is to make it easier to support different types of models.	2023-03-27 14:59:40 +08:00
Fangjun Kuang	5572246253	Add non-streaming ASR (#92 )	2023-03-26 08:53:42 +08:00
Fangjun Kuang	6f92bc7362	Fix issues about DLLs when installing pre-compiled wheels on windows (#91 )	2023-03-18 13:55:59 +08:00
Fangjun Kuang	8c6a289e3d	Add PackPaddedSequence (#85 )	2023-03-08 14:12:20 +08:00
Fangjun Kuang	da5c80cc74	add pad_sequence (#84 )	2023-03-05 22:02:50 +08:00
Fangjun Kuang	7cae710751	Support slicing a shallow copy of a 3-d tensor (#83 )	2023-03-05 19:18:31 +08:00
Fangjun Kuang	422221b3b4	Add transpose (#82 )	2023-03-05 11:27:17 +08:00
Fangjun Kuang	7f72c13d9a	Code refactoring (#74 ) * Don't reset model state and feature extractor on endpointing * support passing decoding_method from commandline * Add modified_beam_search to Python API * fix C API example * Fix style issues	2023-03-03 12:10:59 +08:00
PF Luo	5326d0f81f	add modified beam search (#69 )	2023-03-01 15:32:54 +08:00
Fangjun Kuang	40522f037b	add streaming websocket server and client (#62 )	2023-02-24 21:39:51 +08:00
Fangjun Kuang	5a5d029490	Add build script for Android armv8a (#58 )	2023-02-22 22:36:05 +08:00
Fangjun Kuang	a65dcf77b3	add portaudio for reading microphones (#55 )	2023-02-22 17:13:07 +08:00
Fangjun Kuang	124384369a	Add endpointing (#54 )	2023-02-22 15:35:55 +08:00
Fangjun Kuang	d1994f1fd8	Support cross compiling for aarch64 (#52 )	2023-02-21 22:04:21 +08:00
Fangjun Kuang	3ea6aa949d	Add Streaming zipformer (#50 )	2023-02-21 20:00:03 +08:00
Fangjun Kuang	ea09d5fbc5	Add Python API (#31 )	2023-02-19 19:36:03 +08:00
Fangjun Kuang	ebc3b47fb8	add online-recognizer (#29 )	2023-02-19 12:45:38 +08:00
Fangjun Kuang	d4b0c0590a	Add online stream. (#28 )	2023-02-19 11:42:15 +08:00
Fangjun Kuang	0f6f58d1d3	Add online transducer decoder (#27 )	2023-02-19 10:39:07 +08:00
Fangjun Kuang	cb8f85ff83	Add online LSTM transducer model (#25 )	2023-02-18 21:35:15 +08:00
Fangjun Kuang	f5f90a0742	Release v1.0 (#18 ) * Release v1.0 * Add link to doc	2022-10-14 12:13:51 +08:00
Fangjun Kuang	8ee02c28b0	Refactor the code (#15 ) * code refactoring * Remove reference files * Update README and CI * small fixes * fix style issues * add style check for CI * fix style issues * remove kaldi-native-io	2022-10-12 21:35:07 +08:00
Fangjun Kuang	77ccd625b8	code refactoring and add CI (#11 )	2022-10-12 11:27:05 +08:00

1 2

53 Commits