enginex_bi_series-sherpa-onnx

EngineX-Iluvatar/enginex_bi_series-sherpa-onnx

Archived

Author	SHA1	Message	Date
Fangjun Kuang	9efe69720d	Support VITS VCTK models (#367 ) * Support VITS VCTK models * Release v1.8.1	2023-10-16 17:22:30 +08:00
Fangjun Kuang	655e0fa836	add python API and examples for TTS (#364 )	2023-10-14 14:21:53 +08:00
Peng He	4771c9275c	Add lm decode for the Python API. (#353 ) * Add lm decode for the Python API. * fix style. * Fix LogAdd, Shouldn't double lm_log_prob when merge same prefix path * sort the import alphabetically	2023-10-13 11:15:16 +08:00
Fangjun Kuang	be081017de	Fix typos/bugs (#351 )	2023-10-08 11:39:59 +08:00
Fangjun Kuang	36017d49c4	add a comment about how to download silero_vad.onnx (#346 )	2023-09-26 17:58:53 +08:00
Fangjun Kuang	969fff5622	Add VAD + Non-streaming ASR Python example. (#332 )	2023-09-22 11:53:47 +08:00
Fangjun Kuang	2d51ca49b7	Generate subtitles (#315 )	2023-09-18 10:44:06 +08:00
Fangjun Kuang	c471423125	Add Silero VAD (#313 )	2023-09-17 14:54:38 +08:00
Wei Kang	47184f9db7	Refactor hotwords，support loading hotwords from file (#296 )	2023-09-14 19:33:17 +08:00
Fangjun Kuang	8982984ea2	add a two-pass python example (#303 )	2023-09-10 17:56:13 +08:00
Fangjun Kuang	f709c95c5f	Support multilingual whisper models (#274 )	2023-08-16 00:28:52 +08:00
Fangjun Kuang	313debe45c	small fixes to python api examples (#269 )	2023-08-14 20:53:36 +08:00
Fangjun Kuang	6038e2aa62	Support streaming paraformer (#263 )	2023-08-14 10:32:14 +08:00
Fangjun Kuang	a4bff28e21	Support TDNN models from the yesno recipe from icefall (#262 )	2023-08-12 19:50:22 +08:00
Fangjun Kuang	b094868fb8	Add non-streaming websocket server for python (#259 )	2023-08-11 15:56:24 +08:00
Fangjun Kuang	79c2ce5dd4	Refactor online recognizer (#250 ) * Refactor online recognizer. Make it easier to support other streaming models. Note that it is a breaking change for the Python API. `sherpa_onnx.OnlineRecognizer()` used before should be replaced by `sherpa_onnx.OnlineRecognizer.from_transducer()`.	2023-08-09 20:27:31 +08:00
Fangjun Kuang	aeb112dd06	Support specifying provider for python examples (#244 )	2023-08-09 10:00:34 +08:00
Fangjun Kuang	45b9d4ab37	Support whisper models (#238 )	2023-08-07 12:34:18 +08:00
frankyoujian	801693a4d4	Support real time hotwords on python (#230 ) * support real time hotwords on python * fix comments	2023-08-03 15:50:11 +08:00
Fangjun Kuang	1f02f7c349	Support recognition from URLs. (#194 )	2023-07-04 10:16:11 +08:00
Wei Kang	513dfaa552	Support contextual-biasing for streaming model (#184 ) * Support contextual-biasing for streaming model * The whole pipeline runs normally * Fix comments	2023-06-30 16:46:24 +08:00
fx	81579bbddd	fix numpy bug (#181 )	2023-06-20 20:55:47 +08:00
Wei Kang	8562711252	Implement context biasing with a Aho Corasick automata (#145 ) * Implement context graph * Modify the interface to support context biasing * Support context biasing in modified beam search; add python wrapper * Support context biasing in python api example * Minor fixes * Fix context graph * Minor fixes * Fix tests * Fix style * Fix style * Fix comments * Minor fixes * Add missing header * Replace std::shared_ptr with std::unique_ptr for effciency * Build graph in constructor * Fix comments * Minor fixes * Fix docs	2023-06-16 14:26:36 +08:00
Fangjun Kuang	5e2dc5ceea	add streaming-server with web client (#164 ) * add streaming-server with web client * small fixes	2023-05-30 22:46:52 +08:00
Fangjun Kuang	80060c276d	Begin to support CTC models (#119 ) Please see https://k2-fsa.github.io/sherpa/onnx/pretrained_models/offline-ctc/nemo/index.html for a list of pre-trained CTC models from NeMo.	2023-04-07 23:11:34 +08:00
Fangjun Kuang	5d3c8edbc9	add python tests (#111 )	2023-04-02 23:05:30 +08:00
manyeyes	3f7e0c23ac	adding a python api for offline decode (#110 )	2023-04-02 13:17:43 +08:00
eee	94d77fa52e	remove sherpa_onnx.Display (#109 ) * fix garbled console output with chinese characters * use print to instead sherpa_onnx.Display	2023-04-01 18:14:33 +08:00
eee	c0620a1fe1	fix garbled console output with chinese characters (#108 )	2023-03-31 22:26:47 +08:00
Fangjun Kuang	6707ec4124	add offline websocket server/client (#98 )	2023-03-29 21:48:45 +08:00
Fangjun Kuang	5572246253	Add non-streaming ASR (#92 )	2023-03-26 08:53:42 +08:00
Fangjun Kuang	355c5ef541	fix typos in comments (#90 )	2023-03-18 10:44:10 +08:00
manyeyes	2f9cd1007e	add "import sys", (#89 )	2023-03-16 10:49:37 +08:00
Fangjun Kuang	9d8fddef01	Support resampling (#77 )	2023-03-03 16:42:33 +08:00
Fangjun Kuang	7f72c13d9a	Code refactoring (#74 ) * Don't reset model state and feature extractor on endpointing * support passing decoding_method from commandline * Add modified_beam_search to Python API * fix C API example * Fix style issues	2023-03-03 12:10:59 +08:00
Fangjun Kuang	343e732ccb	Refactor python examples (#67 )	2023-02-26 20:33:16 +08:00
Fangjun Kuang	fb1e24bebb	Fix endpointing with microphone (#64 )	2023-02-25 14:30:44 +08:00
Fangjun Kuang	e4b79ad34b	Add Python websocket client (#63 )	2023-02-24 22:46:30 +08:00
Fangjun Kuang	124384369a	Add endpointing (#54 )	2023-02-22 15:35:55 +08:00
Yifan Yang	7ece27cd30	Add python-api-examples: speech-recognition-from-microphone.py (#46 )	2023-02-20 19:44:12 +08:00
Fangjun Kuang	ea09d5fbc5	Add Python API (#31 )	2023-02-19 19:36:03 +08:00

41 Commits