41 Commits

Author SHA1 Message Date
Fangjun Kuang
9efe69720d Support VITS VCTK models (#367)
* Support VITS VCTK models

* Release v1.8.1
2023-10-16 17:22:30 +08:00
Fangjun Kuang
655e0fa836 add python API and examples for TTS (#364) 2023-10-14 14:21:53 +08:00
Peng He
4771c9275c Add lm decode for the Python API. (#353)
* Add lm decode for the Python API.

* fix style.

* Fix LogAdd: lm_log_prob should not be doubled when merging paths with the same prefix

* sort the import alphabetically
2023-10-13 11:15:16 +08:00
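As a rough illustration of the LogAdd fix noted in the commit above, the sketch below shows one way two beam-search paths with the same token prefix might be merged: their path scores are combined in the log domain, while the LM score is kept once rather than added again. The dictionary fields (`tokens`, `log_prob`, `lm_log_prob`) and the function names are hypothetical and are not taken from the project's source.

```python
import math


def log_add(a: float, b: float) -> float:
    """Return log(exp(a) + exp(b)) in a numerically stable way."""
    if a == -math.inf:
        return b
    if b == -math.inf:
        return a
    hi, lo = max(a, b), min(a, b)
    return hi + math.log1p(math.exp(lo - hi))


def merge_same_prefix(kept: dict, dup: dict) -> dict:
    """Merge two hypotheses that share the same token prefix.

    Assumption: each hypothesis is a dict with hypothetical keys
    'tokens', 'log_prob' and 'lm_log_prob'.
    """
    return {
        "tokens": kept["tokens"],
        # Path probabilities of identical prefixes are summed in log space.
        "log_prob": log_add(kept["log_prob"], dup["log_prob"]),
        # The LM score belongs to the prefix itself, so it is kept once.
        "lm_log_prob": kept["lm_log_prob"],
    }
```

With two paths of probability 0.25 each, the merged `log_prob` corresponds to a probability of 0.5, and `lm_log_prob` is unchanged.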
Fangjun Kuang
be081017de Fix typos/bugs (#351) 2023-10-08 11:39:59 +08:00
Fangjun Kuang
36017d49c4 add a comment about how to download silero_vad.onnx (#346) 2023-09-26 17:58:53 +08:00
Fangjun Kuang
969fff5622 Add VAD + Non-streaming ASR Python example. (#332) 2023-09-22 11:53:47 +08:00
Fangjun Kuang
2d51ca49b7 Generate subtitles (#315) 2023-09-18 10:44:06 +08:00
Fangjun Kuang
c471423125 Add Silero VAD (#313) 2023-09-17 14:54:38 +08:00
Wei Kang
47184f9db7 Refactor hotwords, support loading hotwords from file (#296) 2023-09-14 19:33:17 +08:00
Fangjun Kuang
8982984ea2 add a two-pass python example (#303) 2023-09-10 17:56:13 +08:00
Fangjun Kuang
f709c95c5f Support multilingual whisper models (#274) 2023-08-16 00:28:52 +08:00
Fangjun Kuang
313debe45c small fixes to python api examples (#269) 2023-08-14 20:53:36 +08:00
Fangjun Kuang
6038e2aa62 Support streaming paraformer (#263) 2023-08-14 10:32:14 +08:00
Fangjun Kuang
a4bff28e21 Support TDNN models from the yesno recipe from icefall (#262) 2023-08-12 19:50:22 +08:00
Fangjun Kuang
b094868fb8 Add non-streaming websocket server for python (#259) 2023-08-11 15:56:24 +08:00
Fangjun Kuang
79c2ce5dd4 Refactor online recognizer (#250)
* Refactor online recognizer.

Make it easier to support other streaming models.

Note that it is a breaking change for the Python API.
`sherpa_onnx.OnlineRecognizer()` used before should be
replaced by `sherpa_onnx.OnlineRecognizer.from_transducer()`.
2023-08-09 20:27:31 +08:00
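For the breaking change described in the commit above, a migration might look roughly like the sketch below. It only assumes that `sherpa_onnx.OnlineRecognizer.from_transducer()` accepts `tokens`, `encoder`, `decoder` and `joiner` keyword arguments. The file names inside the model directory and both helper functions are hypothetical, so check them against the installed version before relying on them.

```python
from pathlib import Path


def transducer_kwargs(model_dir: str) -> dict:
    """Collect model paths as keyword arguments.

    Assumption: the file names below are placeholders; real model
    directories usually use longer, model-specific names.
    """
    d = Path(model_dir)
    return {
        "tokens": str(d / "tokens.txt"),
        "encoder": str(d / "encoder.onnx"),
        "decoder": str(d / "decoder.onnx"),
        "joiner": str(d / "joiner.onnx"),
    }


def build_recognizer(model_dir: str):
    """Create a streaming recognizer through the factory method.

    Before the refactoring, callers used sherpa_onnx.OnlineRecognizer(...)
    directly; afterwards the from_transducer() factory is used instead.
    """
    import sherpa_onnx  # imported lazily so the helper above works without it

    return sherpa_onnx.OnlineRecognizer.from_transducer(
        **transducer_kwargs(model_dir)
    )
```

Keeping the path handling separate from the factory call means only `build_recognizer` needs the package and the model files at run time.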
Fangjun Kuang
aeb112dd06 Support specifying provider for python examples (#244) 2023-08-09 10:00:34 +08:00
Fangjun Kuang
45b9d4ab37 Support whisper models (#238) 2023-08-07 12:34:18 +08:00
frankyoujian
801693a4d4 Support real time hotwords on python (#230)
* support real time hotwords on python

* fix comments
2023-08-03 15:50:11 +08:00
Fangjun Kuang
1f02f7c349 Support recognition from URLs. (#194) 2023-07-04 10:16:11 +08:00
Wei Kang
513dfaa552 Support contextual-biasing for streaming model (#184)
* Support contextual-biasing for streaming model

* The whole pipeline runs normally

* Fix comments
2023-06-30 16:46:24 +08:00
fx
81579bbddd fix numpy bug (#181) 2023-06-20 20:55:47 +08:00
Wei Kang
8562711252 Implement context biasing with an Aho-Corasick automaton (#145)
* Implement context graph

* Modify the interface to support context biasing

* Support context biasing in modified beam search; add python wrapper

* Support context biasing in python api example

* Minor fixes

* Fix context graph

* Minor fixes

* Fix tests

* Fix style

* Fix style

* Fix comments

* Minor fixes

* Add missing header

* Replace std::shared_ptr with std::unique_ptr for efficiency

* Build graph in constructor

* Fix comments

* Minor fixes

* Fix docs
2023-06-16 14:26:36 +08:00
Fangjun Kuang
5e2dc5ceea add streaming-server with web client (#164)
* add streaming-server with web client

* small fixes
2023-05-30 22:46:52 +08:00
Fangjun Kuang
80060c276d Begin to support CTC models (#119)
Please see https://k2-fsa.github.io/sherpa/onnx/pretrained_models/offline-ctc/nemo/index.html for a list of pre-trained CTC models from NeMo.
2023-04-07 23:11:34 +08:00
Fangjun Kuang
5d3c8edbc9 add python tests (#111) 2023-04-02 23:05:30 +08:00
manyeyes
3f7e0c23ac adding a python api for offline decode (#110) 2023-04-02 13:17:43 +08:00
eee
94d77fa52e remove sherpa_onnx.Display (#109)
* fix garbled console output with chinese characters

* use print instead of sherpa_onnx.Display
2023-04-01 18:14:33 +08:00
eee
c0620a1fe1 fix garbled console output with chinese characters (#108) 2023-03-31 22:26:47 +08:00
Fangjun Kuang
6707ec4124 add offline websocket server/client (#98) 2023-03-29 21:48:45 +08:00
Fangjun Kuang
5572246253 Add non-streaming ASR (#92) 2023-03-26 08:53:42 +08:00
Fangjun Kuang
355c5ef541 fix typos in comments (#90) 2023-03-18 10:44:10 +08:00
manyeyes
2f9cd1007e add "import sys" (#89) 2023-03-16 10:49:37 +08:00
Fangjun Kuang
9d8fddef01 Support resampling (#77) 2023-03-03 16:42:33 +08:00
Fangjun Kuang
7f72c13d9a Code refactoring (#74)
* Don't reset model state and feature extractor on endpointing

* support passing decoding_method from the command line

* Add modified_beam_search to Python API

* fix C API example

* Fix style issues
2023-03-03 12:10:59 +08:00
Fangjun Kuang
343e732ccb Refactor python examples (#67) 2023-02-26 20:33:16 +08:00
Fangjun Kuang
fb1e24bebb Fix endpointing with microphone (#64) 2023-02-25 14:30:44 +08:00
Fangjun Kuang
e4b79ad34b Add Python websocket client (#63) 2023-02-24 22:46:30 +08:00
Fangjun Kuang
124384369a Add endpointing (#54) 2023-02-22 15:35:55 +08:00
Yifan Yang
7ece27cd30 Add python-api-examples: speech-recognition-from-microphone.py (#46) 2023-02-20 19:44:12 +08:00
Fangjun Kuang
ea09d5fbc5 Add Python API (#31) 2023-02-19 19:36:03 +08:00