Wei Kang
734bbd91dc
Add Python API for keyword spotting ( #576 )
...
* Add alsa & microphone support for keyword spotting
* Add python wrapper
2024-03-01 09:31:11 +08:00
chiiyeh
e7b18a2139
add blank_penalty for online transducer ( #548 )
2024-01-26 12:12:13 +08:00
chiiyeh
3bb3849ec5
add blank_penalty for offline transducer ( #542 )
2024-01-25 15:00:09 +08:00
Fangjun Kuang
59e28518b4
Add Python API examples for speaker recognition with VAD and ASR. ( #532 )
2024-01-15 21:40:30 +08:00
Fangjun Kuang
68a525a024
Export speaker verification models from NeMo to ONNX ( #526 )
2024-01-13 19:49:45 +08:00
Fangjun Kuang
55266918c8
Add runtime support for wespeaker models ( #516 )
2024-01-09 22:06:08 +08:00
Fangjun Kuang
547a22f7d9
Fix #510 ( #513 )
2024-01-04 12:32:19 +08:00
Fangjun Kuang
e475e750ac
Support streaming zipformer CTC ( #496 )
...
* Support streaming zipformer CTC
* test online zipformer2 CTC
* Update doc of sherpa-onnx.cc
* Add Python APIs for streaming zipformer2 ctc
* Add Python API examples for streaming zipformer2 ctc
* Swift API for streaming zipformer2 CTC
* NodeJS API for streaming zipformer2 CTC
* Kotlin API for streaming zipformer2 CTC
* Golang API for streaming zipformer2 CTC
* C# API for streaming zipformer2 CTC
* Release v1.9.6
2023-12-22 13:46:33 +08:00
Fangjun Kuang
0e23f82691
Give an informative log for whisper on exceptions. ( #473 )
2023-12-08 14:33:59 +08:00
Fangjun Kuang
23cf92daf7
Use espeak-ng for coqui-ai/TTS VITS English models. ( #466 )
2023-12-06 11:00:38 +08:00
Fangjun Kuang
99ff6a834c
Play generated audio as it is generating. ( #457 )
2023-12-02 15:35:11 +08:00
Fangjun Kuang
62dc3c3e46
Use piper-phonemize to convert text to token IDs ( #453 )
2023-11-30 23:57:43 +08:00
Fangjun Kuang
87a47d7db4
Release GIL to support multithreading in websocket servers. ( #451 )
2023-11-27 13:44:03 +08:00
Fangjun Kuang
049fb9f451
Add Python APIs for WeNet CTC models ( #428 )
2023-11-16 14:20:41 +08:00
longshiming
10d6dba187
add --tts-rule-fsts argument at offline-tts.py ( #413 )
...
Co-authored-by: longshiming <longshiming@greesoft.com >
2023-11-07 14:18:18 +08:00
Fangjun Kuang
1249710e1d
support specifying speed for tts Python APIs ( #384 )
2023-10-24 21:38:58 +08:00
Fangjun Kuang
8545c3b7f0
Validate input sid ( #369 )
2023-10-18 14:02:01 +08:00
Fangjun Kuang
1ee79e3ff5
Support Chinese vits models ( #368 )
2023-10-18 10:19:10 +08:00
Fangjun Kuang
9efe69720d
Support VITS VCTK models ( #367 )
...
* Support VITS VCTK models
* Release v1.8.1
2023-10-16 17:22:30 +08:00
Fangjun Kuang
655e0fa836
add python API and examples for TTS ( #364 )
2023-10-14 14:21:53 +08:00
Peng He
4771c9275c
Add lm decode for the Python API. ( #353 )
...
* Add lm decode for the Python API.
* fix style.
* Fix LogAdd,
Shouldn't double lm_log_prob when merge same prefix path
* sort the import alphabetically
2023-10-13 11:15:16 +08:00
Fangjun Kuang
be081017de
Fix typos/bugs ( #351 )
2023-10-08 11:39:59 +08:00
Fangjun Kuang
36017d49c4
add a comment about how to download silero_vad.onnx ( #346 )
2023-09-26 17:58:53 +08:00
Fangjun Kuang
969fff5622
Add VAD + Non-streaming ASR Python example. ( #332 )
2023-09-22 11:53:47 +08:00
Fangjun Kuang
2d51ca49b7
Generate subtitles ( #315 )
2023-09-18 10:44:06 +08:00
Fangjun Kuang
c471423125
Add Silero VAD ( #313 )
2023-09-17 14:54:38 +08:00
Wei Kang
47184f9db7
Refactor hotwords,support loading hotwords from file ( #296 )
2023-09-14 19:33:17 +08:00
Fangjun Kuang
8982984ea2
add a two-pass python example ( #303 )
2023-09-10 17:56:13 +08:00
Fangjun Kuang
f709c95c5f
Support multilingual whisper models ( #274 )
2023-08-16 00:28:52 +08:00
Fangjun Kuang
313debe45c
small fixes to python api examples ( #269 )
2023-08-14 20:53:36 +08:00
Fangjun Kuang
6038e2aa62
Support streaming paraformer ( #263 )
2023-08-14 10:32:14 +08:00
Fangjun Kuang
a4bff28e21
Support TDNN models from the yesno recipe from icefall ( #262 )
2023-08-12 19:50:22 +08:00
Fangjun Kuang
b094868fb8
Add non-streaming websocket server for python ( #259 )
2023-08-11 15:56:24 +08:00
Fangjun Kuang
79c2ce5dd4
Refactor online recognizer ( #250 )
...
* Refactor online recognizer.
Make it easier to support other streaming models.
Note that it is a breaking change for the Python API.
`sherpa_onnx.OnlineRecognizer()` used before should be
replaced by `sherpa_onnx.OnlineRecognizer.from_transducer()`.
2023-08-09 20:27:31 +08:00
Fangjun Kuang
aeb112dd06
Support specifying provider for python examples ( #244 )
2023-08-09 10:00:34 +08:00
Fangjun Kuang
45b9d4ab37
Support whisper models ( #238 )
2023-08-07 12:34:18 +08:00
frankyoujian
801693a4d4
Support real time hotwords on python ( #230 )
...
* support real time hotwords on python
* fix comments
2023-08-03 15:50:11 +08:00
Fangjun Kuang
1f02f7c349
Support recognition from URLs. ( #194 )
2023-07-04 10:16:11 +08:00
Wei Kang
513dfaa552
Support contextual-biasing for streaming model ( #184 )
...
* Support contextual-biasing for streaming model
* The whole pipeline runs normally
* Fix comments
2023-06-30 16:46:24 +08:00
fx
81579bbddd
fix numpy bug ( #181 )
2023-06-20 20:55:47 +08:00
Wei Kang
8562711252
Implement context biasing with a Aho Corasick automata ( #145 )
...
* Implement context graph
* Modify the interface to support context biasing
* Support context biasing in modified beam search; add python wrapper
* Support context biasing in python api example
* Minor fixes
* Fix context graph
* Minor fixes
* Fix tests
* Fix style
* Fix style
* Fix comments
* Minor fixes
* Add missing header
* Replace std::shared_ptr with std::unique_ptr for effciency
* Build graph in constructor
* Fix comments
* Minor fixes
* Fix docs
2023-06-16 14:26:36 +08:00
Fangjun Kuang
5e2dc5ceea
add streaming-server with web client ( #164 )
...
* add streaming-server with web client
* small fixes
2023-05-30 22:46:52 +08:00
Fangjun Kuang
80060c276d
Begin to support CTC models ( #119 )
...
Please see https://k2-fsa.github.io/sherpa/onnx/pretrained_models/offline-ctc/nemo/index.html for a list of pre-trained CTC models from NeMo.
2023-04-07 23:11:34 +08:00
Fangjun Kuang
5d3c8edbc9
add python tests ( #111 )
2023-04-02 23:05:30 +08:00
manyeyes
3f7e0c23ac
adding a python api for offline decode ( #110 )
2023-04-02 13:17:43 +08:00
eee
94d77fa52e
remove sherpa_onnx.Display ( #109 )
...
* fix garbled console output with chinese characters
* use print to instead sherpa_onnx.Display
2023-04-01 18:14:33 +08:00
eee
c0620a1fe1
fix garbled console output with chinese characters ( #108 )
2023-03-31 22:26:47 +08:00
Fangjun Kuang
6707ec4124
add offline websocket server/client ( #98 )
2023-03-29 21:48:45 +08:00
Fangjun Kuang
5572246253
Add non-streaming ASR ( #92 )
2023-03-26 08:53:42 +08:00
Fangjun Kuang
355c5ef541
fix typos in comments ( #90 )
2023-03-18 10:44:10 +08:00