Fangjun Kuang
1ac2232e14
Support writing generated audio samples to wave files ( #363 )
2023-10-13 23:36:03 +08:00
Fangjun Kuang
536d5804ba
Add TTS with VITS ( #360 )
2023-10-13 19:30:38 +08:00
Fangjun Kuang
407602445d
Add CTC HLG decoding using OpenFst ( #349 )
2023-10-08 11:32:39 +08:00
poor1017
c2518a5826
Support CMake compilation compatible with v3.13 ( #340 )
...
Co-authored-by: chenyu <cheny65@chinatelecom.cn>
2023-09-25 11:48:55 +08:00
Fangjun Kuang
532ed142d2
Support linking onnxruntime lib statically on Linux ( #326 )
2023-09-21 10:15:42 +08:00
keanu
bd173b27cc
Support multiple threads for offline decoding ( #306 )
...
Co-authored-by: cuidongcai1035 <cuidongcai1035@wezhuiyi.com>
2023-09-19 21:04:13 +08:00
Fangjun Kuang
c471423125
Add Silero VAD ( #313 )
2023-09-17 14:54:38 +08:00
Wei Kang
47184f9db7
Refactor hotwords, support loading hotwords from file ( #296 )
2023-09-14 19:33:17 +08:00
Fangjun Kuang
6038e2aa62
Support streaming paraformer ( #263 )
2023-08-14 10:32:14 +08:00
Fangjun Kuang
a4bff28e21
Support TDNN models from the yesno recipe from icefall ( #262 )
2023-08-12 19:50:22 +08:00
Fangjun Kuang
79c2ce5dd4
Refactor online recognizer ( #250 )
...
* Refactor online recognizer.
Make it easier to support other streaming models.
Note that it is a breaking change for the Python API.
`sherpa_onnx.OnlineRecognizer()` used before should be
replaced by `sherpa_onnx.OnlineRecognizer.from_transducer()`.
2023-08-09 20:27:31 +08:00
Fangjun Kuang
6061318e3f
fix building on linux with GPU ( #249 )
2023-08-09 20:21:28 +08:00
Fangjun Kuang
45b9d4ab37
Support whisper models ( #238 )
2023-08-07 12:34:18 +08:00
Fangjun Kuang
6125d9e063
Refactor onnxruntime.cmake ( #220 )
2023-07-18 15:44:54 +08:00
Fangjun Kuang
bebc1f1398
Use static libraries for MFC examples ( #210 )
2023-07-13 14:52:43 +08:00
danfu
1c3dac9001
support streaming zipformer2 ( #185 )
...
Co-authored-by: danfu <danfu@tencent.com>
2023-06-26 11:09:43 +08:00
Wei Kang
8562711252
Implement context biasing with an Aho-Corasick automaton ( #145 )
...
* Implement context graph
* Modify the interface to support context biasing
* Support context biasing in modified beam search; add python wrapper
* Support context biasing in python api example
* Minor fixes
* Fix context graph
* Minor fixes
* Fix tests
* Fix style
* Fix style
* Fix comments
* Minor fixes
* Add missing header
* Replace std::shared_ptr with std::unique_ptr for efficiency
* Build graph in constructor
* Fix comments
* Minor fixes
* Fix docs
2023-06-16 14:26:36 +08:00
Yuekai Zhang
b8fbf8e5ce
Add onnxruntime gpu for cmake ( #153 )
...
* add onnxruntime gpu for cmake
* fix clang
* fix typo
* cpplint
2023-05-12 22:30:47 +08:00
Fangjun Kuang
cea718e3d8
Support CoreML for macOS ( #151 )
2023-05-12 15:57:44 +08:00
Jingzhao Ou
0992063de8
Stack and streaming conformer support ( #141 )
...
* added csrc/stack.cc
* stack: added checks
* added copyright info
* passed cpp style checks
* formatted code
* added some support for streaming conformer models (not verified)
* code lint
* made more progress with streaming conformer support (not working yet)
* passed style check
* changes as suggested by @csukuangfj
* added some debug info
* fixed style check
* Use Cat to replace Stack
* remove debug statements
---------
Co-authored-by: Jingzhao Ou (jou2019) <jou2019@cisco.com>
Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
2023-05-11 14:30:39 +08:00
PF Luo
8c6a6768d5
Add lm rescore to online-modified-beam-search ( #133 )
2023-05-05 21:23:54 +08:00
Fangjun Kuang
86017f9833
Add RNN LM rescore for offline ASR with modified_beam_search ( #125 )
2023-04-23 17:15:18 +08:00
Fangjun Kuang
80060c276d
Begin to support CTC models ( #119 )
...
Please see https://k2-fsa.github.io/sherpa/onnx/pretrained_models/offline-ctc/nemo/index.html for a list of pre-trained CTC models from NeMo.
2023-04-07 23:11:34 +08:00
Fangjun Kuang
726680c5e0
Install binaries via pip install ( #112 )
...
When people use `pip install sherpa-onnx`, they also get the following binaries:
(py38) fangjuns-MacBook-Pro:bin fangjun$ ls -lh sherpa-onnx*
-rwxr-xr-x 1 fangjun staff 36K Apr 4 13:48 sherpa-onnx
-rwxr-xr-x 1 fangjun staff 52K Apr 4 13:48 sherpa-onnx-microphone
-rwxr-xr-x 1 fangjun staff 54K Apr 4 13:48 sherpa-onnx-microphone-offline
-rwxr-xr-x 1 fangjun staff 37K Apr 4 13:48 sherpa-onnx-offline
-rwxr-xr-x 1 fangjun staff 634K Apr 4 13:48 sherpa-onnx-offline-websocket-server
-rwxr-xr-x 1 fangjun staff 710K Apr 4 13:48 sherpa-onnx-online-websocket-client
-rwxr-xr-x 1 fangjun staff 651K Apr 4 13:48 sherpa-onnx-online-websocket-server
(py38) fangjuns-MacBook-Pro:bin fangjun$ pwd
/Users/fangjun/py38/bin
2023-04-04 15:45:59 +08:00
Fangjun Kuang
b911915a32
Add microphone support for offline recognizer ( #104 )
2023-03-30 19:43:05 +08:00
Fangjun Kuang
6707ec4124
add offline websocket server/client ( #98 )
2023-03-29 21:48:45 +08:00
Fangjun Kuang
423d89e9a5
Support paraformer. ( #95 )
2023-03-28 17:59:54 +08:00
Fangjun Kuang
dffb0fd43c
Refactor offline recognizer. ( #94 )
...
* Refactor offline recognizer.
The purpose is to make it easier to support different types of models.
2023-03-27 14:59:40 +08:00
Fangjun Kuang
5572246253
Add non-streaming ASR ( #92 )
2023-03-26 08:53:42 +08:00
Fangjun Kuang
6f92bc7362
Fix issues about DLLs when installing pre-compiled wheels on windows ( #91 )
2023-03-18 13:55:59 +08:00
Fangjun Kuang
8c6a289e3d
Add PackPaddedSequence ( #85 )
2023-03-08 14:12:20 +08:00
Fangjun Kuang
da5c80cc74
add pad_sequence ( #84 )
2023-03-05 22:02:50 +08:00
Fangjun Kuang
7cae710751
Support slicing a shallow copy of a 3-d tensor ( #83 )
2023-03-05 19:18:31 +08:00
Fangjun Kuang
422221b3b4
Add transpose ( #82 )
2023-03-05 11:27:17 +08:00
Fangjun Kuang
7f72c13d9a
Code refactoring ( #74 )
...
* Don't reset model state and feature extractor on endpointing
* support passing decoding_method from commandline
* Add modified_beam_search to Python API
* fix C API example
* Fix style issues
2023-03-03 12:10:59 +08:00
PF Luo
5326d0f81f
add modified beam search ( #69 )
2023-03-01 15:32:54 +08:00
Fangjun Kuang
40522f037b
add streaming websocket server and client ( #62 )
2023-02-24 21:39:51 +08:00
Fangjun Kuang
5a5d029490
Add build script for Android armv8a ( #58 )
2023-02-22 22:36:05 +08:00
Fangjun Kuang
a65dcf77b3
add portaudio for reading microphones ( #55 )
2023-02-22 17:13:07 +08:00
Fangjun Kuang
124384369a
Add endpointing ( #54 )
2023-02-22 15:35:55 +08:00
Fangjun Kuang
d1994f1fd8
Support cross compiling for aarch64 ( #52 )
2023-02-21 22:04:21 +08:00
Fangjun Kuang
3ea6aa949d
Add Streaming zipformer ( #50 )
2023-02-21 20:00:03 +08:00
Fangjun Kuang
ea09d5fbc5
Add Python API ( #31 )
2023-02-19 19:36:03 +08:00
Fangjun Kuang
ebc3b47fb8
add online-recognizer ( #29 )
2023-02-19 12:45:38 +08:00
Fangjun Kuang
d4b0c0590a
Add online stream. ( #28 )
2023-02-19 11:42:15 +08:00
Fangjun Kuang
0f6f58d1d3
Add online transducer decoder ( #27 )
2023-02-19 10:39:07 +08:00
Fangjun Kuang
cb8f85ff83
Add online LSTM transducer model ( #25 )
2023-02-18 21:35:15 +08:00
Fangjun Kuang
f5f90a0742
Release v1.0 ( #18 )
...
* Release v1.0
* Add link to doc
2022-10-14 12:13:51 +08:00
Fangjun Kuang
8ee02c28b0
Refactor the code ( #15 )
...
* code refactoring
* Remove reference files
* Update README and CI
* small fixes
* fix style issues
* add style check for CI
* fix style issues
* remove kaldi-native-io
2022-10-12 21:35:07 +08:00
Fangjun Kuang
77ccd625b8
code refactoring and add CI ( #11 )
2022-10-12 11:27:05 +08:00