Fangjun Kuang
692a47dd80
Add Swift example for generating subtitles ( #318 )
2023-09-18 15:16:54 +08:00
Peng He
5ca0ff8811
Fix LogAdd ( #316 )
When 0 is used as the initial value, the addition should not be performed if both values are 0.
2023-09-18 10:43:04 +08:00
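The commit message for #316 above is terse, so here is a minimal Python sketch of one reading of it. It assumes `LogAdd` computes `log(exp(x) + exp(y))` and that 0 serves as an "empty" initial value; the names `log_add` and `guarded_log_add` are hypothetical and are not taken from the sherpa-onnx sources.

```python
import math


def log_add(x: float, y: float) -> float:
    """Return log(exp(x) + exp(y)) in a numerically stable way."""
    if x == -math.inf:
        return y
    if y == -math.inf:
        return x
    hi, lo = (x, y) if x >= y else (y, x)
    # Factor out the larger term so exp() never overflows.
    return hi + math.log1p(math.exp(lo - hi))


def guarded_log_add(x: float, y: float) -> float:
    """Hypothetical guard: skip the addition when both values are still 0.

    Assumption: 0 is used as the initial (unset) value, so adding two
    zeros would otherwise introduce a spurious log(2) term.
    """
    if x == 0.0 and y == 0.0:
        return 0.0
    return log_add(x, y)


if __name__ == "__main__":
    print(log_add(0.0, 0.0))          # unguarded: equals math.log(2)
    print(guarded_log_add(0.0, 0.0))  # guarded: stays at the initial value
```

The point of the sketch is only that `log_add(0, 0)` is `log 2` rather than 0, which is why an initial value of 0 needs special handling; the actual fix in the repository may be structured differently.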
Fangjun Kuang
c471423125
Add Silero VAD ( #313 )
2023-09-17 14:54:38 +08:00
Fangjun Kuang
e2be532b32
Add timestamps for offline paraformer ( #310 )
2023-09-14 19:33:41 +08:00
Wei Kang
47184f9db7
Refactor hotwords, support loading hotwords from file ( #296 )
2023-09-14 19:33:17 +08:00
Fangjun Kuang
d46b7ec178
Catch exception from non-streaming paraformer. ( #307 )
2023-09-12 16:44:33 +08:00
Fangjun Kuang
debab7c091
Add two-pass speech recognition Android/iOS demo ( #304 )
2023-09-12 15:40:16 +08:00
Fangjun Kuang
a12ebfab22
treat unk as blank ( #299 )
2023-09-07 15:12:29 +08:00
Fangjun Kuang
a0a747a0c0
add endpointing for online websocket server ( #294 )
2023-08-31 14:41:04 +08:00
Wei Kang
2b0152d2a2
Fix context graph ( #292 )
2023-08-28 19:39:22 +08:00
Fangjun Kuang
eb22b4845a
Fix a bug for multilingual ASR ( #281 )
2023-08-17 10:43:26 +08:00
Fangjun Kuang
f709c95c5f
Support multilingual whisper models ( #274 )
2023-08-16 00:28:52 +08:00
Fangjun Kuang
35526e26e1
Support paraformer on Android ( #264 )
2023-08-14 12:26:15 +08:00
Fangjun Kuang
6038e2aa62
Support streaming paraformer ( #263 )
2023-08-14 10:32:14 +08:00
Fangjun Kuang
a4bff28e21
Support TDNN models from the yesno recipe from icefall ( #262 )
2023-08-12 19:50:22 +08:00
frankyoujian
9dcad7e963
Reinitialize context state after Reset stream when using contexts ( #257 )
2023-08-10 14:19:40 +08:00
Fangjun Kuang
865fd1e017
Support pkg-config ( #253 )
2023-08-10 11:22:36 +08:00
Fangjun Kuang
79c2ce5dd4
Refactor online recognizer ( #250 )
* Refactor online recognizer.
Make it easier to support other streaming models.
Note that it is a breaking change for the Python API.
`sherpa_onnx.OnlineRecognizer()` used before should be
replaced by `sherpa_onnx.OnlineRecognizer.from_transducer()`.
2023-08-09 20:27:31 +08:00
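As a hedged illustration of the breaking change noted in #250 above, the sketch below builds a streaming transducer recognizer through `sherpa_onnx.OnlineRecognizer.from_transducer()`. The helper names and the file names inside the model directory are assumptions for illustration; the keyword names (`tokens`, `encoder`, `decoder`, `joiner`, `num_threads`, `decoding_method`) are given as understood from the Python package and should be checked against the installed version.

```python
from pathlib import Path


def transducer_kwargs(model_dir: str) -> dict:
    """Collect keyword arguments for from_transducer().

    The file names below are placeholders (assumed); replace them with
    the files shipped with the model you actually downloaded.
    """
    d = Path(model_dir)
    return {
        "tokens": str(d / "tokens.txt"),
        "encoder": str(d / "encoder.onnx"),
        "decoder": str(d / "decoder.onnx"),
        "joiner": str(d / "joiner.onnx"),
        "num_threads": 1,
        "decoding_method": "greedy_search",
    }


def create_recognizer(model_dir: str):
    """Create a streaming recognizer using the factory method.

    Code written before #250 called sherpa_onnx.OnlineRecognizer(...)
    directly; per the commit message, it should use from_transducer().
    """
    import sherpa_onnx  # imported lazily so the helper above works without it

    return sherpa_onnx.OnlineRecognizer.from_transducer(
        **transducer_kwargs(model_dir)
    )
```

Calling `create_recognizer("path/to/model")` requires the `sherpa_onnx` package and real model files; `transducer_kwargs` alone has no such dependency.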
Fangjun Kuang
6061318e3f
fix building on linux with GPU ( #249 )
2023-08-09 20:21:28 +08:00
Fangjun Kuang
92bfee0424
Flush stderr on write ( #248 )
2023-08-09 15:33:01 +08:00
Fangjun Kuang
aa48b76d4b
Fix initial tokens to decoding ( #246 )
2023-08-09 12:33:47 +08:00
Fangjun Kuang
45b9d4ab37
Support whisper models ( #238 )
2023-08-07 12:34:18 +08:00
Fangjun Kuang
c5756734a9
Use parse options to parse arguments from sherpa-onnx-microphone ( #237 )
2023-08-05 18:05:18 +08:00
Jingzhao Ou
daffdab52a
Updated hypothesis key generation to be the same as sherpa ( #226 )
2023-07-28 14:19:49 +08:00
Fangjun Kuang
6125d9e063
Refactor onnxruntime.cmake ( #220 )
2023-07-18 15:44:54 +08:00
Wilson Wongso
5a6b55c5a7
Reduce model initialization time for online speech recognition ( #215 )
* Reduce model initialization time for online speech recognition
* Fixed Styling
---------
Co-authored-by: w11wo <wilsowong961@gmail.com>
2023-07-14 21:20:10 +08:00
Fangjun Kuang
f3206c49dc
Reduce model initialization time for offline speech recognition ( #213 )
2023-07-14 18:07:27 +08:00
Fangjun Kuang
bebc1f1398
Use static libraries for MFC examples ( #210 )
2023-07-13 14:52:43 +08:00
Wei Kang
513dfaa552
Support contextual-biasing for streaming model ( #184 )
* Support contextual-biasing for streaming model
* The whole pipeline runs normally
* Fix comments
2023-06-30 16:46:24 +08:00
danfu
1c3dac9001
support streaming zipformer2 ( #185 )
Co-authored-by: danfu <danfu@tencent.com>
2023-06-26 11:09:43 +08:00
Fangjun Kuang
d82251cba6
Support the latest offline zipformer model ( #180 )
2023-06-19 14:09:24 +08:00
Wei Kang
8562711252
Implement context biasing with an Aho-Corasick automaton ( #145 )
* Implement context graph
* Modify the interface to support context biasing
* Support context biasing in modified beam search; add python wrapper
* Support context biasing in python api example
* Minor fixes
* Fix context graph
* Minor fixes
* Fix tests
* Fix style
* Fix style
* Fix comments
* Minor fixes
* Add missing header
* Replace std::shared_ptr with std::unique_ptr for efficiency
* Build graph in constructor
* Fix comments
* Minor fixes
* Fix docs
2023-06-16 14:26:36 +08:00
keanu
1a1b9fd236
RNNLM model supports lm_num_thread and lm_provider settings ( #173 )
* rnnlm model inference supports num_threads setting
* decouple the rnnlm num_thread and provider params from the Transducer.
* fix an argument bug in the python csrc bindings for offline-lm-config.cc and online-lm-config.cc
* set default values for lm_num_threads and lm_provider
---------
Co-authored-by: cuidongcai1035 <cuidongcai1035@wezhuiyi.com>
2023-06-12 15:51:27 +08:00
keanu
9c017c2ccb
rnnlm model inference supports num_threads setting ( #169 )
Co-authored-by: cuidongcai1035 <cuidongcai1035@wezhuiyi.com>
2023-06-07 09:32:27 +08:00
Jingzhao Ou
fdd49d0538
add batch processing to sherpa-onnx ( #166 )
2023-06-04 14:13:55 +08:00
Jingzhao Ou
0ed501b8f1
Added provider option to sherpa-onnx and decode-file-c-api ( #162 )
2023-06-03 04:57:48 +08:00
Fangjun Kuang
d7114da441
Minor fixes ( #161 )
2023-05-23 15:57:33 +08:00
Fangjun Kuang
44821ae2fb
Use fixed decimal point for offline timestamp ( #158 )
2023-05-22 16:52:38 +08:00
PF Luo
655c619bf3
Fix lm fusion ( #157 )
* share GetHypsRowSplits interface and fix getting Topk not taking logprob
* fix lm score of lm fusion and make padding len same with 'icefall/egs/librispeech/ASR/pruned_transducer_stateless7_streaming/decode.py'
2023-05-15 10:48:45 +08:00
Yuekai Zhang
b8fbf8e5ce
Add onnxruntime gpu for cmake ( #153 )
* add onnxruntime gpu for cmake
* fix clang
* fix typo
* cpplint
2023-05-12 22:30:47 +08:00
Fangjun Kuang
cea718e3d8
Support CoreML for macOS ( #151 )
2023-05-12 15:57:44 +08:00
Jingzhao Ou
0992063de8
Stack and streaming conformer support ( #141 )
* added csrc/stack.cc
* stack: added checks
* added copyright info
* passed cpp style checks
* formatted code
* added some support for the streaming conformer model (not verified)
* code lint
* made more progress with streaming conformer support (not working yet)
* passed style check
* changes as suggested by @csukuangfj
* added some debug info
* fixed style check
* Use Cat to replace Stack
* remove debug statements
---------
Co-authored-by: Jingzhao Ou (jou2019) <jou2019@cisco.com>
Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
2023-05-11 14:30:39 +08:00
PF Luo
824b0809a4
add shallow fusion ( #147 )
2023-05-10 22:30:57 +08:00
cooldoomsday
0bc571f6ee
Return timestamp info and tokens in offline ASR
Co-authored-by: zhangbaofeng@npnets.com <41259@Zbf>
2023-05-06 10:20:46 +08:00
PF Luo
8c6a6768d5
Add lm rescore to online-modified-beam-search ( #133 )
2023-05-05 21:23:54 +08:00
PF Luo
aa7108729b
share GetHypsRowSplits interface and fix getting Topk not taking logprob ( #131 )
2023-04-26 11:41:04 +08:00
Fangjun Kuang
86017f9833
Add RNN LM rescore for offline ASR with modified_beam_search ( #125 )
2023-04-23 17:15:18 +08:00
Fangjun Kuang
ad05f52666
Add timestamps for streaming ASR. ( #123 )
2023-04-19 16:02:37 +08:00
彭震东
d781fcdeef
Use log probs for paraformer ( #120 )
* Use log probs for paraformer
* Fix
2023-04-09 23:28:10 +08:00
Fangjun Kuang
80060c276d
Begin to support CTC models ( #119 )
Please see https://k2-fsa.github.io/sherpa/onnx/pretrained_models/offline-ctc/nemo/index.html for a list of pre-trained CTC models from NeMo.
2023-04-07 23:11:34 +08:00