Commit Graph

102 Commits

Author SHA1 Message Date
Fangjun Kuang
692a47dd80 Add Swift example for generating subtitles (#318) 2023-09-18 15:16:54 +08:00
Peng He
5ca0ff8811 Fix LogAdd (#316)
When 0 is used as the initial value, addition should not be performed if both values are 0.
2023-09-18 10:43:04 +08:00
Fangjun Kuang
c471423125 Add Silero VAD (#313) 2023-09-17 14:54:38 +08:00
Fangjun Kuang
e2be532b32 Add timestamps for offline paraformer (#310) 2023-09-14 19:33:41 +08:00
Wei Kang
47184f9db7 Refactor hotwords, support loading hotwords from file (#296) 2023-09-14 19:33:17 +08:00
Fangjun Kuang
d46b7ec178 Catch exception from non-streaming paraformer. (#307) 2023-09-12 16:44:33 +08:00
Fangjun Kuang
debab7c091 Add two-pass speech recognition Android/iOS demo (#304) 2023-09-12 15:40:16 +08:00
Fangjun Kuang
a12ebfab22 treat unk as blank (#299) 2023-09-07 15:12:29 +08:00
Fangjun Kuang
a0a747a0c0 add endpointing for online websocket server (#294) 2023-08-31 14:41:04 +08:00
Wei Kang
2b0152d2a2 Fix context graph (#292) 2023-08-28 19:39:22 +08:00
Fangjun Kuang
eb22b4845a Fix a bug for multilingual ASR (#281) 2023-08-17 10:43:26 +08:00
Fangjun Kuang
f709c95c5f Support multilingual whisper models (#274) 2023-08-16 00:28:52 +08:00
Fangjun Kuang
35526e26e1 Support paraformer on Android (#264) 2023-08-14 12:26:15 +08:00
Fangjun Kuang
6038e2aa62 Support streaming paraformer (#263) 2023-08-14 10:32:14 +08:00
Fangjun Kuang
a4bff28e21 Support TDNN models from the yesno recipe from icefall (#262) 2023-08-12 19:50:22 +08:00
frankyoujian
9dcad7e963 Reinitialize context state after Reset stream when using contexts (#257) 2023-08-10 14:19:40 +08:00
Fangjun Kuang
865fd1e017 Support pkg-config (#253) 2023-08-10 11:22:36 +08:00
Fangjun Kuang
79c2ce5dd4 Refactor online recognizer (#250)
* Refactor online recognizer.

Make it easier to support other streaming models.

Note that it is a breaking change for the Python API.
`sherpa_onnx.OnlineRecognizer()` used before should be
replaced by `sherpa_onnx.OnlineRecognizer.from_transducer()`.
2023-08-09 20:27:31 +08:00
Fangjun Kuang
6061318e3f fix building on linux with GPU (#249) 2023-08-09 20:21:28 +08:00
Fangjun Kuang
92bfee0424 Flush stderr on write (#248) 2023-08-09 15:33:01 +08:00
Fangjun Kuang
aa48b76d4b Fix initial tokens to decoding (#246) 2023-08-09 12:33:47 +08:00
Fangjun Kuang
45b9d4ab37 Support whisper models (#238) 2023-08-07 12:34:18 +08:00
Fangjun Kuang
c5756734a9 Use parse options to parse arguments from sherpa-onnx-microphone (#237) 2023-08-05 18:05:18 +08:00
Jingzhao Ou
daffdab52a Updated hypothesis key generation to be the same as sherpa (#226) 2023-07-28 14:19:49 +08:00
Fangjun Kuang
6125d9e063 Refactor onnxruntime.cmake (#220) 2023-07-18 15:44:54 +08:00
Wilson Wongso
5a6b55c5a7 Reduce model initialization time for online speech recognition (#215)
* Reduce model initialization time for online speech recognition

* Fixed Styling

---------

Co-authored-by: w11wo <wilsowong961@gmail.com>
2023-07-14 21:20:10 +08:00
Fangjun Kuang
f3206c49dc Reduce model initialization time for offline speech recognition (#213) 2023-07-14 18:07:27 +08:00
Fangjun Kuang
bebc1f1398 Use static libraries for MFC examples (#210) 2023-07-13 14:52:43 +08:00
Wei Kang
513dfaa552 Support contextual-biasing for streaming model (#184)
* Support contextual-biasing for streaming model

* The whole pipeline runs normally

* Fix comments
2023-06-30 16:46:24 +08:00
danfu
1c3dac9001 support streaming zipformer2 (#185)
Co-authored-by: danfu <danfu@tencent.com>
2023-06-26 11:09:43 +08:00
Fangjun Kuang
d82251cba6 Support the latest offline zipformer model (#180) 2023-06-19 14:09:24 +08:00
Wei Kang
8562711252 Implement context biasing with an Aho-Corasick automaton (#145)
* Implement context graph

* Modify the interface to support context biasing

* Support context biasing in modified beam search; add python wrapper

* Support context biasing in python api example

* Minor fixes

* Fix context graph

* Minor fixes

* Fix tests

* Fix style

* Fix style

* Fix comments

* Minor fixes

* Add missing header

* Replace std::shared_ptr with std::unique_ptr for efficiency

* Build graph in constructor

* Fix comments

* Minor fixes

* Fix docs
2023-06-16 14:26:36 +08:00
keanu
1a1b9fd236 RNNLM model supports lm_num_thread and lm_provider settings (#173)
* rnnlm model inference supports num_threads setting

* decouple rnnlm num_thread and provider params from Transducer.

* fix python csrc bug in the arguments of offline-lm-config.cc and online-lm-config.cc

* lm_num_threads and lm_provider set default values

---------

Co-authored-by: cuidongcai1035 <cuidongcai1035@wezhuiyi.com>
2023-06-12 15:51:27 +08:00
keanu
9c017c2ccb rnnlm model inference supports num_threads setting (#169)
Co-authored-by: cuidongcai1035 <cuidongcai1035@wezhuiyi.com>
2023-06-07 09:32:27 +08:00
Jingzhao Ou
fdd49d0538 add batch processing to sherpa-onnx (#166) 2023-06-04 14:13:55 +08:00
Jingzhao Ou
0ed501b8f1 Added provider option to sherpa-onnx and decode-file-c-api (#162) 2023-06-03 04:57:48 +08:00
Fangjun Kuang
d7114da441 Minor fixes (#161) 2023-05-23 15:57:33 +08:00
Fangjun Kuang
44821ae2fb Use fixed decimal point for offline timestamp (#158) 2023-05-22 16:52:38 +08:00
PF Luo
655c619bf3 Fix lm fusion (#157)
* share GetHypsRowSplits interface and fix getting Topk not taking logprob

* fix lm score of lm fusion and make padding length the same as in 'icefall/egs/librispeech/ASR/pruned_transducer_stateless7_streaming/decode.py'
2023-05-15 10:48:45 +08:00
Yuekai Zhang
b8fbf8e5ce Add onnxruntime gpu for cmake (#153)
* add onnxruntime gpu for cmake

* fix clang

* fix typo

* cpplint
2023-05-12 22:30:47 +08:00
Fangjun Kuang
cea718e3d8 Support CoreML for macOS (#151) 2023-05-12 15:57:44 +08:00
Jingzhao Ou
0992063de8 Stack and streaming conformer support (#141)
* added csrc/stack.cc

* stack: added checks

* added copyright info

* passed cpp style checks

* formatted code

* added some support for streaming conformer model (not verified)

* code lint

* made more progress with streaming conformer support (not working yet)

* passed style check

* changes as suggested by @csukuangfj

* added some debug info

* fixed style check

* Use Cat to replace Stack

* remove debug statements

---------

Co-authored-by: Jingzhao Ou (jou2019) <jou2019@cisco.com>
Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
2023-05-11 14:30:39 +08:00
PF Luo
824b0809a4 add shallow fusion (#147) 2023-05-10 22:30:57 +08:00
cooldoomsday
0bc571f6ee Return timestamp info and tokens in offline ASR
Co-authored-by: zhangbaofeng@npnets.com <41259@Zbf>
2023-05-06 10:20:46 +08:00
PF Luo
8c6a6768d5 Add lm rescore to online-modified-beam-search (#133) 2023-05-05 21:23:54 +08:00
PF Luo
aa7108729b share GetHypsRowSplits interface and fix getting Topk not taking logprob (#131) 2023-04-26 11:41:04 +08:00
Fangjun Kuang
86017f9833 Add RNN LM rescore for offline ASR with modified_beam_search (#125) 2023-04-23 17:15:18 +08:00
Fangjun Kuang
ad05f52666 Add timestamps for streaming ASR. (#123) 2023-04-19 16:02:37 +08:00
彭震东
d781fcdeef Use log probs for paraformer (#120)
* Use log probs for paraformer

* Fix
2023-04-09 23:28:10 +08:00
Fangjun Kuang
80060c276d Begin to support CTC models (#119)
Please see https://k2-fsa.github.io/sherpa/onnx/pretrained_models/offline-ctc/nemo/index.html for a list of pre-trained CTC models from NeMo.
2023-04-07 23:11:34 +08:00