Commit Graph

237 Commits

Author SHA1 Message Date
Fangjun Kuang
cf199ad466 Support onnxruntime 1.16.0 (#330) 2023-09-21 20:39:24 +08:00
Fangjun Kuang
532ed142d2 Support linking onnxruntime lib statically on Linux (#326) 2023-09-21 10:15:42 +08:00
Fangjun Kuang
6afa9c85f6 Fix tokens for byte-level BPE token. (#324) 2023-09-20 07:49:53 +08:00
keanu
bd173b27cc Offline decode support multi threads (#306)
Co-authored-by: cuidongcai1035 <cuidongcai1035@wezhuiyi.com>
2023-09-19 21:04:13 +08:00
Nick Fisher
b3e9986825 Add CreateOnlineStreamWithHotwords to C API (#323)
* add default visibility to SHERPA_ONNX_EXPORT

* expose CreateOnlineStreamWithHotwords method via C API

Co-authored-by: Nick Fisher <nick.fisher@polyvox.app>
2023-09-19 17:32:42 +08:00
Wei Kang
d7eab95439 Add java api for hotwords (#319)
* Add java api

* support websocket

* Fix kotlin
2023-09-18 22:44:29 +08:00
Wei Kang
a5d1c90807 Support c-api (#317) 2023-09-18 16:24:57 +08:00
Fangjun Kuang
692a47dd80 Add Swift example for generating subtitles (#318) 2023-09-18 15:16:54 +08:00
Peng He
5ca0ff8811 Fix LogAdd (#316)
Using 0 as the initial value,  should not perform addition when both values are 0
2023-09-18 10:43:04 +08:00
Fangjun Kuang
c471423125 Add Silero VAD (#313) 2023-09-17 14:54:38 +08:00
Fangjun Kuang
e2be532b32 Add timestamps for offline paraformer (#310) 2023-09-14 19:33:41 +08:00
Wei Kang
47184f9db7 Refactor hotwords,support loading hotwords from file (#296) 2023-09-14 19:33:17 +08:00
Fangjun Kuang
d46b7ec178 Catch exception from non-streaming paraformer. (#307) 2023-09-12 16:44:33 +08:00
Fangjun Kuang
debab7c091 Add two-pass speech recognition Android/iOS demo (#304) 2023-09-12 15:40:16 +08:00
Fangjun Kuang
a12ebfab22 treat unk as blank (#299) 2023-09-07 15:12:29 +08:00
Fangjun Kuang
a0a747a0c0 add endpointing for online websocket server (#294) 2023-08-31 14:41:04 +08:00
Wei Kang
2b0152d2a2 Fix context graph (#292) 2023-08-28 19:39:22 +08:00
Fangjun Kuang
eb22b4845a Fix a bug for multilingual ASR (#281) 2023-08-17 10:43:26 +08:00
Fangjun Kuang
e31f9e48c2 Fix various language binding APIs for tdnn and whisper models (#278) 2023-08-16 22:15:10 +08:00
zhaomingwork
3ab135c1eb update Makefile for paraformer java (#277) 2023-08-16 22:11:50 +08:00
zhaomingwork
256a8ecb50 update java for paraformer (#276) 2023-08-16 20:16:51 +08:00
Fangjun Kuang
f709c95c5f Support multilingual whisper models (#274) 2023-08-16 00:28:52 +08:00
Fangjun Kuang
bc791d4996 Fix C api for Go and MFC to support streaming paraformer (#268) 2023-08-14 17:02:23 +08:00
Fangjun Kuang
a8bdb4b38a Support paraformer on iOS (#265)
* Fix C API to support streaming paraformer

* Fix Swift API

* Support paraformer in iOS
2023-08-14 14:38:41 +08:00
Fangjun Kuang
35526e26e1 Support paraformer on Android (#264) 2023-08-14 12:26:15 +08:00
Fangjun Kuang
6038e2aa62 Support streaming paraformer (#263) 2023-08-14 10:32:14 +08:00
Fangjun Kuang
a4bff28e21 Support TDNN models from the yesno recipe from icefall (#262) 2023-08-12 19:50:22 +08:00
Fangjun Kuang
b094868fb8 Add non-streaming websocket server for python (#259) 2023-08-11 15:56:24 +08:00
frankyoujian
9dcad7e963 Reinitialize context state after Reset stream when using contexts (#257) 2023-08-10 14:19:40 +08:00
Fangjun Kuang
865fd1e017 Support pkg-config (#253) 2023-08-10 11:22:36 +08:00
Fangjun Kuang
79c2ce5dd4 Refactor online recognizer (#250)
* Refactor online recognizer.

Make it easier to support other streaming models.

Note that it is a breaking change for the Python API.
`sherpa_onnx.OnlineRecognizer()` used before should be
replaced by `sherpa_onnx.OnlineRecognizer.from_transducer()`.
2023-08-09 20:27:31 +08:00
Fangjun Kuang
6061318e3f fix building on linux with GPU (#249) 2023-08-09 20:21:28 +08:00
Fangjun Kuang
92bfee0424 Flush stderr on write (#248) 2023-08-09 15:33:01 +08:00
Fangjun Kuang
aa48b76d4b Fix initial tokens to decoding (#246) 2023-08-09 12:33:47 +08:00
Fangjun Kuang
45b9d4ab37 Support whisper models (#238) 2023-08-07 12:34:18 +08:00
Wilson Wongso
64efbd82af Implement Tokens in Swift and Kotlin (#227)
Co-authored-by: duc <duc@appiphany.com.au>
2023-08-05 18:37:03 +08:00
Fangjun Kuang
c5756734a9 Use parse options to parse arguments from sherpa-onnx-microphone (#237) 2023-08-05 18:05:18 +08:00
zhaomingwork
5a549615df Java api update for adding modelType in config class (#228) 2023-07-30 17:04:18 +08:00
Jingzhao Ou
daffdab52a Updated hypothesis key generation to be the same as sherpa (#226) 2023-07-28 14:19:49 +08:00
Fangjun Kuang
6125d9e063 Refactor onnxruntime.cmake (#220) 2023-07-18 15:44:54 +08:00
Fangjun Kuang
de2673680e Fix model_type for jni, c# and iOS. (#216) 2023-07-14 22:24:38 +08:00
Wilson Wongso
5a6b55c5a7 Reduce model initialization time for online speech recognition (#215)
* Reduce model initialization time for online speech recognition

* Fixed Styling

---------

Co-authored-by: w11wo <wilsowong961@gmail.com>
2023-07-14 21:20:10 +08:00
Fangjun Kuang
f3206c49dc Reduce model initialization time for offline speech recognition (#213) 2023-07-14 18:07:27 +08:00
Fangjun Kuang
0abd7ce881 Add non-streaming speech recognition examples for MFC (#212) 2023-07-14 17:00:14 +08:00
Fangjun Kuang
bebc1f1398 Use static libraries for MFC examples (#210) 2023-07-13 14:52:43 +08:00
Fangjun Kuang
5cd72ba3aa Fix setting context lists. (#207) 2023-07-12 09:18:56 +08:00
Wilson Wongso
b2364b0374 Implemented tokens and timestamps in Python API (#205) 2023-07-12 09:12:31 +08:00
Fangjun Kuang
33bf8dc1f4 Support specifying providers in Python API (#198) 2023-07-06 10:14:01 +08:00
Wei Kang
513dfaa552 Support contextual-biasing for streaming model (#184)
* Support contextual-biasing for streaming model

* The whole pipeline runs normally

* Fix comments
2023-06-30 16:46:24 +08:00
danfu
1c3dac9001 support streaming zipformer2 (#185)
Co-authored-by: danfu <danfu@tencent.com>
2023-06-26 11:09:43 +08:00