255 Commits

Author SHA1 Message Date
yujinqiu
f6566c8ace Expose VAD isDetected api to Swift (#356) 2023-10-12 15:11:58 +08:00
Fangjun Kuang
32da5ecf02 Add script to convert vits models (#355) 2023-10-12 11:59:19 +08:00
Fangjun Kuang
98b67ad850 Fix reading hotwords file for android (#354) 2023-10-11 12:20:50 +08:00
zr_jin
845505708e Update speech-recognition-from-microphone-with-endpoint-detection.py (#352) 2023-10-08 14:37:09 +08:00
Fangjun Kuang
be081017de Fix typos/bugs (#351) 2023-10-08 11:39:59 +08:00
Fangjun Kuang
407602445d Add CTC HLG decoding using OpenFst (#349) 2023-10-08 11:32:39 +08:00
Nickolay V. Shmyrev
c12286fe5e Proper convolution mode for fast GPU processing (#350) 2023-10-07 20:24:57 +08:00
Fangjun Kuang
36017d49c4 add a comment about how to download silero_vad.onnx (#346) 2023-09-26 17:58:53 +08:00
Fangjun Kuang
33a5765169 Print a more user-friendly error message when using --hotwords-file. (#344) 2023-09-26 11:04:20 +08:00
yujinqiu
9091917eab Generate SRT from audio (#341) 2023-09-25 16:36:16 +08:00
Fangjun Kuang
552a267c23 Set is_final and start_time for online websocket server. (#342)
* Set is_final and start_time for online websocket server.

* Convert timestamps to a json array
2023-09-25 15:12:07 +08:00
poor1017
c2518a5826 Supports cmake compilation compatible with v3.13. (#340)
Co-authored-by: chenyu <cheny65@chinatelecom.cn>
2023-09-25 11:48:55 +08:00
dym21
fef61080de Added #include <cstdint> to fix gcc 13.2 compilation error. (#339) 2023-09-25 10:38:26 +08:00
Fangjun Kuang
f91c49a7ba Provide pre-built wheels for 32-bit arm (e.g., 32-bit Raspberry Pi) (#338) 2023-09-25 09:33:43 +08:00
Fangjun Kuang
7649bd862c Fix building APKs (#337) 2023-09-24 14:16:14 +08:00
Fangjun Kuang
4ca60e2603 Add VAD + Non-streaming ASR model for Android (#336) 2023-09-23 22:48:38 +08:00
Fangjun Kuang
6e60a77d89 Add Android APK for Silero VAD (#335) 2023-09-23 20:39:13 +08:00
Fangjun Kuang
65ec4dc741 Update alternative URLs for onnxruntime 1.16.0 (#334) 2023-09-22 18:50:52 +08:00
Fangjun Kuang
43b2b7760d Fix tokens processing for byte-level BPE (#333) 2023-09-22 13:28:19 +08:00
Fangjun Kuang
969fff5622 Add VAD + Non-streaming ASR Python example. (#332) 2023-09-22 11:53:47 +08:00
Fangjun Kuang
cf199ad466 Support onnxruntime 1.16.0 (#330) 2023-09-21 20:39:24 +08:00
zr_jin
b640c295b9 Swift API for hotwords support (#331) 2023-09-21 20:32:13 +08:00
Fangjun Kuang
532ed142d2 Support linking onnxruntime lib statically on Linux (#326) 2023-09-21 10:15:42 +08:00
Fangjun Kuang
f5c060dd61 Reduce whisper decoder file size with onnx export (#328) 2023-09-20 19:33:26 +08:00
Fangjun Kuang
6afa9c85f6 Fix tokens for byte-level BPE token. (#324) 2023-09-20 07:49:53 +08:00
keanu
bd173b27cc Offline decode support multi threads (#306)
Co-authored-by: cuidongcai1035 <cuidongcai1035@wezhuiyi.com>
2023-09-19 21:04:13 +08:00
Nick Fisher
b3e9986825 Add CreateOnlineStreamWithHotwords to C API (#323)
* add default visibility to SHERPA_ONNX_EXPORT

* expose CreateOnlineStreamWithHotwords method via C API

Co-authored-by: Nick Fisher <nick.fisher@polyvox.app>
2023-09-19 17:32:42 +08:00
Wei Kang
d7eab95439 Add java api for hotwords (#319)
* Add java api

* support websocket

* Fix kotlin
2023-09-18 22:44:29 +08:00
Wei Kang
4dfc11066a format to linux file (\r\n -> \n) (#320) 2023-09-18 16:57:12 +08:00
Wei Kang
a5d1c90807 Support c-api (#317) 2023-09-18 16:24:57 +08:00
Fangjun Kuang
bd18e073f4 Fix building MFC examples (#321) 2023-09-18 16:04:07 +08:00
Fangjun Kuang
692a47dd80 Add Swift example for generating subtitles (#318) 2023-09-18 15:16:54 +08:00
Fangjun Kuang
2d51ca49b7 Generate subtitles (#315) 2023-09-18 10:44:06 +08:00
Peng He
5ca0ff8811 Fix LogAdd (#316)
Using 0 as the initial value,  should not perform addition when both values are 0
2023-09-18 10:43:04 +08:00
Fangjun Kuang
c471423125 Add Silero VAD (#313) 2023-09-17 14:54:38 +08:00
Fangjun Kuang
3a20e332bf Fix symlink for Windows (#312) 2023-09-15 16:03:09 +08:00
Fangjun Kuang
1e95fbf15f release v1.7.14 (#311) 2023-09-14 19:35:29 +08:00
Fangjun Kuang
e2be532b32 Add timestamps for offline paraformer (#310) 2023-09-14 19:33:41 +08:00
Wei Kang
47184f9db7 Refactor hotwords,support loading hotwords from file (#296) 2023-09-14 19:33:17 +08:00
Fangjun Kuang
087367d7fe Build APKs for two-pass speech recognition (#308) 2023-09-12 18:18:24 +08:00
Fangjun Kuang
d46b7ec178 Catch exception from non-streaming paraformer. (#307) 2023-09-12 16:44:33 +08:00
Fangjun Kuang
debab7c091 Add two-pass speech recognition Android/iOS demo (#304) 2023-09-12 15:40:16 +08:00
Fangjun Kuang
8982984ea2 add a two-pass python example (#303) 2023-09-10 17:56:13 +08:00
Fangjun Kuang
86b18184c9 Fix Go examples (#300) 2023-09-07 15:27:41 +08:00
Fangjun Kuang
a12ebfab22 treat unk as blank (#299) 2023-09-07 15:12:29 +08:00
Fangjun Kuang
ffeff3b8a3 Fix a typo for Go (#298) 2023-09-07 11:29:00 +08:00
Fangjun Kuang
a0a747a0c0 add endpointing for online websocket server (#294) 2023-08-31 14:41:04 +08:00
Wei Kang
2b0152d2a2 Fix context graph (#292) 2023-08-28 19:39:22 +08:00
Fangjun Kuang
49ec7e8f57 Fix building APK (#286) 2023-08-18 15:28:06 +08:00
Fangjun Kuang
e955f9c752 Release pre-built APKs (#285) 2023-08-18 14:28:44 +08:00