Fangjun Kuang
9efe69720d
Support VITS VCTK models ( #367 )
* Support VITS VCTK models
* Release v1.8.1
2023-10-16 17:22:30 +08:00
yujinqiu
d01682d968
Add vad clear api for better performance ( #366 )
* Add vad clear api for better performance
* rename to make naming consistent and remove macro
* Fix linker error
* Fix Vad.kt
2023-10-16 14:40:47 +08:00
longshiming
55b6893885
add SherpaOnnxWebSocket ( #362 )
* add SherpaOnnxWebSocket
* update text show style
---------
Co-authored-by: longshiming <longshiming@greesoft.com>
2023-10-16 14:39:32 +08:00
Fangjun Kuang
0df0a735d7
Fix building wheels for windows ( #365 )
2023-10-14 23:14:45 +08:00
Fangjun Kuang
655e0fa836
add python API and examples for TTS ( #364 )
2023-10-14 14:21:53 +08:00
Fangjun Kuang
1ac2232e14
Support writing generated audio samples to wave files ( #363 )
2023-10-13 23:36:03 +08:00
Fangjun Kuang
536d5804ba
Add TTS with VITS ( #360 )
2023-10-13 19:30:38 +08:00
Peng He
4771c9275c
Add lm decode for the Python API. ( #353 )
* Add lm decode for the Python API.
* fix style.
* Fix LogAdd: do not double lm_log_prob when merging paths with the same prefix
* sort the import alphabetically
2023-10-13 11:15:16 +08:00
Fangjun Kuang
323f532ad2
Fix symbol table for byte bpe ( #361 )
2023-10-13 10:51:59 +08:00
Fangjun Kuang
efd3cd3312
Fix pkg-config for linux ( #358 )
2023-10-12 16:21:06 +08:00
yujinqiu
64795144f3
Fix linker issue ( #357 )
* Expose VAD isDetected api to Swift
* Fix link issue
2023-10-12 16:16:02 +08:00
yujinqiu
f6566c8ace
Expose VAD isDetected api to Swift ( #356 )
2023-10-12 15:11:58 +08:00
Fangjun Kuang
32da5ecf02
Add script to convert vits models ( #355 )
2023-10-12 11:59:19 +08:00
Fangjun Kuang
98b67ad850
Fix reading hotwords file for android ( #354 )
2023-10-11 12:20:50 +08:00
zr_jin
845505708e
Update speech-recognition-from-microphone-with-endpoint-detection.py ( #352 )
2023-10-08 14:37:09 +08:00
Fangjun Kuang
be081017de
Fix typos/bugs ( #351 )
2023-10-08 11:39:59 +08:00
Fangjun Kuang
407602445d
Add CTC HLG decoding using OpenFst ( #349 )
2023-10-08 11:32:39 +08:00
Nickolay V. Shmyrev
c12286fe5e
Proper convolution mode for fast GPU processing ( #350 )
2023-10-07 20:24:57 +08:00
Fangjun Kuang
36017d49c4
add a comment about how to download silero_vad.onnx ( #346 )
2023-09-26 17:58:53 +08:00
Fangjun Kuang
33a5765169
Print a more user-friendly error message when using --hotwords-file. ( #344 )
2023-09-26 11:04:20 +08:00
yujinqiu
9091917eab
Generate SRT from audio ( #341 )
2023-09-25 16:36:16 +08:00
Fangjun Kuang
552a267c23
Set is_final and start_time for online websocket server. ( #342 )
* Set is_final and start_time for online websocket server.
* Convert timestamps to a json array
2023-09-25 15:12:07 +08:00
poor1017
c2518a5826
Supports cmake compilation compatible with v3.13. ( #340 )
Co-authored-by: chenyu <cheny65@chinatelecom.cn>
2023-09-25 11:48:55 +08:00
dym21
fef61080de
Added #include <cstdint> to fix gcc 13.2 compilation error. ( #339 )
2023-09-25 10:38:26 +08:00
Fangjun Kuang
f91c49a7ba
Provide pre-built wheels for 32-bit arm (e.g., 32-bit Raspberry Pi) ( #338 )
2023-09-25 09:33:43 +08:00
Fangjun Kuang
7649bd862c
Fix building APKs ( #337 )
2023-09-24 14:16:14 +08:00
Fangjun Kuang
4ca60e2603
Add VAD + Non-streaming ASR model for Android ( #336 )
2023-09-23 22:48:38 +08:00
Fangjun Kuang
6e60a77d89
Add Android APK for Silero VAD ( #335 )
2023-09-23 20:39:13 +08:00
Fangjun Kuang
65ec4dc741
Update alternative URLs for onnxruntime 1.16.0 ( #334 )
2023-09-22 18:50:52 +08:00
Fangjun Kuang
43b2b7760d
Fix tokens processing for byte-level BPE ( #333 )
2023-09-22 13:28:19 +08:00
Fangjun Kuang
969fff5622
Add VAD + Non-streaming ASR Python example. ( #332 )
2023-09-22 11:53:47 +08:00
Fangjun Kuang
cf199ad466
Support onnxruntime 1.16.0 ( #330 )
2023-09-21 20:39:24 +08:00
zr_jin
b640c295b9
Swift API for hotwords support ( #331 )
2023-09-21 20:32:13 +08:00
Fangjun Kuang
532ed142d2
Support linking onnxruntime lib statically on Linux ( #326 )
2023-09-21 10:15:42 +08:00
Fangjun Kuang
f5c060dd61
Reduce whisper decoder file size with onnx export ( #328 )
2023-09-20 19:33:26 +08:00
Fangjun Kuang
6afa9c85f6
Fix tokens for byte-level BPE token. ( #324 )
2023-09-20 07:49:53 +08:00
keanu
bd173b27cc
Offline decode support multi threads ( #306 )
Co-authored-by: cuidongcai1035 <cuidongcai1035@wezhuiyi.com>
2023-09-19 21:04:13 +08:00
Nick Fisher
b3e9986825
Add CreateOnlineStreamWithHotwords to C API ( #323 )
* add default visibility to SHERPA_ONNX_EXPORT
* expose CreateOnlineStreamWithHotwords method via C API
Co-authored-by: Nick Fisher <nick.fisher@polyvox.app>
2023-09-19 17:32:42 +08:00
Wei Kang
d7eab95439
Add java api for hotwords ( #319 )
* Add java api
* support websocket
* Fix kotlin
2023-09-18 22:44:29 +08:00
Wei Kang
4dfc11066a
Convert line endings to Linux format (\r\n -> \n) ( #320 )
2023-09-18 16:57:12 +08:00
Wei Kang
a5d1c90807
Support c-api ( #317 )
2023-09-18 16:24:57 +08:00
Fangjun Kuang
bd18e073f4
Fix building MFC examples ( #321 )
2023-09-18 16:04:07 +08:00
Fangjun Kuang
692a47dd80
Add Swift example for generating subtitles ( #318 )
2023-09-18 15:16:54 +08:00
Fangjun Kuang
2d51ca49b7
Generate subtitles ( #315 )
2023-09-18 10:44:06 +08:00
Peng He
5ca0ff8811
Fix LogAdd ( #316 )
Because 0 is used as the initial value, the addition should not be performed when both values are 0
2023-09-18 10:43:04 +08:00
Fangjun Kuang
c471423125
Add Silero VAD ( #313 )
2023-09-17 14:54:38 +08:00
Fangjun Kuang
3a20e332bf
Fix symlink for Windows ( #312 )
2023-09-15 16:03:09 +08:00
Fangjun Kuang
1e95fbf15f
release v1.7.14 ( #311 )
2023-09-14 19:35:29 +08:00
Fangjun Kuang
e2be532b32
Add timestamps for offline paraformer ( #310 )
2023-09-14 19:33:41 +08:00
Wei Kang
47184f9db7
Refactor hotwords, support loading hotwords from file ( #296 )
2023-09-14 19:33:17 +08:00