Commit Graph

206 Commits

Author SHA1 Message Date
Fangjun Kuang
66cad9fa93 Fix reading tokens.txt on Windows (#448) 2023-11-25 14:22:26 +08:00
HieDean
2a91524dbf Lock before push_back the deque for thread safety (#445)
Co-authored-by: hiedean <hiedean@tju.edu.cn>
2023-11-24 10:23:25 +08:00
Fangjun Kuang
fe977b8e8e support nodejs (#438) 2023-11-21 23:20:08 +08:00
HieDean
e6a2d0da3b Replace Clone() with View() (#432)
Co-authored-by: hiedean <hiedean@tju.edu.cn>
2023-11-20 09:20:50 +08:00
HieDean
1a6a41eb2c Judge before UseCachedDecoderOut (#431)
Co-authored-by: hiedean <hiedean@tju.edu.cn>
2023-11-17 12:07:47 +08:00
Fangjun Kuang
fac4f6bc7c Support streaming conformer CTC models from wenet (#427) 2023-11-16 10:35:23 +08:00
Fangjun Kuang
b83b3e3cd1 Support non-streaming WeNet CTC models. (#426) 2023-11-15 14:23:20 +08:00
Fangjun Kuang
097d641869 Resize circular buffer on overflow (#422) 2023-11-13 12:07:51 +08:00
Fangjun Kuang
68f0e59688 Add a C++ example to show streaming VAD + non-streaming ASR. (#420) 2023-11-11 22:54:27 +08:00
Fangjun Kuang
47947ffae9 Fix punctuations in tts (#417) 2023-11-10 17:09:48 +08:00
Fangjun Kuang
61341b7187 Support VITS TTS models from coqui-ai/TTS (#416)
* Support VITS TTS models from coqui-ai/TTS

* release v1.8.9
2023-11-10 16:24:11 +08:00
Fangjun Kuang
86baf43c6b support reading rule FST for Android TTS (#410) 2023-11-06 10:38:40 +08:00
Fangjun Kuang
723e5265bb Support Chinese polyphones in TTS (#409) 2023-11-05 13:06:00 +08:00
Fangjun Kuang
606cb26a62 Catch exception from whisper (#408) 2023-11-05 11:10:24 +08:00
Fangjun Kuang
d1a450bf82 Support text normalization via rule FST (#407) 2023-11-05 08:59:03 +08:00
Fangjun Kuang
b80b7e5144 Support linking onnxruntime statically for macOS (#403) 2023-10-31 20:24:43 +08:00
Fangjun Kuang
fabbc70633 Support static linking onnxruntime for 64-bit ARM (#402) 2023-10-31 16:51:04 +08:00
Fangjun Kuang
2f2d3bbd82 Support static linking onnxruntime lib for 32-bit arm (#401) 2023-10-31 11:19:01 +08:00
Fangjun Kuang
157628b257 Support French in TTS (#397) 2023-10-28 22:22:00 +08:00
Fangjun Kuang
64ab1ea9f8 Support Spanish in TTS (#396) 2023-10-28 11:09:34 +08:00
Fangjun Kuang
69e985f701 Support German umlauts in splitting UTF8 strings. (#395) 2023-10-27 16:11:38 +08:00
Fangjun Kuang
fbf4c903e1 Support German TTS (#394) 2023-10-27 11:12:45 +08:00
Fangjun Kuang
44512858d6 Support vits models from piper (#390) 2023-10-26 14:10:24 +08:00
Fangjun Kuang
a8fed2a9ce Fix splitting words containing ', e.g., I've (#389) 2023-10-26 13:07:30 +08:00
Peter Ross
fcde4c4944 include cstdint (debian, gcc-13.2) (#388) 2023-10-26 08:10:48 +08:00
Fangjun Kuang
29a5d06691 Fix utf8 spliting for English (#386) 2023-10-25 14:55:27 +08:00
Fangjun Kuang
6e5efa48c5 Fix splitting utf8 string into words (#385) 2023-10-25 11:49:27 +08:00
Fangjun Kuang
0fdb2044e1 Add jni interface and kotlin API examples for TTS. (#381) 2023-10-23 12:31:54 +08:00
Fangjun Kuang
1937717705 Add MFC TTS example on Windows (#378) 2023-10-21 00:13:07 +08:00
Fangjun Kuang
3ba9a4932f Support printing input text and words after splitting (#376) 2023-10-20 12:06:30 +08:00
Fangjun Kuang
ea7c45b60c Add C API for offline TTS. (#373) 2023-10-19 17:38:23 +08:00
Fangjun Kuang
eead16e27f Fix CI for pip install (#371) 2023-10-19 10:43:14 +08:00
Fangjun Kuang
8545c3b7f0 Validate input sid (#369) 2023-10-18 14:02:01 +08:00
Fangjun Kuang
1ee79e3ff5 Support Chinese vits models (#368) 2023-10-18 10:19:10 +08:00
Fangjun Kuang
9efe69720d Support VITS VCTK models (#367)
* Support VITS VCTK models

* Release v1.8.1
2023-10-16 17:22:30 +08:00
yujinqiu
d01682d968 Add vad clear api for better performance (#366)
* Add vad clear api for better performance

* rename to make naming consistent and remove macro

* Fix linker error

* Fix Vad.kt
2023-10-16 14:40:47 +08:00
Fangjun Kuang
655e0fa836 add python API and examples for TTS (#364) 2023-10-14 14:21:53 +08:00
Fangjun Kuang
1ac2232e14 Support writing generated audio samples to wave files (#363) 2023-10-13 23:36:03 +08:00
Fangjun Kuang
536d5804ba Add TTS with VITS (#360) 2023-10-13 19:30:38 +08:00
Peng He
4771c9275c Add lm decode for the Python API. (#353)
* Add lm decode for the Python API.

* fix style.

* Fix LogAdd,

	Shouldn't double lm_log_prob when merge same prefix path

* sort the import alphabetically
2023-10-13 11:15:16 +08:00
Fangjun Kuang
323f532ad2 Fix symbol table for byte bpe (#361) 2023-10-13 10:51:59 +08:00
Fangjun Kuang
98b67ad850 Fix reading hotwords file for android (#354) 2023-10-11 12:20:50 +08:00
Fangjun Kuang
be081017de Fix typos/bugs (#351) 2023-10-08 11:39:59 +08:00
Fangjun Kuang
407602445d Add CTC HLG decoding using OpenFst (#349) 2023-10-08 11:32:39 +08:00
Nickolay V. Shmyrev
c12286fe5e Proper convolution mode for fast GPU processing (#350) 2023-10-07 20:24:57 +08:00
Fangjun Kuang
33a5765169 Print a more user-friendly error message when using --hotwords-file. (#344) 2023-09-26 11:04:20 +08:00
Fangjun Kuang
552a267c23 Set is_final and start_time for online websocket server. (#342)
* Set is_final and start_time for online websocket server.

* Convert timestamps to a json array
2023-09-25 15:12:07 +08:00
poor1017
c2518a5826 Supports cmake compilation compatible with v3.13. (#340)
Co-authored-by: chenyu <cheny65@chinatelecom.cn>
2023-09-25 11:48:55 +08:00
dym21
fef61080de Added #include <cstdint> to fix gcc 13.2 compilation error. (#339) 2023-09-25 10:38:26 +08:00
Fangjun Kuang
6e60a77d89 Add Android APK for Silero VAD (#335) 2023-09-23 20:39:13 +08:00