Fangjun Kuang
|
0e23f82691
|
Give an informative log for whisper on exceptions. (#473)
|
2023-12-08 14:33:59 +08:00 |
|
Fangjun Kuang
|
868c339e5e
|
Support distil-small.en whisper (#472)
|
2023-12-08 11:59:20 +08:00 |
|
Fangjun Kuang
|
3ae984f148
|
Remove the 30-second constraint from whisper. (#471)
|
2023-12-07 17:47:08 +08:00 |
|
Fangjun Kuang
|
d34161413d
|
Support Ukrainian VITS models from coqui-ai/TTS (#469)
|
2023-12-06 19:37:11 +08:00 |
|
Fangjun Kuang
|
23cf92daf7
|
Use espeak-ng for coqui-ai/TTS VITS English models. (#466)
|
2023-12-06 11:00:38 +08:00 |
|
Fangjun Kuang
|
86b4be5260
|
Break text into sentences for tts. (#460)
This is for models that are not using piper-phonemize as their front-end.
|
2023-12-03 11:50:25 +08:00 |
|
Fangjun Kuang
|
99ff6a834c
|
Play generated audio as it is generating. (#457)
|
2023-12-02 15:35:11 +08:00 |
|
Fangjun Kuang
|
539b27e575
|
Fix CI (#456)
|
2023-12-01 11:00:16 +08:00 |
|
Fangjun Kuang
|
62dc3c3e46
|
Use piper-phonemize to convert text to token IDs (#453)
|
2023-11-30 23:57:43 +08:00 |
|
Fangjun Kuang
|
db41778e99
|
Support piper-phonemize (#452)
|
2023-11-28 19:12:58 +08:00 |
|
Fangjun Kuang
|
66cad9fa93
|
Fix reading tokens.txt on Windows (#448)
|
2023-11-25 14:22:26 +08:00 |
|
HieDean
|
2a91524dbf
|
Lock before push_back the deque for thread safety (#445)
Co-authored-by: hiedean <hiedean@tju.edu.cn>
|
2023-11-24 10:23:25 +08:00 |
|
Fangjun Kuang
|
fe977b8e8e
|
support nodejs (#438)
|
2023-11-21 23:20:08 +08:00 |
|
HieDean
|
e6a2d0da3b
|
Replace Clone() with View() (#432)
Co-authored-by: hiedean <hiedean@tju.edu.cn>
|
2023-11-20 09:20:50 +08:00 |
|
HieDean
|
1a6a41eb2c
|
Judge before UseCachedDecoderOut (#431)
Co-authored-by: hiedean <hiedean@tju.edu.cn>
|
2023-11-17 12:07:47 +08:00 |
|
Fangjun Kuang
|
fac4f6bc7c
|
Support streaming conformer CTC models from wenet (#427)
|
2023-11-16 10:35:23 +08:00 |
|
Fangjun Kuang
|
b83b3e3cd1
|
Support non-streaming WeNet CTC models. (#426)
|
2023-11-15 14:23:20 +08:00 |
|
Fangjun Kuang
|
097d641869
|
Resize circular buffer on overflow (#422)
|
2023-11-13 12:07:51 +08:00 |
|
Fangjun Kuang
|
68f0e59688
|
Add a C++ example to show streaming VAD + non-streaming ASR. (#420)
|
2023-11-11 22:54:27 +08:00 |
|
Fangjun Kuang
|
47947ffae9
|
Fix punctuations in tts (#417)
|
2023-11-10 17:09:48 +08:00 |
|
Fangjun Kuang
|
61341b7187
|
Support VITS TTS models from coqui-ai/TTS (#416)
* Support VITS TTS models from coqui-ai/TTS
* release v1.8.9
|
2023-11-10 16:24:11 +08:00 |
|
Fangjun Kuang
|
86baf43c6b
|
support reading rule FST for Android TTS (#410)
|
2023-11-06 10:38:40 +08:00 |
|
Fangjun Kuang
|
723e5265bb
|
Support Chinese polyphones in TTS (#409)
|
2023-11-05 13:06:00 +08:00 |
|
Fangjun Kuang
|
606cb26a62
|
Catch exception from whisper (#408)
|
2023-11-05 11:10:24 +08:00 |
|
Fangjun Kuang
|
d1a450bf82
|
Support text normalization via rule FST (#407)
|
2023-11-05 08:59:03 +08:00 |
|
Fangjun Kuang
|
b80b7e5144
|
Support linking onnxruntime statically for macOS (#403)
|
2023-10-31 20:24:43 +08:00 |
|
Fangjun Kuang
|
fabbc70633
|
Support static linking onnxruntime for 64-bit ARM (#402)
|
2023-10-31 16:51:04 +08:00 |
|
Fangjun Kuang
|
2f2d3bbd82
|
Support static linking onnxruntime lib for 32-bit arm (#401)
|
2023-10-31 11:19:01 +08:00 |
|
Fangjun Kuang
|
157628b257
|
Support French in TTS (#397)
|
2023-10-28 22:22:00 +08:00 |
|
Fangjun Kuang
|
64ab1ea9f8
|
Support Spanish in TTS (#396)
|
2023-10-28 11:09:34 +08:00 |
|
Fangjun Kuang
|
69e985f701
|
Support German umlauts in splitting UTF8 strings. (#395)
|
2023-10-27 16:11:38 +08:00 |
|
Fangjun Kuang
|
fbf4c903e1
|
Support German TTS (#394)
|
2023-10-27 11:12:45 +08:00 |
|
Fangjun Kuang
|
44512858d6
|
Support vits models from piper (#390)
|
2023-10-26 14:10:24 +08:00 |
|
Fangjun Kuang
|
a8fed2a9ce
|
Fix splitting words containing ', e.g., I've (#389)
|
2023-10-26 13:07:30 +08:00 |
|
Peter Ross
|
fcde4c4944
|
include cstdint (debian, gcc-13.2) (#388)
|
2023-10-26 08:10:48 +08:00 |
|
Fangjun Kuang
|
29a5d06691
|
Fix utf8 spliting for English (#386)
|
2023-10-25 14:55:27 +08:00 |
|
Fangjun Kuang
|
6e5efa48c5
|
Fix splitting utf8 string into words (#385)
|
2023-10-25 11:49:27 +08:00 |
|
Fangjun Kuang
|
0fdb2044e1
|
Add jni interface and kotlin API examples for TTS. (#381)
|
2023-10-23 12:31:54 +08:00 |
|
Fangjun Kuang
|
1937717705
|
Add MFC TTS example on Windows (#378)
|
2023-10-21 00:13:07 +08:00 |
|
Fangjun Kuang
|
3ba9a4932f
|
Support printing input text and words after splitting (#376)
|
2023-10-20 12:06:30 +08:00 |
|
Fangjun Kuang
|
ea7c45b60c
|
Add C API for offline TTS. (#373)
|
2023-10-19 17:38:23 +08:00 |
|
Fangjun Kuang
|
eead16e27f
|
Fix CI for pip install (#371)
|
2023-10-19 10:43:14 +08:00 |
|
Fangjun Kuang
|
8545c3b7f0
|
Validate input sid (#369)
|
2023-10-18 14:02:01 +08:00 |
|
Fangjun Kuang
|
1ee79e3ff5
|
Support Chinese vits models (#368)
|
2023-10-18 10:19:10 +08:00 |
|
Fangjun Kuang
|
9efe69720d
|
Support VITS VCTK models (#367)
* Support VITS VCTK models
* Release v1.8.1
|
2023-10-16 17:22:30 +08:00 |
|
yujinqiu
|
d01682d968
|
Add vad clear api for better performance (#366)
* Add vad clear api for better performance
* rename to make naming consistent and remove macro
* Fix linker error
* Fix Vad.kt
|
2023-10-16 14:40:47 +08:00 |
|
Fangjun Kuang
|
655e0fa836
|
add python API and examples for TTS (#364)
|
2023-10-14 14:21:53 +08:00 |
|
Fangjun Kuang
|
1ac2232e14
|
Support writing generated audio samples to wave files (#363)
|
2023-10-13 23:36:03 +08:00 |
|
Fangjun Kuang
|
536d5804ba
|
Add TTS with VITS (#360)
|
2023-10-13 19:30:38 +08:00 |
|
Peng He
|
4771c9275c
|
Add lm decode for the Python API. (#353)
* Add lm decode for the Python API.
* fix style.
* Fix LogAdd,
Shouldn't double lm_log_prob when merge same prefix path
* sort the import alphabetically
|
2023-10-13 11:15:16 +08:00 |
|