Fangjun Kuang
55266918c8
Add runtime support for wespeaker models ( #516 )
2024-01-09 22:06:08 +08:00
Fangjun Kuang
0be71a31f5
Use high_freq -400 in computing fbank features. ( #515 )
...
Fixes #514
2024-01-04 12:39:06 +08:00
Fangjun Kuang
e215d0c39a
Fix Byte BPE string results for Python. ( #512 )
...
It ignores invalid UTF8 strings.
2024-01-03 16:03:24 +08:00
Fangjun Kuang
d7e10bb3f8
Replace Android system TTS engine ( #508 )
2023-12-31 23:02:35 +08:00
Fangjun Kuang
e475e750ac
Support streaming zipformer CTC ( #496 )
...
* Support streaming zipformer CTC
* test online zipformer2 CTC
* Update doc of sherpa-onnx.cc
* Add Python APIs for streaming zipformer2 ctc
* Add Python API examples for streaming zipformer2 ctc
* Swift API for streaming zipformer2 CTC
* NodeJS API for streaming zipformer2 CTC
* Kotlin API for streaming zipformer2 CTC
* Golang API for streaming zipformer2 CTC
* C# API for streaming zipformer2 CTC
* Release v1.9.6
2023-12-22 13:46:33 +08:00
Fangjun Kuang
03ff9db56e
Keep multiple threads from calling into espeak-ng at the same time ( #489 )
2023-12-15 17:44:33 +08:00
Fangjun Kuang
ad72e7afc3
Print informative error messages for sherpa-onnx-alsa on errors. ( #486 )
2023-12-15 11:10:39 +08:00
Fangjun Kuang
b18812ceff
Play generated audio using alsa for TTS ( #482 )
2023-12-13 22:28:03 +08:00
Fangjun Kuang
0e23f82691
Give an informative log for whisper on exceptions. ( #473 )
2023-12-08 14:33:59 +08:00
Fangjun Kuang
868c339e5e
Support distil-small.en whisper ( #472 )
2023-12-08 11:59:20 +08:00
Fangjun Kuang
3ae984f148
Remove the 30-second constraint from whisper. ( #471 )
2023-12-07 17:47:08 +08:00
Fangjun Kuang
d34161413d
Support Ukrainian VITS models from coqui-ai/TTS ( #469 )
2023-12-06 19:37:11 +08:00
Fangjun Kuang
23cf92daf7
Use espeak-ng for coqui-ai/TTS VITS English models. ( #466 )
2023-12-06 11:00:38 +08:00
Fangjun Kuang
86b4be5260
Break text into sentences for tts. ( #460 )
...
This is for models that are not using piper-phonemize as their front-end.
2023-12-03 11:50:25 +08:00
Fangjun Kuang
99ff6a834c
Play generated audio as it is generating. ( #457 )
2023-12-02 15:35:11 +08:00
Fangjun Kuang
539b27e575
Fix CI ( #456 )
2023-12-01 11:00:16 +08:00
Fangjun Kuang
62dc3c3e46
Use piper-phonemize to convert text to token IDs ( #453 )
2023-11-30 23:57:43 +08:00
Fangjun Kuang
db41778e99
Support piper-phonemize ( #452 )
2023-11-28 19:12:58 +08:00
Fangjun Kuang
66cad9fa93
Fix reading tokens.txt on Windows ( #448 )
2023-11-25 14:22:26 +08:00
HieDean
2a91524dbf
Lock before push_back the deque for thread safety ( #445 )
...
Co-authored-by: hiedean <hiedean@tju.edu.cn >
2023-11-24 10:23:25 +08:00
Fangjun Kuang
fe977b8e8e
support nodejs ( #438 )
2023-11-21 23:20:08 +08:00
HieDean
e6a2d0da3b
Replace Clone() with View() ( #432 )
...
Co-authored-by: hiedean <hiedean@tju.edu.cn >
2023-11-20 09:20:50 +08:00
HieDean
1a6a41eb2c
Judge before UseCachedDecoderOut ( #431 )
...
Co-authored-by: hiedean <hiedean@tju.edu.cn >
2023-11-17 12:07:47 +08:00
Fangjun Kuang
fac4f6bc7c
Support streaming conformer CTC models from wenet ( #427 )
2023-11-16 10:35:23 +08:00
Fangjun Kuang
b83b3e3cd1
Support non-streaming WeNet CTC models. ( #426 )
2023-11-15 14:23:20 +08:00
Fangjun Kuang
097d641869
Resize circular buffer on overflow ( #422 )
2023-11-13 12:07:51 +08:00
Fangjun Kuang
68f0e59688
Add a C++ example to show streaming VAD + non-streaming ASR. ( #420 )
2023-11-11 22:54:27 +08:00
Fangjun Kuang
47947ffae9
Fix punctuations in tts ( #417 )
2023-11-10 17:09:48 +08:00
Fangjun Kuang
61341b7187
Support VITS TTS models from coqui-ai/TTS ( #416 )
...
* Support VITS TTS models from coqui-ai/TTS
* release v1.8.9
2023-11-10 16:24:11 +08:00
Fangjun Kuang
86baf43c6b
support reading rule FST for Android TTS ( #410 )
2023-11-06 10:38:40 +08:00
Fangjun Kuang
723e5265bb
Support Chinese polyphones in TTS ( #409 )
2023-11-05 13:06:00 +08:00
Fangjun Kuang
606cb26a62
Catch exception from whisper ( #408 )
2023-11-05 11:10:24 +08:00
Fangjun Kuang
d1a450bf82
Support text normalization via rule FST ( #407 )
2023-11-05 08:59:03 +08:00
Fangjun Kuang
b80b7e5144
Support linking onnxruntime statically for macOS ( #403 )
2023-10-31 20:24:43 +08:00
Fangjun Kuang
fabbc70633
Support static linking onnxruntime for 64-bit ARM ( #402 )
2023-10-31 16:51:04 +08:00
Fangjun Kuang
2f2d3bbd82
Support static linking onnxruntime lib for 32-bit arm ( #401 )
2023-10-31 11:19:01 +08:00
Fangjun Kuang
157628b257
Support French in TTS ( #397 )
2023-10-28 22:22:00 +08:00
Fangjun Kuang
64ab1ea9f8
Support Spanish in TTS ( #396 )
2023-10-28 11:09:34 +08:00
Fangjun Kuang
69e985f701
Support German umlauts in splitting UTF8 strings. ( #395 )
2023-10-27 16:11:38 +08:00
Fangjun Kuang
fbf4c903e1
Support German TTS ( #394 )
2023-10-27 11:12:45 +08:00
Fangjun Kuang
44512858d6
Support vits models from piper ( #390 )
2023-10-26 14:10:24 +08:00
Fangjun Kuang
a8fed2a9ce
Fix splitting words containing ', e.g., I've ( #389 )
2023-10-26 13:07:30 +08:00
Peter Ross
fcde4c4944
include cstdint (debian, gcc-13.2) ( #388 )
2023-10-26 08:10:48 +08:00
Fangjun Kuang
29a5d06691
Fix utf8 spliting for English ( #386 )
2023-10-25 14:55:27 +08:00
Fangjun Kuang
6e5efa48c5
Fix splitting utf8 string into words ( #385 )
2023-10-25 11:49:27 +08:00
Fangjun Kuang
0fdb2044e1
Add jni interface and kotlin API examples for TTS. ( #381 )
2023-10-23 12:31:54 +08:00
Fangjun Kuang
1937717705
Add MFC TTS example on Windows ( #378 )
2023-10-21 00:13:07 +08:00
Fangjun Kuang
3ba9a4932f
Support printing input text and words after splitting ( #376 )
2023-10-20 12:06:30 +08:00
Fangjun Kuang
ea7c45b60c
Add C API for offline TTS. ( #373 )
2023-10-19 17:38:23 +08:00
Fangjun Kuang
eead16e27f
Fix CI for pip install ( #371 )
2023-10-19 10:43:14 +08:00