Fangjun Kuang
8b7928e7d6
Fix computing features for whisper. ( #617 )
2024-02-29 16:56:29 +08:00
Fangjun Kuang
16ba7e274a
Add WebAssembly for ASR ( #604 )
2024-02-23 17:39:11 +08:00
Fangjun Kuang
7c4b59932a
Refactor WebAssembly build script. ( #598 )
...
Make it easier to build WebAssembly for ASR.
2024-02-21 16:51:15 +08:00
Fangjun Kuang
d2cc48ded5
Add more Chinese TTS models (Mandarin and Cantonese) ( #589 )
2024-02-20 15:05:35 +08:00
Fangjun Kuang
5f075d0fce
Support MinSizeRel and RelWithDebInfo build on Windows. ( #586 )
2024-02-20 10:22:02 +08:00
Fangjun Kuang
d771762868
Support WebAssembly for text-to-speech ( #577 )
2024-02-08 23:39:12 +08:00
Fangjun Kuang
fa2af5dc69
Add TTS demo for C# API ( #557 )
2024-01-28 23:29:39 +08:00
Fangjun Kuang
7ae73e75ba
Run TTS engine service without starting the app. ( #553 )
2024-01-26 22:28:21 +08:00
Fangjun Kuang
bbd7c7fc18
Add Android demo for speaker recognition ( #536 )
...
See pre-built Android APKs at
https://k2-fsa.github.io/sherpa/onnx/speaker-identification/apk.html
2024-01-23 16:50:52 +08:00
Fangjun Kuang
581eceb4d5
Build text-to-speech engine APKs ( #509 )
2024-01-01 12:44:20 +08:00
Fangjun Kuang
e475e750ac
Support streaming zipformer CTC ( #496 )
...
* Support streaming zipformer CTC
* test online zipformer2 CTC
* Update doc of sherpa-onnx.cc
* Add Python APIs for streaming zipformer2 ctc
* Add Python API examples for streaming zipformer2 ctc
* Swift API for streaming zipformer2 CTC
* NodeJS API for streaming zipformer2 CTC
* Kotlin API for streaming zipformer2 CTC
* Golang API for streaming zipformer2 CTC
* C# API for streaming zipformer2 CTC
* Release v1.9.6
2023-12-22 13:46:33 +08:00
Fangjun Kuang
b18812ceff
Play generated audio using alsa for TTS ( #482 )
2023-12-13 22:28:03 +08:00
Fangjun Kuang
cae0231f93
Fix releasing go packages ( #476 )
2023-12-09 00:07:52 +08:00
Fangjun Kuang
868c339e5e
Support distil-small.en whisper ( #472 )
2023-12-08 11:59:20 +08:00
Fangjun Kuang
a7d69359c9
Release v1.9.0 ( #470 )
2023-12-06 19:46:50 +08:00
Fangjun Kuang
db41778e99
Support piper-phonemize ( #452 )
2023-11-28 19:12:58 +08:00
Fangjun Kuang
fe977b8e8e
support nodejs ( #438 )
2023-11-21 23:20:08 +08:00
Fangjun Kuang
049fb9f451
Add Python APIs for WeNet CTC models ( #428 )
2023-11-16 14:20:41 +08:00
Fangjun Kuang
61341b7187
Support VITS TTS models from coqui-ai/TTS ( #416 )
...
* Support VITS TTS models from coqui-ai/TTS
* release v1.8.9
2023-11-10 16:24:11 +08:00
Fangjun Kuang
ab0e830bee
Release v1.8.8 ( #414 )
2023-11-07 15:58:23 +08:00
Fangjun Kuang
d1a450bf82
Support text normalization via rule FST ( #407 )
2023-11-05 08:59:03 +08:00
Fangjun Kuang
cca744e34e
Update to onnxruntime v1.16.1 ( #406 )
2023-11-01 16:23:31 +08:00
Fangjun Kuang
157628b257
Support French in TTS ( #397 )
2023-10-28 22:22:00 +08:00
Fangjun Kuang
93ef4ee4bc
Release v1.8.6 ( #391 )
2023-10-26 14:53:09 +08:00
Fangjun Kuang
6e5efa48c5
Fix splitting utf8 string into words ( #385 )
2023-10-25 11:49:27 +08:00
Fangjun Kuang
a69d0a950e
Add Go API for TTS ( #377 )
2023-10-20 15:57:52 +08:00
Fangjun Kuang
33ad3fa610
Release v1.8.3 ( #370 )
2023-10-18 16:50:47 +08:00
Fangjun Kuang
8545c3b7f0
Validate input sid ( #369 )
2023-10-18 14:02:01 +08:00
Fangjun Kuang
1ee79e3ff5
Support Chinese vits models ( #368 )
2023-10-18 10:19:10 +08:00
Fangjun Kuang
9efe69720d
Support VITS VCTK models ( #367 )
...
* Support VITS VCTK models
* Release v1.8.1
2023-10-16 17:22:30 +08:00
Fangjun Kuang
655e0fa836
add python API and examples for TTS ( #364 )
2023-10-14 14:21:53 +08:00
Fangjun Kuang
efd3cd3312
Fix pkg-config for linux ( #358 )
2023-10-12 16:21:06 +08:00
Fangjun Kuang
407602445d
Add CTC HLG decoding using OpenFst ( #349 )
2023-10-08 11:32:39 +08:00
Fangjun Kuang
552a267c23
Set is_final and start_time for online websocket server. ( #342 )
...
* Set is_final and start_time for online websocket server.
* Convert timestamps to a json array
2023-09-25 15:12:07 +08:00
poor1017
c2518a5826
Supports cmake compilation compatible with v3.13. ( #340 )
...
Co-authored-by: chenyu <cheny65@chinatelecom.cn >
2023-09-25 11:48:55 +08:00
Fangjun Kuang
7649bd862c
Fix building APKs ( #337 )
2023-09-24 14:16:14 +08:00
Fangjun Kuang
4ca60e2603
Add VAD + Non-streaming ASR model for Android ( #336 )
2023-09-23 22:48:38 +08:00
Fangjun Kuang
6e60a77d89
Add Android APK for Silero VAD ( #335 )
2023-09-23 20:39:13 +08:00
Fangjun Kuang
43b2b7760d
Fix tokens processing for byte-level BPE ( #333 )
2023-09-22 13:28:19 +08:00
Fangjun Kuang
cf199ad466
Support onnxruntime 1.16.0 ( #330 )
2023-09-21 20:39:24 +08:00
Fangjun Kuang
532ed142d2
Support linking onnxruntime lib statically on Linux ( #326 )
2023-09-21 10:15:42 +08:00
Fangjun Kuang
6afa9c85f6
Fix tokens for byte-level BPE token. ( #324 )
2023-09-20 07:49:53 +08:00
Fangjun Kuang
c471423125
Add Silero VAD ( #313 )
2023-09-17 14:54:38 +08:00
Fangjun Kuang
1e95fbf15f
release v1.7.14 ( #311 )
2023-09-14 19:35:29 +08:00
Fangjun Kuang
087367d7fe
Build APKs for two-pass speech recognition ( #308 )
2023-09-12 18:18:24 +08:00
Fangjun Kuang
ffeff3b8a3
Fix a typo for Go ( #298 )
2023-09-07 11:29:00 +08:00
Fangjun Kuang
49ec7e8f57
Fix building APK ( #286 )
2023-08-18 15:28:06 +08:00
Fangjun Kuang
e955f9c752
Release pre-built APKs ( #285 )
2023-08-18 14:28:44 +08:00
Fangjun Kuang
eb22b4845a
Fix a bug for multilingual ASR ( #281 )
2023-08-17 10:43:26 +08:00
Fangjun Kuang
e31f9e48c2
Fix various language binding APIs for tdnn and whisper models ( #278 )
2023-08-16 22:15:10 +08:00