Commit Graph

110 Commits

Author SHA1 Message Date
Fangjun Kuang
43af1e6951 Release v1.9.15 (#719) 2024-03-29 19:58:04 +08:00
Fangjun Kuang
4e040c596e Support including TTS conditionally. (#699) 2024-03-26 17:21:35 +08:00
Fangjun Kuang
0d258dd150 Support spoken language identification with whisper (#694) 2024-03-24 22:57:00 +08:00
Fangjun Kuang
3cdad9b5d1 Use manylinux in CI test (#692) 2024-03-24 07:54:32 +08:00
Lovemefan
009ed2cd30 add WebAssembly for Kws (#648) 2024-03-11 21:02:31 +08:00
Fangjun Kuang
a628002d8f Release v1.9.12 (#661) 2024-03-11 18:52:34 +08:00
Fangjun Kuang
d3287f9494 Add Python ASR examples with alsa (#646) 2024-03-08 11:34:48 +08:00
Fangjun Kuang
bdf9243940 Allow to not use pre-installed onnxruntime libs. (#636) 2024-03-06 14:40:23 +08:00
Fangjun Kuang
ed06ced16f Add WebAssembly for NodeJS. (#628) 2024-03-03 20:00:36 +08:00
Fangjun Kuang
8b7928e7d6 Fix computing features for whisper. (#617) 2024-02-29 16:56:29 +08:00
Fangjun Kuang
16ba7e274a Add WebAssembly for ASR (#604) 2024-02-23 17:39:11 +08:00
Fangjun Kuang
7c4b59932a Refactor WebAssembly build script. (#598)
Make it easier to build WebAssembly for ASR.
2024-02-21 16:51:15 +08:00
Fangjun Kuang
d2cc48ded5 Add more Chinese TTS models (Mandarin and Cantonese) (#589) 2024-02-20 15:05:35 +08:00
Fangjun Kuang
5f075d0fce Support MinSizeRel and RelWithDebInfo build on Windows. (#586) 2024-02-20 10:22:02 +08:00
Fangjun Kuang
d771762868 Support WebAssembly for text-to-speech (#577) 2024-02-08 23:39:12 +08:00
Fangjun Kuang
fa2af5dc69 Add TTS demo for C# API (#557) 2024-01-28 23:29:39 +08:00
Fangjun Kuang
7ae73e75ba Run TTS engine service without starting the app. (#553) 2024-01-26 22:28:21 +08:00
Fangjun Kuang
bbd7c7fc18 Add Android demo for speaker recognition (#536)
See pre-built Android APKs at 
https://k2-fsa.github.io/sherpa/onnx/speaker-identification/apk.html
2024-01-23 16:50:52 +08:00
Fangjun Kuang
581eceb4d5 Build text-to-speech engine APKs (#509) 2024-01-01 12:44:20 +08:00
Fangjun Kuang
e475e750ac Support streaming zipformer CTC (#496)
* Support streaming zipformer CTC

* test online zipformer2 CTC

* Update doc of sherpa-onnx.cc

* Add Python APIs for streaming zipformer2 ctc

* Add Python API examples for streaming zipformer2 ctc

* Swift API for streaming zipformer2 CTC

* NodeJS API for streaming zipformer2 CTC

* Kotlin API for streaming zipformer2 CTC

* Golang API for streaming zipformer2 CTC

* C# API for streaming zipformer2 CTC

* Release v1.9.6
2023-12-22 13:46:33 +08:00
Fangjun Kuang
b18812ceff Play generated audio using alsa for TTS (#482) 2023-12-13 22:28:03 +08:00
Fangjun Kuang
cae0231f93 Fix releasing go packages (#476) 2023-12-09 00:07:52 +08:00
Fangjun Kuang
868c339e5e Support distil-small.en whisper (#472) 2023-12-08 11:59:20 +08:00
Fangjun Kuang
a7d69359c9 Release v1.9.0 (#470) 2023-12-06 19:46:50 +08:00
Fangjun Kuang
db41778e99 Support piper-phonemize (#452) 2023-11-28 19:12:58 +08:00
Fangjun Kuang
fe977b8e8e support nodejs (#438) 2023-11-21 23:20:08 +08:00
Fangjun Kuang
049fb9f451 Add Python APIs for WeNet CTC models (#428) 2023-11-16 14:20:41 +08:00
Fangjun Kuang
61341b7187 Support VITS TTS models from coqui-ai/TTS (#416)
* Support VITS TTS models from coqui-ai/TTS

* release v1.8.9
2023-11-10 16:24:11 +08:00
Fangjun Kuang
ab0e830bee Release v1.8.8 (#414) 2023-11-07 15:58:23 +08:00
Fangjun Kuang
d1a450bf82 Support text normalization via rule FST (#407) 2023-11-05 08:59:03 +08:00
Fangjun Kuang
cca744e34e Update to onnxruntime v1.16.1 (#406) 2023-11-01 16:23:31 +08:00
Fangjun Kuang
157628b257 Support French in TTS (#397) 2023-10-28 22:22:00 +08:00
Fangjun Kuang
93ef4ee4bc Release v1.8.6 (#391) 2023-10-26 14:53:09 +08:00
Fangjun Kuang
6e5efa48c5 Fix splitting utf8 string into words (#385) 2023-10-25 11:49:27 +08:00
Fangjun Kuang
a69d0a950e Add Go API for TTS (#377) 2023-10-20 15:57:52 +08:00
Fangjun Kuang
33ad3fa610 Release v1.8.3 (#370) 2023-10-18 16:50:47 +08:00
Fangjun Kuang
8545c3b7f0 Validate input sid (#369) 2023-10-18 14:02:01 +08:00
Fangjun Kuang
1ee79e3ff5 Support Chinese vits models (#368) 2023-10-18 10:19:10 +08:00
Fangjun Kuang
9efe69720d Support VITS VCTK models (#367)
* Support VITS VCTK models

* Release v1.8.1
2023-10-16 17:22:30 +08:00
Fangjun Kuang
655e0fa836 add python API and examples for TTS (#364) 2023-10-14 14:21:53 +08:00
Fangjun Kuang
efd3cd3312 Fix pkg-config for linux (#358) 2023-10-12 16:21:06 +08:00
Fangjun Kuang
407602445d Add CTC HLG decoding using OpenFst (#349) 2023-10-08 11:32:39 +08:00
Fangjun Kuang
552a267c23 Set is_final and start_time for online websocket server. (#342)
* Set is_final and start_time for online websocket server.

* Convert timestamps to a json array
2023-09-25 15:12:07 +08:00
poor1017
c2518a5826 Supports cmake compilation compatible with v3.13. (#340)
Co-authored-by: chenyu <cheny65@chinatelecom.cn>
2023-09-25 11:48:55 +08:00
Fangjun Kuang
7649bd862c Fix building APKs (#337) 2023-09-24 14:16:14 +08:00
Fangjun Kuang
4ca60e2603 Add VAD + Non-streaming ASR model for Android (#336) 2023-09-23 22:48:38 +08:00
Fangjun Kuang
6e60a77d89 Add Android APK for Silero VAD (#335) 2023-09-23 20:39:13 +08:00
Fangjun Kuang
43b2b7760d Fix tokens processing for byte-level BPE (#333) 2023-09-22 13:28:19 +08:00
Fangjun Kuang
cf199ad466 Support onnxruntime 1.16.0 (#330) 2023-09-21 20:39:24 +08:00
Fangjun Kuang
532ed142d2 Support linking onnxruntime lib statically on Linux (#326) 2023-09-21 10:15:42 +08:00