Fangjun Kuang
d08cc04567
Add VAD example for Dart API ( #996 )
2024-06-14 10:37:16 +08:00
Fangjun Kuang
6c12590d21
Fix CI errors. ( #993 )
2024-06-12 11:42:19 +08:00
Fangjun Kuang
fd5a0d1e00
Add C++ runtime for Tele-AI/TeleSpeech-ASR ( #970 )
2024-06-05 00:26:40 +08:00
Fangjun Kuang
f1cff83ef9
Add address sanitizer and undefined behavior sanitizer ( #951 )
2024-05-31 13:17:01 +08:00
Fangjun Kuang
1e277ae9b7
release v1.9.26 ( #947 )
2024-05-30 19:37:26 +08:00
Fangjun Kuang
4593ab49d1
Add Flutter example for speaker identification ( #894 )
2024-05-21 20:38:52 +08:00
Wei Kang
b012b78ceb
Encode hotwords in C++ side ( #828 )
* Encode hotwords in C++ side
2024-05-20 19:41:36 +08:00
Fangjun Kuang
8af2af8466
Add tail_paddings to Whisper C API. ( #886 )
2024-05-17 09:20:07 +08:00
Fangjun Kuang
677bc1da3e
Add Speaker ID demo for C# ( #862 )
2024-05-11 13:27:33 +08:00
Fangjun Kuang
4f758e6cd3
Publish node-addon-api wrapper for sherpa-onnx as npm packages ( #829 )
2024-05-04 13:27:39 +08:00
Fangjun Kuang
6686c7d3e6
Add dict_dir arg to c api to support Chinese TTS models using jieba ( #809 )
2024-04-25 12:28:31 +08:00
Fangjun Kuang
6b353bfb42
Add jieba for Chinese TTS models ( #797 )
2024-04-21 14:47:13 +08:00
Fangjun Kuang
37831fe89c
Release v1.9.22 ( #794 )
2024-04-19 18:37:47 +08:00
Fangjun Kuang
13730ecbd8
Add C API for punctuation ( #768 )
2024-04-14 19:02:34 +08:00
Fangjun Kuang
983df28a83
Fix a punctuation bug ( #764 )
2024-04-13 19:08:46 +08:00
Fangjun Kuang
b6ad0436fa
Release v1.9.18 ( #763 )
2024-04-13 16:34:15 +08:00
Fangjun Kuang
a5f8fbc83f
Support heteronyms in Chinese TTS ( #738 )
2024-04-08 11:01:30 +08:00
Fangjun Kuang
c1c0f5bafd
return timestamps for WebAssembly ( #737 )
2024-04-05 20:24:27 +08:00
Fangjun Kuang
43af1e6951
Release v1.9.15 ( #719 )
2024-03-29 19:58:04 +08:00
Fangjun Kuang
4e040c596e
Support including TTS conditionally. ( #699 )
2024-03-26 17:21:35 +08:00
Fangjun Kuang
0d258dd150
Support spoken language identification with whisper ( #694 )
2024-03-24 22:57:00 +08:00
Fangjun Kuang
3cdad9b5d1
Use manylinux in CI test ( #692 )
2024-03-24 07:54:32 +08:00
Lovemefan
009ed2cd30
add WebAssembly for Kws ( #648 )
2024-03-11 21:02:31 +08:00
Fangjun Kuang
a628002d8f
Release v1.9.12 ( #661 )
2024-03-11 18:52:34 +08:00
Fangjun Kuang
d3287f9494
Add Python ASR examples with alsa ( #646 )
2024-03-08 11:34:48 +08:00
Fangjun Kuang
bdf9243940
Allow to not use pre-installed onnxruntime libs. ( #636 )
2024-03-06 14:40:23 +08:00
Fangjun Kuang
ed06ced16f
Add WebAssembly for NodeJS. ( #628 )
2024-03-03 20:00:36 +08:00
Fangjun Kuang
8b7928e7d6
Fix computing features for whisper. ( #617 )
2024-02-29 16:56:29 +08:00
Fangjun Kuang
16ba7e274a
Add WebAssembly for ASR ( #604 )
2024-02-23 17:39:11 +08:00
Fangjun Kuang
7c4b59932a
Refactor WebAssembly build script. ( #598 )
Make it easier to build WebAssembly for ASR.
2024-02-21 16:51:15 +08:00
Fangjun Kuang
d2cc48ded5
Add more Chinese TTS models (Mandarin and Cantonese) ( #589 )
2024-02-20 15:05:35 +08:00
Fangjun Kuang
5f075d0fce
Support MinSizeRel and RelWithDebInfo build on Windows. ( #586 )
2024-02-20 10:22:02 +08:00
Fangjun Kuang
d771762868
Support WebAssembly for text-to-speech ( #577 )
2024-02-08 23:39:12 +08:00
Fangjun Kuang
fa2af5dc69
Add TTS demo for C# API ( #557 )
2024-01-28 23:29:39 +08:00
Fangjun Kuang
7ae73e75ba
Run TTS engine service without starting the app. ( #553 )
2024-01-26 22:28:21 +08:00
Fangjun Kuang
bbd7c7fc18
Add Android demo for speaker recognition ( #536 )
See pre-built Android APKs at
https://k2-fsa.github.io/sherpa/onnx/speaker-identification/apk.html
2024-01-23 16:50:52 +08:00
Fangjun Kuang
581eceb4d5
Build text-to-speech engine APKs ( #509 )
2024-01-01 12:44:20 +08:00
Fangjun Kuang
e475e750ac
Support streaming zipformer CTC ( #496 )
* Support streaming zipformer CTC
* test online zipformer2 CTC
* Update doc of sherpa-onnx.cc
* Add Python APIs for streaming zipformer2 ctc
* Add Python API examples for streaming zipformer2 ctc
* Swift API for streaming zipformer2 CTC
* NodeJS API for streaming zipformer2 CTC
* Kotlin API for streaming zipformer2 CTC
* Golang API for streaming zipformer2 CTC
* C# API for streaming zipformer2 CTC
* Release v1.9.6
2023-12-22 13:46:33 +08:00
Fangjun Kuang
b18812ceff
Play generated audio using alsa for TTS ( #482 )
2023-12-13 22:28:03 +08:00
Fangjun Kuang
cae0231f93
Fix releasing go packages ( #476 )
2023-12-09 00:07:52 +08:00
Fangjun Kuang
868c339e5e
Support distil-small.en whisper ( #472 )
2023-12-08 11:59:20 +08:00
Fangjun Kuang
a7d69359c9
Release v1.9.0 ( #470 )
2023-12-06 19:46:50 +08:00
Fangjun Kuang
db41778e99
Support piper-phonemize ( #452 )
2023-11-28 19:12:58 +08:00
Fangjun Kuang
fe977b8e8e
support nodejs ( #438 )
2023-11-21 23:20:08 +08:00
Fangjun Kuang
049fb9f451
Add Python APIs for WeNet CTC models ( #428 )
2023-11-16 14:20:41 +08:00
Fangjun Kuang
61341b7187
Support VITS TTS models from coqui-ai/TTS ( #416 )
* Support VITS TTS models from coqui-ai/TTS
* release v1.8.9
2023-11-10 16:24:11 +08:00
Fangjun Kuang
ab0e830bee
Release v1.8.8 ( #414 )
2023-11-07 15:58:23 +08:00
Fangjun Kuang
d1a450bf82
Support text normalization via rule FST ( #407 )
2023-11-05 08:59:03 +08:00
Fangjun Kuang
cca744e34e
Update to onnxruntime v1.16.1 ( #406 )
2023-11-01 16:23:31 +08:00
Fangjun Kuang
157628b257
Support French in TTS ( #397 )
2023-10-28 22:22:00 +08:00