Fangjun Kuang
6da4a1c12f
Add Go API for speaker identification ( #718 )
2024-03-29 19:25:55 +08:00
Fangjun Kuang
a042f44076
Add Golang API for spoken language identification. ( #709 )
2024-03-27 19:40:25 +08:00
Fangjun Kuang
12efbf7397
Sign released TTS APKs ( #710 )
2024-03-27 19:34:37 +08:00
Fangjun Kuang
69c7880c4d
Add Golang API for VAD ( #708 )
2024-03-27 12:09:39 +08:00
Fangjun Kuang
bd66f7a7d0
Build Android TTS APKs for coqui-ai/TTS models ( #704 )
2024-03-26 14:05:26 +08:00
Fangjun Kuang
305c373107
Add C# API for spoken language identification ( #697 )
2024-03-25 18:45:09 +08:00
Fangjun Kuang
1952772654
Add timestamps and tokens for .Net's online models. ( #690 )
2024-03-23 18:51:56 +08:00
Fangjun Kuang
24f437a6f1
Refactor github actions tests ( #688 )
2024-03-22 21:22:42 +08:00
Fangjun Kuang
c8770aec20
Add nuget package for Windows x86 ( #683 )
2024-03-21 14:57:01 +08:00
Fangjun Kuang
acf0975153
Support whisper language/task in various language bindings. ( #679 )
2024-03-20 16:43:35 +08:00
Fangjun Kuang
6571fc9552
Add tts play example for .Net. ( #676 )
...
It plays the generated audio via a speaker as it is generating.
2024-03-19 17:33:15 +08:00
foreversimon
ce60100f68
Add HotwordsFile and HotwordsScore fields to OnlineRecognizerConfig in C# API ( #675 )
2024-03-19 15:04:08 +08:00
Fangjun Kuang
a628002d8f
Release v1.9.12 ( #661 )
2024-03-11 18:52:34 +08:00
Fangjun Kuang
ed06ced16f
Add WebAssembly for NodeJS. ( #628 )
2024-03-03 20:00:36 +08:00
Fangjun Kuang
a65643b594
support onnxruntime v1.17.1 ( #624 )
2024-03-02 11:44:59 +08:00
Fangjun Kuang
c093880d7c
Fix building wheels ( #620 )
2024-03-01 15:20:06 +08:00
Fangjun Kuang
d2cc48ded5
Add more Chinese TTS models (Mandarin and Cantonese) ( #589 )
2024-02-20 15:05:35 +08:00
Fangjun Kuang
81da0fb7a6
Update onnxruntime from 1.16.3 to 1.17.0 ( #581 )
2024-02-17 12:43:42 +08:00
Fangjun Kuang
558f5e3263
Use sequential layout for OfflineTtsConfig in C# ( #567 )
2024-02-01 16:06:32 +08:00
Fangjun Kuang
2e8b321210
Add fine-tuned whisper model on aishell ( #565 )
...
See also https://github.com/k2-fsa/icefall/pull/1466
2024-01-31 17:23:42 +08:00
20246688
0aa47e5ccc
Update test.py ( #560 )
2024-01-29 17:30:44 +08:00
Fangjun Kuang
fa2af5dc69
Add TTS demo for C# API ( #557 )
2024-01-28 23:29:39 +08:00
Fangjun Kuang
035a82df33
Add a new Persian tts model ( #555 )
2024-01-27 20:47:54 +08:00
Fangjun Kuang
bbd7c7fc18
Add Android demo for speaker recognition ( #536 )
...
See pre-built Android APKs at
https://k2-fsa.github.io/sherpa/onnx/speaker-identification/apk.html
2024-01-23 16:50:52 +08:00
Wei Kang
b6c020901a
decoder for open vocabulary keyword spotting ( #505 )
...
* various fixes to ContextGraph to support open vocabulary keywords decoder
* Add keyword spotter runtime
* Add binary
* First version works
* Minor fixes
* update text2token
* default values
* Add jni for kws
* add kws android project
* Minor fixes
* Remove unused interface
* Minor fixes
* Add workflow
* handle extra info in texts
* Minor fixes
* Add more comments
* Fix ci
* fix cpp style
* Add input box in android demo so that users can specify their keywords
* Fix cpp style
* Fix comments
* Minor fixes
* Minor fixes
* minor fixes
* Minor fixes
* Minor fixes
* Add CI
* Fix code style
* cpplint
* Fix comments
* Fix error
2024-01-20 22:52:41 +08:00
Fangjun Kuang
7e0ae677c8
Add a Persian and a Slovenian model from Piper for Android TTS. ( #531 )
2024-01-15 15:00:15 +08:00
Fangjun Kuang
229853b77e
Android TTS APKs for Persian ( #529 )
2024-01-14 21:44:46 +08:00
Fangjun Kuang
68a525a024
Export speaker verification models from NeMo to ONNX ( #526 )
2024-01-13 19:49:45 +08:00
Fangjun Kuang
07e2b9a36d
Support exporting models to onnx from 3D-Speaker ( #522 )
2024-01-10 21:09:45 +08:00
Fangjun Kuang
581eceb4d5
Build text-to-speech engine APKs ( #509 )
2024-01-01 12:44:20 +08:00
Fangjun Kuang
e475e750ac
Support streaming zipformer CTC ( #496 )
...
* Support streaming zipformer CTC
* test online zipformer2 CTC
* Update doc of sherpa-onnx.cc
* Add Python APIs for streaming zipformer2 ctc
* Add Python API examples for streaming zipformer2 ctc
* Swift API for streaming zipformer2 CTC
* NodeJS API for streaming zipformer2 CTC
* Kotlin API for streaming zipformer2 CTC
* Golang API for streaming zipformer2 CTC
* C# API for streaming zipformer2 CTC
* Release v1.9.6
2023-12-22 13:46:33 +08:00
Fangjun Kuang
ef8d112aaa
Fix whisper test script for the latest onnxruntime. ( #494 )
2023-12-20 11:12:12 +08:00
Fangjun Kuang
9829d7c4d3
Add two GLaDOS TTS models ( #481 )
2023-12-13 15:40:07 +08:00
Fangjun Kuang
80d0192325
Fix android tts audio buffer size and fix CI. ( #478 )
2023-12-10 18:25:50 +08:00
Fangjun Kuang
0f053d8040
Support playing as it is generating for Android ( #477 )
2023-12-09 16:36:38 +08:00
Fangjun Kuang
cae0231f93
Fix releasing go packages ( #476 )
2023-12-09 00:07:52 +08:00
Fangjun Kuang
aef74c5125
convert wespeaker models to sherpa-onnx ( #475 )
2023-12-08 19:32:29 +08:00
Fangjun Kuang
868c339e5e
Support distil-small.en whisper ( #472 )
2023-12-08 11:59:20 +08:00
Fangjun Kuang
3ae984f148
Remove the 30-second constraint from whisper. ( #471 )
2023-12-07 17:47:08 +08:00
Fangjun Kuang
23cf92daf7
Use espeak-ng for coqui-ai/TTS VITS English models. ( #466 )
2023-12-06 11:00:38 +08:00
Fangjun Kuang
3b90e85ef2
Fix building for .Net ( #463 )
2023-12-04 19:27:55 +08:00
Fangjun Kuang
62dc3c3e46
Use piper-phonemize to convert text to token IDs ( #453 )
2023-11-30 23:57:43 +08:00
Fangjun Kuang
8dc08a9b97
Fix nodejs on Windows ( #450 )
2023-11-25 21:23:15 +08:00
Fangjun Kuang
fe977b8e8e
support nodejs ( #438 )
2023-11-21 23:20:08 +08:00
Fangjun Kuang
fac4f6bc7c
Support streaming conformer CTC models from wenet ( #427 )
2023-11-16 10:35:23 +08:00
Fangjun Kuang
b83b3e3cd1
Support non-streaming WeNet CTC models. ( #426 )
2023-11-15 14:23:20 +08:00
Fangjun Kuang
d34640e3a3
Add scripts to export ASR models from wenet to ONNX ( #425 )
...
See
https://user-images.githubusercontent.com/5284924/282995968-f6d39118-8008-4ce7-9d7c-d1d6387ac183.png
2023-11-15 11:41:15 +08:00
Fangjun Kuang
9884cf71e7
Update onnxruntime to v1.16.2 ( #421 )
2023-11-12 11:29:33 +08:00
Fangjun Kuang
3c1ea990b1
Build Android APKs for VITS models from Coqui-ai/TTS ( #419 )
...
Support building Android APKs for VITS models from
https://github.com/coqui-ai/TTS
Please see
https://k2-fsa.github.io/sherpa/onnx/tts/apk.html
2023-11-11 13:27:15 +08:00
Fangjun Kuang
ab0e830bee
Release v1.8.8 ( #414 )
2023-11-07 15:58:23 +08:00