Fangjun Kuang
46e4e5b7ac
Add C++ support for streaming NeMo CTC models. ( #857 )
2024-05-10 16:26:43 +08:00
Fangjun Kuang
5ed3ec1c04
Export non-streaming NeMo faster conformer hybrid transducer and ctc to sherpa-onnx ( #847 )
2024-05-09 13:59:47 +08:00
Fangjun Kuang
68b25abf27
Export NeMo FastConformer Hybrid Transducer Large Streaming to ONNX ( #844 )
2024-05-08 19:07:49 +08:00
Fangjun Kuang
a9f936e92b
Export NeMo FastConformer Hybrid Transducer-CTC Large Streaming to ONNX. ( #843 )
2024-05-08 12:33:46 +08:00
Fangjun Kuang
d2e86b0415
Add links to pre-built APKs and pre-trained models to README. ( #840 )
2024-05-07 12:28:42 +08:00
Fangjun Kuang
37a4135dd7
Publish npm package with node-addon-api for Windows ( #838 )
2024-05-06 16:21:29 +08:00
Fangjun Kuang
e1bb928805
Upload two more 3d-speaker models ( #837 )
2024-05-06 12:23:49 +08:00
chiiyeh
9c8255fdb2
Update 3dspeaker/export-onnx.py ( #836 )
...
Update to match the changes in infer_sv.py at 3D-speaker.
Added 2 more supported models and "zh_en" language.
2024-05-06 12:10:35 +08:00
Fangjun Kuang
4f758e6cd3
Publish node-addon-api wrapper for sherpa-onnx as npm packages ( #829 )
2024-05-04 13:27:39 +08:00
Fangjun Kuang
2f9553d838
Begin to add node-addon-api for sherpa-onnx ( #826 )
2024-05-03 14:47:40 +08:00
Fangjun Kuang
612002da57
Fix C# to support Chinese tts models using jieba ( #815 )
2024-04-26 11:50:07 +08:00
Fangjun Kuang
6686c7d3e6
Add dict_dir arg to c api to support Chinese TTS models using jieba ( #809 )
2024-04-25 12:28:31 +08:00
Fangjun Kuang
9b67a476e6
Refactor the JNI interface to make it more modular and maintainable ( #802 )
2024-04-24 09:48:42 +08:00
Fangjun Kuang
7f3b9ffe5d
Refactor TTS Android code to support jieba for Chinese TTS models ( #800 )
2024-04-22 17:21:05 +08:00
Fangjun Kuang
9a68b92ce6
Increase CED's max frame length to 3000 ( #798 )
...
so that it can process waves for up to 30 seconds.
2024-04-22 10:18:47 +08:00
Fangjun Kuang
2e0ee0e8c8
fix a typo in building language ID apk ( #795 )
2024-04-19 20:16:48 +08:00
Fangjun Kuang
c1608b3524
Support CED models ( #792 )
2024-04-19 15:20:37 +08:00
Fangjun Kuang
d97a283dbb
Add Android demo for spoken language identification using Whisper multilingual models ( #783 )
2024-04-18 14:33:59 +08:00
Fangjun Kuang
69440e481f
Add WearOS demo for audio tagging ( #777 )
2024-04-17 12:22:17 +08:00
Fangjun Kuang
bcd9e48150
Add Android demo for audio tagging ( #776 )
...
See https://k2-fsa.github.io/sherpa/onnx/audio-tagging/apk.html
2024-04-16 20:47:16 +08:00
Fangjun Kuang
0d90b34e4a
Support Chinese heteronyms on Android for TTS. ( #742 )
2024-04-08 21:36:47 +08:00
Fangjun Kuang
a5f8fbc83f
Support heteronyms in Chinese TTS ( #738 )
2024-04-08 11:01:30 +08:00
Fangjun Kuang
dbff2eaadb
Add C API for streaming HLG decoding ( #734 )
2024-04-05 10:31:20 +08:00
Fangjun Kuang
3acf373b07
add more piper models ( #725 )
2024-04-01 11:39:52 +08:00
Fangjun Kuang
6da4a1c12f
Add Go API for speaker identification ( #718 )
2024-03-29 19:25:55 +08:00
Fangjun Kuang
a042f44076
Add Golang API for spoken language identification. ( #709 )
2024-03-27 19:40:25 +08:00
Fangjun Kuang
12efbf7397
Sign released TTS APKs ( #710 )
2024-03-27 19:34:37 +08:00
Fangjun Kuang
69c7880c4d
Add Golang API for VAD ( #708 )
2024-03-27 12:09:39 +08:00
Fangjun Kuang
bd66f7a7d0
Build Android TTS APKs for coqui-ai/TTS models ( #704 )
2024-03-26 14:05:26 +08:00
Fangjun Kuang
305c373107
Add C# API for spoken language identification ( #697 )
2024-03-25 18:45:09 +08:00
Fangjun Kuang
1952772654
Add timestamps and tokens for .Net's online models. ( #690 )
2024-03-23 18:51:56 +08:00
Fangjun Kuang
24f437a6f1
Refactor github actions tests ( #688 )
2024-03-22 21:22:42 +08:00
Fangjun Kuang
c8770aec20
Add nuget package for Windows x86 ( #683 )
2024-03-21 14:57:01 +08:00
Fangjun Kuang
acf0975153
Support whisper language/task in various language bindings. ( #679 )
2024-03-20 16:43:35 +08:00
Fangjun Kuang
6571fc9552
Add tts play example for .Net. ( #676 )
...
It plays the generated audio via a speaker as it is generating.
2024-03-19 17:33:15 +08:00
foreversimon
ce60100f68
Add HotwordsFile and HotwordsScore fields to OnlineRecognizerConfig in C# API ( #675 )
2024-03-19 15:04:08 +08:00
Fangjun Kuang
a628002d8f
Release v1.9.12 ( #661 )
2024-03-11 18:52:34 +08:00
Fangjun Kuang
ed06ced16f
Add WebAssembly for NodeJS. ( #628 )
2024-03-03 20:00:36 +08:00
Fangjun Kuang
a65643b594
support onnxruntime v1.17.1 ( #624 )
2024-03-02 11:44:59 +08:00
Fangjun Kuang
c093880d7c
Fix building wheels ( #620 )
2024-03-01 15:20:06 +08:00
Fangjun Kuang
d2cc48ded5
Add more Chinese TTS models (Mandarin and Cantonese) ( #589 )
2024-02-20 15:05:35 +08:00
Fangjun Kuang
81da0fb7a6
Update onnxruntime from 1.16.3 to 1.17.0 ( #581 )
2024-02-17 12:43:42 +08:00
Fangjun Kuang
558f5e3263
Use sequential layout for OfflineTtsConfig in C# ( #567 )
2024-02-01 16:06:32 +08:00
Fangjun Kuang
2e8b321210
Add fine-tuned whisper model on aishell ( #565 )
...
See also https://github.com/k2-fsa/icefall/pull/1466
2024-01-31 17:23:42 +08:00
20246688
0aa47e5ccc
Update test.py ( #560 )
2024-01-29 17:30:44 +08:00
Fangjun Kuang
fa2af5dc69
Add TTS demo for C# API ( #557 )
2024-01-28 23:29:39 +08:00
Fangjun Kuang
035a82df33
Add a new Persian tts model ( #555 )
2024-01-27 20:47:54 +08:00
Fangjun Kuang
bbd7c7fc18
Add Android demo for speaker recognition ( #536 )
...
See pre-built Android APKs at
https://k2-fsa.github.io/sherpa/onnx/speaker-identification/apk.html
2024-01-23 16:50:52 +08:00
Wei Kang
b6c020901a
decoder for open vocabulary keyword spotting ( #505 )
...
* various fixes to ContextGraph to support open vocabulary keywords decoder
* Add keyword spotter runtime
* Add binary
* First version works
* Minor fixes
* update text2token
* default values
* Add jni for kws
* add kws android project
* Minor fixes
* Remove unused interface
* Minor fixes
* Add workflow
* handle extra info in texts
* Minor fixes
* Add more comments
* Fix ci
* fix cpp style
* Add input box in android demo so that users can specify their keywords
* Fix cpp style
* Fix comments
* Minor fixes
* Minor fixes
* minor fixes
* Minor fixes
* Minor fixes
* Add CI
* Fix code style
* cpplint
* Fix comments
* Fix error
2024-01-20 22:52:41 +08:00
Fangjun Kuang
7e0ae677c8
Add a Persian and a Slovenian model from Piper for Android TTS. ( #531 )
2024-01-15 15:00:15 +08:00