Fangjun Kuang
13730ecbd8
Add C API for punctuation ( #768 )
2024-04-14 19:02:34 +08:00
Fangjun Kuang
68b8b88b5a
Add Python API for punctuation models. ( #762 )
2024-04-13 13:28:17 +08:00
Fangjun Kuang
329fe1aa8b
Support adding punctuations to the speech recogntion result ( #761 )
2024-04-13 12:15:57 +08:00
Fangjun Kuang
f204e62b44
Add C API for audio tagging ( #754 )
2024-04-11 14:18:43 +08:00
Fangjun Kuang
34d70a259f
Add Python API and Python examples for audio tagging ( #753 )
2024-04-11 11:12:48 +08:00
Fangjun Kuang
f20291cadc
Support audio tagging using zipformer ( #747 )
2024-04-10 14:47:06 +08:00
Fangjun Kuang
6fb8ceda57
Add VAD examples using ALSA for recording ( #739 )
2024-04-08 16:41:01 +08:00
Fangjun Kuang
a5f8fbc83f
Support heteronyms in Chinese TTS ( #738 )
2024-04-08 11:01:30 +08:00
Fangjun Kuang
dbff2eaadb
Add C API for streaming HLG decoding ( #734 )
2024-04-05 10:31:20 +08:00
Fangjun Kuang
db67e00c77
Add HLG decoding for streaming CTC models ( #731 )
2024-04-03 21:31:42 +08:00
Fangjun Kuang
2e0bccad36
Add C API for speaker embedding extractor. ( #711 )
2024-03-28 18:05:40 +08:00
Fangjun Kuang
305c373107
Add C# API for spoken language identification ( #697 )
2024-03-25 18:45:09 +08:00
Fangjun Kuang
83a10a55a5
Add Swift API for spoken language identification. ( #696 )
2024-03-25 16:22:25 +08:00
Fangjun Kuang
ab7cff2513
Add C API for spoken language identification. ( #695 )
2024-03-25 15:16:47 +08:00
Fangjun Kuang
0d258dd150
Support spoken language identification with whisper ( #694 )
2024-03-24 22:57:00 +08:00
Fangjun Kuang
24f437a6f1
Refactor github actions tests ( #688 )
2024-03-22 21:22:42 +08:00
Wei Kang
734bbd91dc
Add Python API for keyword spotting ( #576 )
...
* Add alsa & microphone support for keyword spotting
* Add python wrapper
2024-03-01 09:31:11 +08:00
Wei Kang
2ff1049079
change modelscope link to github for build-kws-apki ( #540 )
2024-01-24 16:40:14 +08:00
Wei Kang
626775e5e2
Change model url from modelscope to github ( #538 )
2024-01-23 10:15:58 +08:00
Wei Kang
b6c020901a
decoder for open vocabulary keyword spotting ( #505 )
...
* various fixes to ContextGraph to support open vocabulary keywords decoder
* Add keyword spotter runtime
* Add binary
* First version works
* Minor fixes
* update text2token
* default values
* Add jni for kws
* add kws android project
* Minor fixes
* Remove unused interface
* Minor fixes
* Add workflow
* handle extra info in texts
* Minor fixes
* Add more comments
* Fix ci
* fix cpp style
* Add input box in android demo so that users can specify their keywords
* Fix cpp style
* Fix comments
* Minor fixes
* Minor fixes
* minor fixes
* Minor fixes
* Minor fixes
* Add CI
* Fix code style
* cpplint
* Fix comments
* Fix error
2024-01-20 22:52:41 +08:00
Fangjun Kuang
2024e96639
Add C++ runtime for speaker verification models from NeMo ( #527 )
2024-01-13 21:42:09 +08:00
Fangjun Kuang
68a525a024
Export speaker verification models from NeMo to ONNX ( #526 )
2024-01-13 19:49:45 +08:00
Fangjun Kuang
afc81ec122
Add C++ runtime for models from 3d-speaker ( #523 )
2024-01-11 19:10:30 +08:00
Fangjun Kuang
e475e750ac
Support streaming zipformer CTC ( #496 )
...
* Support streaming zipformer CTC
* test online zipformer2 CTC
* Update doc of sherpa-onnx.cc
* Add Python APIs for streaming zipformer2 ctc
* Add Python API examples for streaming zipformer2 ctc
* Swift API for streaming zipformer2 CTC
* NodeJS API for streaming zipformer2 CTC
* Kotlin API for streaming zipformer2 CTC
* Golang API for streaming zipformer2 CTC
* C# API for streaming zipformer2 CTC
* Release v1.9.6
2023-12-22 13:46:33 +08:00
Fangjun Kuang
868c339e5e
Support distil-small.en whisper ( #472 )
2023-12-08 11:59:20 +08:00
Fangjun Kuang
3ae984f148
Remove the 30-second constraint from whisper. ( #471 )
2023-12-07 17:47:08 +08:00
Fangjun Kuang
62dc3c3e46
Use piper-phonemize to convert text to token IDs ( #453 )
2023-11-30 23:57:43 +08:00
Fangjun Kuang
db41778e99
Support piper-phonemize ( #452 )
2023-11-28 19:12:58 +08:00
Fangjun Kuang
8dc08a9b97
Fix nodejs on Windows ( #450 )
2023-11-25 21:23:15 +08:00
Fangjun Kuang
2f22e6ed63
Add Swift API for TTS ( #439 )
2023-11-22 16:04:26 +08:00
Fangjun Kuang
fe977b8e8e
support nodejs ( #438 )
2023-11-21 23:20:08 +08:00
Fangjun Kuang
049fb9f451
Add Python APIs for WeNet CTC models ( #428 )
2023-11-16 14:20:41 +08:00
Fangjun Kuang
fac4f6bc7c
Support streaming conformer CTC models from wenet ( #427 )
2023-11-16 10:35:23 +08:00
Fangjun Kuang
b83b3e3cd1
Support non-streaming WeNet CTC models. ( #426 )
2023-11-15 14:23:20 +08:00
Fangjun Kuang
ea7c45b60c
Add C API for offline TTS. ( #373 )
2023-10-19 17:38:23 +08:00
Fangjun Kuang
1ee79e3ff5
Support Chinese vits models ( #368 )
2023-10-18 10:19:10 +08:00
Fangjun Kuang
655e0fa836
add python API and examples for TTS ( #364 )
2023-10-14 14:21:53 +08:00
Fangjun Kuang
407602445d
Add CTC HLG decoding using OpenFst ( #349 )
2023-10-08 11:32:39 +08:00
Fangjun Kuang
e2be532b32
Add timestamps for offline paraformer ( #310 )
2023-09-14 19:33:41 +08:00
Wei Kang
47184f9db7
Refactor hotwords,support loading hotwords from file ( #296 )
2023-09-14 19:33:17 +08:00
Fangjun Kuang
6038e2aa62
Support streaming paraformer ( #263 )
2023-08-14 10:32:14 +08:00
Fangjun Kuang
a4bff28e21
Support TDNN models from the yesno recipe from icefall ( #262 )
2023-08-12 19:50:22 +08:00
Fangjun Kuang
f7c05b1570
Add CI test for Whisper models ( #239 )
2023-08-07 19:24:52 +08:00
Jingzhao Ou
0ed501b8f1
Added provider option to sherpa-onnx and decode-file-c-api ( #162 )
2023-06-03 04:57:48 +08:00
Jingzhao Ou
de1880948b
Added streaming conformer test ( #150 )
2023-05-12 08:37:24 +08:00
Fangjun Kuang
4024bfab32
Add examples for Kotlin API ( #124 )
2023-04-19 17:29:35 +08:00
Fangjun Kuang
80060c276d
Begin to support CTC models ( #119 )
...
Please see https://k2-fsa.github.io/sherpa/onnx/pretrained_models/offline-ctc/nemo/index.html for a list of pre-trained CTC models from NeMo.
2023-04-07 23:11:34 +08:00
Fangjun Kuang
5d3c8edbc9
add python tests ( #111 )
2023-04-02 23:05:30 +08:00
Fangjun Kuang
f9f9f20d37
Test int8 models ( #107 )
...
* Test int8 models
* Fix displaying help messages
* small fixes
* Fix jni test
2023-03-31 22:34:10 +08:00
Fangjun Kuang
f69b4e9396
Add tests for zipformer ( #106 )
2023-03-31 04:47:20 +08:00