Commit Graph

475 Commits

Author SHA1 Message Date
Fangjun Kuang
43af1e6951 Release v1.9.15 (#719) 2024-03-29 19:58:04 +08:00
Fangjun Kuang
6da4a1c12f Add Go API for speaker identification (#718) 2024-03-29 19:25:55 +08:00
Fangjun Kuang
2e0bccad36 Add C API for speaker embedding extractor. (#711) 2024-03-28 18:05:40 +08:00
Leo Huang
638f48f47a Added progress for callback of tts generator (#712)
Co-authored-by: leohwang <leohwang@360converter.com>
2024-03-28 17:12:20 +08:00
longshiming
de655e838e delete incorrect logs (#714)
Co-authored-by: longshiming <longshiming@greesoft.com>
2024-03-28 10:49:45 +08:00
Fangjun Kuang
559744ac60 Fix ios-swift to remove invalid references (#713) 2024-03-28 09:39:43 +08:00
Fangjun Kuang
a042f44076 Add Golang API for spoken language identification. (#709) 2024-03-27 19:40:25 +08:00
Fangjun Kuang
12efbf7397 Sign released TTS APKs (#710) 2024-03-27 19:34:37 +08:00
Fangjun Kuang
69c7880c4d Add Golang API for VAD (#708) 2024-03-27 12:09:39 +08:00
hantengc
ccb2d435ec add openfst.cmake file (#707)
1. When compiling locally, openfst is missing.so add this file to the sherpa-onnx/cmake folder
2024-03-27 11:31:26 +08:00
Fangjun Kuang
4e040c596e Support including TTS conditionally. (#699) 2024-03-26 17:21:35 +08:00
Fangjun Kuang
bd66f7a7d0 Build Android TTS APKs for coqui-ai/TTS models (#704) 2024-03-26 14:05:26 +08:00
Fangjun Kuang
d364610605 Use a single thread when loading models (#703) 2024-03-26 13:35:33 +08:00
Fangjun Kuang
305c373107 Add C# API for spoken language identification (#697) 2024-03-25 18:45:09 +08:00
Fangjun Kuang
83a10a55a5 Add Swift API for spoken language identification. (#696) 2024-03-25 16:22:25 +08:00
Fangjun Kuang
ab7cff2513 Add C API for spoken language identification. (#695) 2024-03-25 15:16:47 +08:00
Fangjun Kuang
0d258dd150 Support spoken language identification with whisper (#694) 2024-03-24 22:57:00 +08:00
Fangjun Kuang
3cdad9b5d1 Use manylinux in CI test (#692) 2024-03-24 07:54:32 +08:00
Masoud
e60c897ce7 Update MainActivity.kt (#693)
fix read-only test text box
2024-03-24 07:29:14 +08:00
Fangjun Kuang
1952772654 Add timestamps and tokens for .Net's online models. (#690) 2024-03-23 18:51:56 +08:00
Fangjun Kuang
e6da2c5556 Fix build c api examples with alsa (#691) 2024-03-23 16:16:24 +08:00
Karel Vesely
eaec4c83c2 Configurable low_freq high_freq, dithering (#664) 2024-03-22 21:41:44 +08:00
Fangjun Kuang
2fc1201924 Add hotwords support to .Net (#689) 2024-03-22 21:40:42 +08:00
Fangjun Kuang
24f437a6f1 Refactor github actions tests (#688) 2024-03-22 21:22:42 +08:00
Masoud
1c77457d61 Update MainActivity.kt (#687)
Appending a default text to test field.
To faster check the voices
2024-03-22 19:04:14 +08:00
Fangjun Kuang
c8770aec20 Add nuget package for Windows x86 (#683) 2024-03-21 14:57:01 +08:00
Fangjun Kuang
acf0975153 Support whisper language/task in various language bindings. (#679) 2024-03-20 16:43:35 +08:00
Viggo
842d04d7ae support whisper language (#678) 2024-03-20 10:16:22 +08:00
Fangjun Kuang
6571fc9552 Add tts play example for .Net. (#676)
It plays the generated audio via a speaker as it is generating.
2024-03-19 17:33:15 +08:00
foreversimon
ce60100f68 Add HotwordsFile and HotwordsScore fields to OnlineRecognizerConfig in C# API (#675) 2024-03-19 15:04:08 +08:00
Bhaswati Saha
fda614d0d1 beam search value as parameter in offline_recognizer.py (#673)
Co-authored-by: bhascns <bhaswati@mihup.com>
2024-03-18 18:43:05 +08:00
Fangjun Kuang
9d6eb3e834 small fixes to wasm kws. (#672) 2024-03-18 15:28:10 +08:00
Lovemefan
009ed2cd30 add WebAssembly for Kws (#648) 2024-03-11 21:02:31 +08:00
Fangjun Kuang
a628002d8f Release v1.9.12 (#661) 2024-03-11 18:52:34 +08:00
Fangjun Kuang
44d0ef9ae3 Print the time about the first message in tts. (#655) 2024-03-11 11:05:42 +08:00
xinhecuican
f43139e803 c++ api for keyword spotter (#642) 2024-03-11 10:23:46 +08:00
Fangjun Kuang
1777a5dd88 Use onnxruntime 1.17.1 for iOS. (#654) 2024-03-10 14:26:36 +08:00
Fangjun Kuang
3232dff2cf Support user provided data in tts callback. (#653) 2024-03-09 18:15:03 +08:00
GaryLaurenceauAva
ac43c2d7b6 Expose 'language' 'task' 'tailPaddings' in OfflineWhisperModelConfig (#643)
Co-authored-by: Gary <gary.laurenceau@gmail.com>
2024-03-08 19:52:30 +08:00
Fangjun Kuang
4b708e055c Add microphone streaming ASR example for C API (#650) 2024-03-08 19:31:46 +08:00
Fangjun Kuang
d3287f9494 Add Python ASR examples with alsa (#646) 2024-03-08 11:34:48 +08:00
Wei Kang
e9e8d755d9 Fix detetion at the tail when using hotwords in streaming model (#638) 2024-03-08 10:04:33 +08:00
Fangjun Kuang
f70fdd156c Support using T-head-Semi/csi-nn2 for RISC-V (#637) 2024-03-06 18:21:50 +08:00
Fangjun Kuang
bdf9243940 Allow to not use pre-installed onnxruntime libs. (#636) 2024-03-06 14:40:23 +08:00
Fangjun Kuang
13260cdf49 Use self-compiled onnxruntime shared lib. (#635) 2024-03-06 11:03:24 +08:00
Fangjun Kuang
5dc2eaf2b4 Fix building wheels from source. (#632) 2024-03-04 16:39:51 +08:00
Fangjun Kuang
ed06ced16f Add WebAssembly for NodeJS. (#628) 2024-03-03 20:00:36 +08:00
Fangjun Kuang
ac6825ff11 Refactor WebAssembly for nodejs (#626) 2024-03-02 12:31:36 +08:00
Fangjun Kuang
a65643b594 support onnxruntime v1.17.1 (#624) 2024-03-02 11:44:59 +08:00
Fangjun Kuang
d56964371c Support VITS models from icefall. (#625) 2024-03-01 19:48:38 +08:00