Fangjun Kuang
|
43af1e6951
|
Release v1.9.15 (#719)
|
2024-03-29 19:58:04 +08:00 |
|
Fangjun Kuang
|
6da4a1c12f
|
Add Go API for speaker identification (#718)
|
2024-03-29 19:25:55 +08:00 |
|
Fangjun Kuang
|
2e0bccad36
|
Add C API for speaker embedding extractor. (#711)
|
2024-03-28 18:05:40 +08:00 |
|
Leo Huang
|
638f48f47a
|
Added progress for callback of tts generator (#712)
Co-authored-by: leohwang <leohwang@360converter.com>
|
2024-03-28 17:12:20 +08:00 |
|
longshiming
|
de655e838e
|
delete incorrect logs (#714)
Co-authored-by: longshiming <longshiming@greesoft.com>
|
2024-03-28 10:49:45 +08:00 |
|
Fangjun Kuang
|
559744ac60
|
Fix ios-swift to remove invalid references (#713)
|
2024-03-28 09:39:43 +08:00 |
|
Fangjun Kuang
|
a042f44076
|
Add Golang API for spoken language identification. (#709)
|
2024-03-27 19:40:25 +08:00 |
|
Fangjun Kuang
|
12efbf7397
|
Sign released TTS APKs (#710)
|
2024-03-27 19:34:37 +08:00 |
|
Fangjun Kuang
|
69c7880c4d
|
Add Golang API for VAD (#708)
|
2024-03-27 12:09:39 +08:00 |
|
hantengc
|
ccb2d435ec
|
add openfst.cmake file (#707)
1. When compiling locally, openfst is missing.so add this file to the sherpa-onnx/cmake folder
|
2024-03-27 11:31:26 +08:00 |
|
Fangjun Kuang
|
4e040c596e
|
Support including TTS conditionally. (#699)
|
2024-03-26 17:21:35 +08:00 |
|
Fangjun Kuang
|
bd66f7a7d0
|
Build Android TTS APKs for coqui-ai/TTS models (#704)
|
2024-03-26 14:05:26 +08:00 |
|
Fangjun Kuang
|
d364610605
|
Use a single thread when loading models (#703)
|
2024-03-26 13:35:33 +08:00 |
|
Fangjun Kuang
|
305c373107
|
Add C# API for spoken language identification (#697)
|
2024-03-25 18:45:09 +08:00 |
|
Fangjun Kuang
|
83a10a55a5
|
Add Swift API for spoken language identification. (#696)
|
2024-03-25 16:22:25 +08:00 |
|
Fangjun Kuang
|
ab7cff2513
|
Add C API for spoken language identification. (#695)
|
2024-03-25 15:16:47 +08:00 |
|
Fangjun Kuang
|
0d258dd150
|
Support spoken language identification with whisper (#694)
|
2024-03-24 22:57:00 +08:00 |
|
Fangjun Kuang
|
3cdad9b5d1
|
Use manylinux in CI test (#692)
|
2024-03-24 07:54:32 +08:00 |
|
Masoud
|
e60c897ce7
|
Update MainActivity.kt (#693)
fix read-only test text box
|
2024-03-24 07:29:14 +08:00 |
|
Fangjun Kuang
|
1952772654
|
Add timestamps and tokens for .Net's online models. (#690)
|
2024-03-23 18:51:56 +08:00 |
|
Fangjun Kuang
|
e6da2c5556
|
Fix build c api examples with alsa (#691)
|
2024-03-23 16:16:24 +08:00 |
|
Karel Vesely
|
eaec4c83c2
|
Configurable low_freq high_freq, dithering (#664)
|
2024-03-22 21:41:44 +08:00 |
|
Fangjun Kuang
|
2fc1201924
|
Add hotwords support to .Net (#689)
|
2024-03-22 21:40:42 +08:00 |
|
Fangjun Kuang
|
24f437a6f1
|
Refactor github actions tests (#688)
|
2024-03-22 21:22:42 +08:00 |
|
Masoud
|
1c77457d61
|
Update MainActivity.kt (#687)
Appending a default text to test field.
To faster check the voices
|
2024-03-22 19:04:14 +08:00 |
|
Fangjun Kuang
|
c8770aec20
|
Add nuget package for Windows x86 (#683)
|
2024-03-21 14:57:01 +08:00 |
|
Fangjun Kuang
|
acf0975153
|
Support whisper language/task in various language bindings. (#679)
|
2024-03-20 16:43:35 +08:00 |
|
Viggo
|
842d04d7ae
|
support whisper language (#678)
|
2024-03-20 10:16:22 +08:00 |
|
Fangjun Kuang
|
6571fc9552
|
Add tts play example for .Net. (#676)
It plays the generated audio via a speaker as it is generating.
|
2024-03-19 17:33:15 +08:00 |
|
foreversimon
|
ce60100f68
|
Add HotwordsFile and HotwordsScore fields to OnlineRecognizerConfig in C# API (#675)
|
2024-03-19 15:04:08 +08:00 |
|
Bhaswati Saha
|
fda614d0d1
|
beam search value as parameter in offline_recognizer.py (#673)
Co-authored-by: bhascns <bhaswati@mihup.com>
|
2024-03-18 18:43:05 +08:00 |
|
Fangjun Kuang
|
9d6eb3e834
|
small fixes to wasm kws. (#672)
|
2024-03-18 15:28:10 +08:00 |
|
Lovemefan
|
009ed2cd30
|
add WebAssembly for Kws (#648)
|
2024-03-11 21:02:31 +08:00 |
|
Fangjun Kuang
|
a628002d8f
|
Release v1.9.12 (#661)
|
2024-03-11 18:52:34 +08:00 |
|
Fangjun Kuang
|
44d0ef9ae3
|
Print the time about the first message in tts. (#655)
|
2024-03-11 11:05:42 +08:00 |
|
xinhecuican
|
f43139e803
|
c++ api for keyword spotter (#642)
|
2024-03-11 10:23:46 +08:00 |
|
Fangjun Kuang
|
1777a5dd88
|
Use onnxruntime 1.17.1 for iOS. (#654)
|
2024-03-10 14:26:36 +08:00 |
|
Fangjun Kuang
|
3232dff2cf
|
Support user provided data in tts callback. (#653)
|
2024-03-09 18:15:03 +08:00 |
|
GaryLaurenceauAva
|
ac43c2d7b6
|
Expose 'language' 'task' 'tailPaddings' in OfflineWhisperModelConfig (#643)
Co-authored-by: Gary <gary.laurenceau@gmail.com>
|
2024-03-08 19:52:30 +08:00 |
|
Fangjun Kuang
|
4b708e055c
|
Add microphone streaming ASR example for C API (#650)
|
2024-03-08 19:31:46 +08:00 |
|
Fangjun Kuang
|
d3287f9494
|
Add Python ASR examples with alsa (#646)
|
2024-03-08 11:34:48 +08:00 |
|
Wei Kang
|
e9e8d755d9
|
Fix detetion at the tail when using hotwords in streaming model (#638)
|
2024-03-08 10:04:33 +08:00 |
|
Fangjun Kuang
|
f70fdd156c
|
Support using T-head-Semi/csi-nn2 for RISC-V (#637)
|
2024-03-06 18:21:50 +08:00 |
|
Fangjun Kuang
|
bdf9243940
|
Allow to not use pre-installed onnxruntime libs. (#636)
|
2024-03-06 14:40:23 +08:00 |
|
Fangjun Kuang
|
13260cdf49
|
Use self-compiled onnxruntime shared lib. (#635)
|
2024-03-06 11:03:24 +08:00 |
|
Fangjun Kuang
|
5dc2eaf2b4
|
Fix building wheels from source. (#632)
|
2024-03-04 16:39:51 +08:00 |
|
Fangjun Kuang
|
ed06ced16f
|
Add WebAssembly for NodeJS. (#628)
|
2024-03-03 20:00:36 +08:00 |
|
Fangjun Kuang
|
ac6825ff11
|
Refactor WebAssembly for nodejs (#626)
|
2024-03-02 12:31:36 +08:00 |
|
Fangjun Kuang
|
a65643b594
|
support onnxruntime v1.17.1 (#624)
|
2024-03-02 11:44:59 +08:00 |
|
Fangjun Kuang
|
d56964371c
|
Support VITS models from icefall. (#625)
|
2024-03-01 19:48:38 +08:00 |
|