Fangjun Kuang
fbe35ba736
Add Lazarus example for generating subtitles using Silero VAD with non-streaming ASR ( #1251 )
2024-08-15 22:19:45 +08:00
Fangjun Kuang
94e256244d
Add blank penalty for various language bindings. ( #1234 )
2024-08-08 10:43:31 +08:00
Parth Khiera
ba4cb6169f
feat: addition of blank_penalty config in online_recognizer ( #1232 )
2024-08-08 09:10:17 +08:00
Fangjun Kuang
561d04dd92
describe how to add new words for MeloTTS models ( #1209 )
2024-08-03 11:19:02 +08:00
Fangjun Kuang
35c1b4a7a9
Add ReazonSpeech Japanese pre-trained model ( #1203 )
2024-08-02 10:21:24 +08:00
Fangjun Kuang
ec98110e11
Add speaker identification and verification exmaple for Dart API ( #1194 )
2024-07-31 13:53:52 +08:00
Fangjun Kuang
646f99c870
Dart API for adding punctuations to text ( #1182 )
2024-07-29 12:41:52 +08:00
Fangjun Kuang
cd1fedaa49
Add Dart API for audio tagging ( #1181 )
2024-07-29 11:15:14 +08:00
Fangjun Kuang
69b6b47d91
Add vad with non-streaming ASR examples for Dart API ( #1180 )
2024-07-28 23:01:03 +08:00
Fangjun Kuang
4e6aeff07e
Refactor C API to prefix each API with SherpaOnnx. ( #1171 )
2024-07-26 18:47:02 +08:00
Fangjun Kuang
994c3e7c96
Add VAD + Non-streaming ASR example for JavaScript API. ( #1170 )
2024-07-26 12:42:08 +08:00
Fangjun Kuang
dd300b1de5
Add Java and Kotlin API for sense voice ( #1164 )
2024-07-22 14:08:40 +08:00
Fangjun Kuang
ac8223bd8a
Add Dart API for keyword spotter ( #1162 )
2024-07-22 10:53:34 +08:00
Fangjun Kuang
ffdb23a8ec
Add dart API for SenseVoice ( #1159 )
2024-07-21 21:48:12 +08:00
Fangjun Kuang
c3260ef842
Add JavaScript API for SenseVoice ( #1157 )
2024-07-21 10:14:14 +08:00
Fangjun Kuang
8f4d332aab
Add Go API for SenseVoice ( #1154 )
2024-07-20 23:41:53 +08:00
Fangjun Kuang
e472180f2c
Add C# API for SenseVoice models ( #1151 )
2024-07-20 17:09:23 +08:00
Fangjun Kuang
25f0a10468
Add C++ runtime for SenseVoice models ( #1148 )
2024-07-18 22:54:18 +08:00
Fangjun Kuang
3bae5c3fe5
test exported sense voice models ( #1147 )
2024-07-18 12:12:44 +08:00
Fangjun Kuang
346f419f39
export sense-voice to onnx ( #1144 )
2024-07-18 00:18:38 +08:00
Fangjun Kuang
9e448d03bc
Provide npm package for 32-bit Windows x86 ( #1141 )
2024-07-17 12:33:15 +08:00
Fangjun Kuang
960eb7529e
Add C++ runtime for MeloTTS ( #1138 )
2024-07-16 15:55:02 +08:00
Fangjun Kuang
95485411fa
Support English for MeloTTS models. ( #1134 )
2024-07-15 19:49:22 +08:00
Fangjun Kuang
fa07bbc176
Add APK for small paraformer ( #1133 )
2024-07-15 19:44:36 +08:00
Fangjun Kuang
c35200dccf
Revert to onnxruntime 1.17.1 ( #1131 )
2024-07-15 14:24:08 +08:00
Fangjun Kuang
04c2319c2c
Export MeloTTS to ONNX ( #1129 )
2024-07-15 10:47:19 +08:00
Fangjun Kuang
ab71c3976d
Add int8 quantized whisper large models ( #1126 )
2024-07-13 22:30:06 +08:00
Fangjun Kuang
3951a12f8d
Add pre-trained models for the Libriheavy dataset ( #1122 )
2024-07-13 19:21:13 +08:00
Fangjun Kuang
b5093e27f9
Fix publishing apks to huggingface ( #1121 )
...
Save APKs for each release in a separate directory.
Huggingface requires that each directory cannot contain more than 1000 files.
Since we have so many tts models and for each model we need to build APKs of 4 different ABIs,
it is a workaround for the huggingface's constraint by placing them into separate directories for different releases.
2024-07-13 16:14:00 +08:00
Fangjun Kuang
117cd7bb8c
Support whisper large/large-v1/large-v2/large-v3 and distil-large-v2 ( #1114 )
2024-07-12 23:47:39 +08:00
Fangjun Kuang
1c104ea847
Update onnxruntime from v1.18.0 to v1.18.1 ( #1107 )
2024-07-11 09:35:28 +08:00
Fangjun Kuang
08c758520f
Add keyword spotting for C# ( #1105 )
2024-07-10 21:18:46 +08:00
Fangjun Kuang
dd0ff2ca06
Support onnxruntime 1.18.0 ( #906 )
2024-07-10 17:05:26 +08:00
Fangjun Kuang
9e446b8501
Fix typos ( #1101 )
2024-07-09 20:08:47 +08:00
Fangjun Kuang
c2cc9dec58
Add Flush to VAD so that the last segment can be detected. ( #1099 )
2024-07-09 16:15:56 +08:00
Fangjun Kuang
5d2ceb3513
Support linux-arm64 for .Net ( #1092 )
2024-07-08 16:13:51 +08:00
Fangjun Kuang
e832d356c7
Add Flutter text to speech demo ( #1087 )
2024-07-08 11:23:11 +08:00
Fangjun Kuang
a25075101c
Build sherpa-onnx as a single shared library ( #1078 )
...
When `-D BUILD_SHARED_LIBS=ON` is passed to `cmake`, it builds a single shared library.
Specifically,
- For C APIs, it builds `libsherpa-onnx-c-api.so`
- For Python APIs, it builds `_sherpa_onnx.cpython-xx-xx.so`
- For Kotlin and Java APIs, it builds `libsherpa-onnx-jni.so`
There is no `libsherpa-onnx-core.so` any longer.
Note it affects only shared libraries.
2024-07-06 16:41:54 +08:00
Fangjun Kuang
b502116068
Refactor flutter to support Android ( #1072 )
2024-07-04 10:49:09 +08:00
Fangjun Kuang
8c4f576f1b
Support .Net framework 2.0 ( #1062 )
2024-06-28 11:27:19 +08:00
Fangjun Kuang
598c12c4e5
Fix CI tests ( #1061 )
2024-06-27 18:05:18 +08:00
Fangjun Kuang
5cce159cf3
Fix passing C# string to C++ ( #1055 )
2024-06-25 10:52:59 +08:00
Fangjun Kuang
a3bac19c54
fix a bug for wenet streaming model. ( #1054 )
...
* fix a bug for wenet streaming model.
The chunk shift was wrong.
See
https://github.com/wenet-e2e/wenet/blob/main/runtime/core/decoder/asr_model.cc#L15
and
https://github.com/wenet-e2e/wenet/blob/main/runtime/core/decoder/asr_model.cc#L28
2024-06-24 21:52:54 +08:00
Fangjun Kuang
1f95bff719
Add non-streaming zipformer Android APK ( #1052 )
2024-06-24 16:22:19 +08:00
Fangjun Kuang
e7a45108ac
Remove unused files from .Net examples ( #1051 )
2024-06-24 10:25:14 +08:00
东风破
00de2bd00b
Refactor .Net example project ( #1049 )
...
Co-authored-by: 东风破 <birdfishs@163.com >
2024-06-24 10:10:13 +08:00
Fangjun Kuang
169c9bf627
Flutter demo for real-time speech recognition ( #1042 )
2024-06-23 13:29:13 +08:00
Fangjun Kuang
9dd0e03568
Enable to stop TTS generation ( #1041 )
2024-06-22 18:18:36 +08:00
Fangjun Kuang
36336b31f4
Build Android APK for Thai ( #1036 )
2024-06-20 18:05:57 +08:00
Fangjun Kuang
6789c909d2
Inverse text normalization API of streaming ASR for various programming languages ( #1022 )
2024-06-18 13:42:17 +08:00