Fangjun Kuang
3bae5c3fe5
test exported sense voice models ( #1147 )
2024-07-18 12:12:44 +08:00
Fangjun Kuang
346f419f39
export sense-voice to onnx ( #1144 )
2024-07-18 00:18:38 +08:00
Fangjun Kuang
9e448d03bc
Provide npm package for 32-bit Windows x86 ( #1141 )
2024-07-17 12:33:15 +08:00
Fangjun Kuang
960eb7529e
Add C++ runtime for MeloTTS ( #1138 )
2024-07-16 15:55:02 +08:00
Fangjun Kuang
95485411fa
Support English for MeloTTS models. ( #1134 )
2024-07-15 19:49:22 +08:00
Fangjun Kuang
fa07bbc176
Add APK for small paraformer ( #1133 )
2024-07-15 19:44:36 +08:00
Fangjun Kuang
c35200dccf
Revert to onnxruntime 1.17.1 ( #1131 )
2024-07-15 14:24:08 +08:00
Fangjun Kuang
04c2319c2c
Export MeloTTS to ONNX ( #1129 )
2024-07-15 10:47:19 +08:00
Fangjun Kuang
ab71c3976d
Add int8 quantized whisper large models ( #1126 )
2024-07-13 22:30:06 +08:00
Fangjun Kuang
3951a12f8d
Add pre-trained models for the Libriheavy dataset ( #1122 )
2024-07-13 19:21:13 +08:00
Fangjun Kuang
b5093e27f9
Fix publishing apks to huggingface ( #1121 )
...
Save APKs for each release in a separate directory.
Huggingface requires that each directory cannot contain more than 1000 files.
Since we have so many tts models and for each model we need to build APKs of 4 different ABIs,
it is a workaround for the huggingface's constraint by placing them into separate directories for different releases.
2024-07-13 16:14:00 +08:00
Fangjun Kuang
117cd7bb8c
Support whisper large/large-v1/large-v2/large-v3 and distil-large-v2 ( #1114 )
2024-07-12 23:47:39 +08:00
Fangjun Kuang
1c104ea847
Update onnxruntime from v1.18.0 to v1.18.1 ( #1107 )
2024-07-11 09:35:28 +08:00
Fangjun Kuang
08c758520f
Add keyword spotting for C# ( #1105 )
2024-07-10 21:18:46 +08:00
Fangjun Kuang
dd0ff2ca06
Support onnxruntime 1.18.0 ( #906 )
2024-07-10 17:05:26 +08:00
Fangjun Kuang
9e446b8501
Fix typos ( #1101 )
2024-07-09 20:08:47 +08:00
Fangjun Kuang
c2cc9dec58
Add Flush to VAD so that the last segment can be detected. ( #1099 )
2024-07-09 16:15:56 +08:00
Fangjun Kuang
5d2ceb3513
Support linux-arm64 for .Net ( #1092 )
2024-07-08 16:13:51 +08:00
Fangjun Kuang
e832d356c7
Add Flutter text to speech demo ( #1087 )
2024-07-08 11:23:11 +08:00
Fangjun Kuang
a25075101c
Build sherpa-onnx as a single shared library ( #1078 )
...
When `-D BUILD_SHARED_LIBS=ON` is passed to `cmake`, it builds a single shared library.
Specifically,
- For C APIs, it builds `libsherpa-onnx-c-api.so`
- For Python APIs, it builds `_sherpa_onnx.cpython-xx-xx.so`
- For Kotlin and Java APIs, it builds `libsherpa-onnx-jni.so`
There is no `libsherpa-onnx-core.so` any longer.
Note it affects only shared libraries.
2024-07-06 16:41:54 +08:00
Fangjun Kuang
b502116068
Refactor flutter to support Android ( #1072 )
2024-07-04 10:49:09 +08:00
Fangjun Kuang
8c4f576f1b
Support .Net framework 2.0 ( #1062 )
2024-06-28 11:27:19 +08:00
Fangjun Kuang
598c12c4e5
Fix CI tests ( #1061 )
2024-06-27 18:05:18 +08:00
Fangjun Kuang
5cce159cf3
Fix passing C# string to C++ ( #1055 )
2024-06-25 10:52:59 +08:00
Fangjun Kuang
a3bac19c54
fix a bug for wenet streaming model. ( #1054 )
...
* fix a bug for wenet streaming model.
The chunk shift was wrong.
See
https://github.com/wenet-e2e/wenet/blob/main/runtime/core/decoder/asr_model.cc#L15
and
https://github.com/wenet-e2e/wenet/blob/main/runtime/core/decoder/asr_model.cc#L28
2024-06-24 21:52:54 +08:00
Fangjun Kuang
1f95bff719
Add non-streaming zipformer Android APK ( #1052 )
2024-06-24 16:22:19 +08:00
Fangjun Kuang
e7a45108ac
Remove unused files from .Net examples ( #1051 )
2024-06-24 10:25:14 +08:00
东风破
00de2bd00b
Refactor .Net example project ( #1049 )
...
Co-authored-by: 东风破 <birdfishs@163.com >
2024-06-24 10:10:13 +08:00
Fangjun Kuang
169c9bf627
Flutter demo for real-time speech recognition ( #1042 )
2024-06-23 13:29:13 +08:00
Fangjun Kuang
9dd0e03568
Enable to stop TTS generation ( #1041 )
2024-06-22 18:18:36 +08:00
Fangjun Kuang
36336b31f4
Build Android APK for Thai ( #1036 )
2024-06-20 18:05:57 +08:00
Fangjun Kuang
6789c909d2
Inverse text normalization API of streaming ASR for various programming languages ( #1022 )
2024-06-18 13:42:17 +08:00
Fangjun Kuang
6e09933d99
Inverse text normalization API for other programming languages ( #1019 )
2024-06-17 17:02:39 +08:00
Fangjun Kuang
dd69a1b56b
Release v1.9.30 ( #1016 )
2024-06-17 11:57:38 +08:00
Fangjun Kuang
e1201225f2
Add Android APK for Korean ( #1015 )
2024-06-16 19:17:15 +08:00
Fangjun Kuang
e52d32b95b
Add TTS API and examples for Dart ( #1010 )
2024-06-15 14:30:36 +08:00
Fangjun Kuang
e3077670c6
Add streaming ASR examples for Dart API ( #1009 )
2024-06-15 11:48:54 +08:00
Fangjun Kuang
d94506698d
Add non-streaming ASR examples for Dart API ( #1007 )
2024-06-14 18:40:16 +08:00
Lovemefan
b1f08c0a20
scale value fix ( #1006 )
2024-06-14 16:51:53 +08:00
Fangjun Kuang
dcd6ec3123
Publish osx-arm64 nuget package for .Net ( #1003 )
2024-06-14 12:13:49 +08:00
Fangjun Kuang
20a21133af
Use CI to publish dart packages ( #1001 )
2024-06-14 11:50:55 +08:00
Fangjun Kuang
d08cc04567
Add VAD example for Dart API ( #996 )
2024-06-14 10:37:16 +08:00
Fangjun Kuang
09efe54808
add more text-to-speech models from piper ( #988 )
2024-06-11 15:22:48 +08:00
Fangjun Kuang
fd5a0d1e00
Add C++ runtime for Tele-AI/TeleSpeech-ASR ( #970 )
2024-06-05 00:26:40 +08:00
Fangjun Kuang
f8dbc10146
Fix CI ( #964 )
2024-06-04 17:05:49 +08:00
Fangjun Kuang
c6f22f25fa
export telespeech ctc models to sherpa-onnx ( #968 )
2024-06-04 12:10:39 +08:00
Fangjun Kuang
0196f1d546
Wrap VAD APIs to C# ( #946 )
2024-05-30 18:29:38 +08:00
Fangjun Kuang
a99c7cb35c
Wrap punctuation APIs to C#. ( #945 )
2024-05-30 16:19:56 +08:00
东风破
909148fe42
Split online.cs and offline.csFile ( #941 )
...
Co-authored-by: 东风破 <birdfishs@163.com >
2024-05-30 11:00:24 +08:00
Fangjun Kuang
b4f1006864
fix node-addaon-api for vad ( #932 )
2024-05-28 18:46:43 +08:00