331 Commits

Author SHA1 Message Date
Fangjun Kuang
3bae5c3fe5 test exported sense voice models (#1147) 2024-07-18 12:12:44 +08:00
Fangjun Kuang
346f419f39 export sense-voice to onnx (#1144) 2024-07-18 00:18:38 +08:00
Fangjun Kuang
9e448d03bc Provide npm package for 32-bit Windows x86 (#1141) 2024-07-17 12:33:15 +08:00
Fangjun Kuang
960eb7529e Add C++ runtime for MeloTTS (#1138) 2024-07-16 15:55:02 +08:00
Fangjun Kuang
95485411fa Support English for MeloTTS models. (#1134) 2024-07-15 19:49:22 +08:00
Fangjun Kuang
fa07bbc176 Add APK for small paraformer (#1133) 2024-07-15 19:44:36 +08:00
Fangjun Kuang
c35200dccf Revert to onnxruntime 1.17.1 (#1131) 2024-07-15 14:24:08 +08:00
Fangjun Kuang
04c2319c2c Export MeloTTS to ONNX (#1129) 2024-07-15 10:47:19 +08:00
Fangjun Kuang
ab71c3976d Add int8 quantized whisper large models (#1126) 2024-07-13 22:30:06 +08:00
Fangjun Kuang
3951a12f8d Add pre-trained models for the Libriheavy dataset (#1122) 2024-07-13 19:21:13 +08:00
Fangjun Kuang
b5093e27f9 Fix publishing apks to huggingface (#1121)
Save the APKs for each release in a separate directory.

Hugging Face allows at most 1000 files per directory.

Because there are many TTS models, and each model needs APKs for 4 different ABIs, placing the APKs of each release in its own directory works around this limit.
2024-07-13 16:14:00 +08:00
Fangjun Kuang
117cd7bb8c Support whisper large/large-v1/large-v2/large-v3 and distil-large-v2 (#1114) 2024-07-12 23:47:39 +08:00
Fangjun Kuang
1c104ea847 Update onnxruntime from v1.18.0 to v1.18.1 (#1107) 2024-07-11 09:35:28 +08:00
Fangjun Kuang
08c758520f Add keyword spotting for C# (#1105) 2024-07-10 21:18:46 +08:00
Fangjun Kuang
dd0ff2ca06 Support onnxruntime 1.18.0 (#906) 2024-07-10 17:05:26 +08:00
Fangjun Kuang
9e446b8501 Fix typos (#1101) 2024-07-09 20:08:47 +08:00
Fangjun Kuang
c2cc9dec58 Add Flush to VAD so that the last segment can be detected. (#1099) 2024-07-09 16:15:56 +08:00
Fangjun Kuang
5d2ceb3513 Support linux-arm64 for .Net (#1092) 2024-07-08 16:13:51 +08:00
Fangjun Kuang
e832d356c7 Add Flutter text to speech demo (#1087) 2024-07-08 11:23:11 +08:00
Fangjun Kuang
a25075101c Build sherpa-onnx as a single shared library (#1078)
When `-D BUILD_SHARED_LIBS=ON` is passed to `cmake`, it builds a single shared library.

Specifically,

- For C APIs, it builds `libsherpa-onnx-c-api.so`
- For Python APIs, it builds `_sherpa_onnx.cpython-xx-xx.so`
- For Kotlin and Java APIs, it builds `libsherpa-onnx-jni.so`

There is no `libsherpa-onnx-core.so` any longer.

Note that this change affects only shared library builds.
2024-07-06 16:41:54 +08:00
Fangjun Kuang
b502116068 Refactor flutter to support Android (#1072) 2024-07-04 10:49:09 +08:00
Fangjun Kuang
8c4f576f1b Support .Net framework 2.0 (#1062) 2024-06-28 11:27:19 +08:00
Fangjun Kuang
598c12c4e5 Fix CI tests (#1061) 2024-06-27 18:05:18 +08:00
Fangjun Kuang
5cce159cf3 Fix passing C# string to C++ (#1055) 2024-06-25 10:52:59 +08:00
Fangjun Kuang
a3bac19c54 fix a bug for wenet streaming model. (#1054)
The chunk shift was wrong.
See
https://github.com/wenet-e2e/wenet/blob/main/runtime/core/decoder/asr_model.cc#L15
and
https://github.com/wenet-e2e/wenet/blob/main/runtime/core/decoder/asr_model.cc#L28
2024-06-24 21:52:54 +08:00
Fangjun Kuang
1f95bff719 Add non-streaming zipformer Android APK (#1052) 2024-06-24 16:22:19 +08:00
Fangjun Kuang
e7a45108ac Remove unused files from .Net examples (#1051) 2024-06-24 10:25:14 +08:00
东风破
00de2bd00b Refactor .Net example project (#1049)
Co-authored-by: 东风破 <birdfishs@163.com>
2024-06-24 10:10:13 +08:00
Fangjun Kuang
169c9bf627 Flutter demo for real-time speech recognition (#1042) 2024-06-23 13:29:13 +08:00
Fangjun Kuang
9dd0e03568 Enable to stop TTS generation (#1041) 2024-06-22 18:18:36 +08:00
Fangjun Kuang
36336b31f4 Build Android APK for Thai (#1036) 2024-06-20 18:05:57 +08:00
Fangjun Kuang
6789c909d2 Inverse text normalization API of streaming ASR for various programming languages (#1022) 2024-06-18 13:42:17 +08:00
Fangjun Kuang
6e09933d99 Inverse text normalization API for other programming languages (#1019) 2024-06-17 17:02:39 +08:00
Fangjun Kuang
dd69a1b56b Release v1.9.30 (#1016) 2024-06-17 11:57:38 +08:00
Fangjun Kuang
e1201225f2 Add Android APK for Korean (#1015) 2024-06-16 19:17:15 +08:00
Fangjun Kuang
e52d32b95b Add TTS API and examples for Dart (#1010) 2024-06-15 14:30:36 +08:00
Fangjun Kuang
e3077670c6 Add streaming ASR examples for Dart API (#1009) 2024-06-15 11:48:54 +08:00
Fangjun Kuang
d94506698d Add non-streaming ASR examples for Dart API (#1007) 2024-06-14 18:40:16 +08:00
Lovemefan
b1f08c0a20 scale value fix (#1006) 2024-06-14 16:51:53 +08:00
Fangjun Kuang
dcd6ec3123 Publish osx-arm64 nuget package for .Net (#1003) 2024-06-14 12:13:49 +08:00
Fangjun Kuang
20a21133af Use CI to publish dart packages (#1001) 2024-06-14 11:50:55 +08:00
Fangjun Kuang
d08cc04567 Add VAD example for Dart API (#996) 2024-06-14 10:37:16 +08:00
Fangjun Kuang
09efe54808 add more text-to-speech models from piper (#988) 2024-06-11 15:22:48 +08:00
Fangjun Kuang
fd5a0d1e00 Add C++ runtime for Tele-AI/TeleSpeech-ASR (#970) 2024-06-05 00:26:40 +08:00
Fangjun Kuang
f8dbc10146 Fix CI (#964) 2024-06-04 17:05:49 +08:00
Fangjun Kuang
c6f22f25fa export telespeech ctc models to sherpa-onnx (#968) 2024-06-04 12:10:39 +08:00
Fangjun Kuang
0196f1d546 Wrap VAD APIs to C# (#946) 2024-05-30 18:29:38 +08:00
Fangjun Kuang
a99c7cb35c Wrap punctuation APIs to C#. (#945) 2024-05-30 16:19:56 +08:00
东风破
909148fe42 Split online.cs and offline.cs files (#941)
Co-authored-by: 东风破 <birdfishs@163.com>
2024-05-30 11:00:24 +08:00
Fangjun Kuang
b4f1006864 fix node-addon-api for vad (#932) 2024-05-28 18:46:43 +08:00