Commit Graph

692 Commits

Author SHA1 Message Date
Fangjun Kuang
b5093e27f9 Fix publishing apks to huggingface (#1121)
Save APKs for each release in a separate directory.

Huggingface requires that each directory cannot contain more than 1000 files.

Since we have so many tts models and for each model we need to build APKs of 4 different ABIs,
it is a workaround for the huggingface's constraint by placing them into separate directories for different releases.
2024-07-13 16:14:00 +08:00
Fangjun Kuang
54e6e962bf Add microphone example for .Net keyword spotting (#1120) 2024-07-13 14:56:39 +08:00
Fangjun Kuang
4c626e89cf Fix releasing dart packages (#1117) 2024-07-13 10:51:54 +08:00
Fangjun Kuang
117cd7bb8c Support whisper large/large-v1/large-v2/large-v3 and distil-large-v2 (#1114) 2024-07-12 23:47:39 +08:00
Fangjun Kuang
d928f77d0e Add timestamps about streaming models for Swift API (#1113) 2024-07-12 17:39:46 +08:00
thewh1teagle
c0eaf86dbd feat: find best embedding matches (#1102) 2024-07-11 09:38:06 +08:00
Fangjun Kuang
1c104ea847 Update onnxruntime from v1.18.0 to v1.18.1 (#1107) 2024-07-11 09:35:28 +08:00
Fangjun Kuang
5a2603ff5c Handle invalid utf8 sequence from Whisper for Dart API. (#1106)
Fixes #1104
2024-07-10 21:48:23 +08:00
Fangjun Kuang
08c758520f Add keyword spotting for C# (#1105) 2024-07-10 21:18:46 +08:00
Fangjun Kuang
dd0ff2ca06 Support onnxruntime 1.18.0 (#906) 2024-07-10 17:05:26 +08:00
Fangjun Kuang
9e446b8501 Fix typos (#1101) 2024-07-09 20:08:47 +08:00
Fangjun Kuang
c2cc9dec58 Add Flush to VAD so that the last segment can be detected. (#1099) 2024-07-09 16:15:56 +08:00
Manix
3e4307e2fb updating trt workspace int64 (#1094)
Signed-off-by: Manix <manickavela1998@gmail.com>
2024-07-08 20:38:16 +08:00
Fangjun Kuang
4fd0493037 fix ios demo image for flutter tts (#1093) 2024-07-08 17:07:01 +08:00
Fangjun Kuang
5d2ceb3513 Support linux-arm64 for .Net (#1092) 2024-07-08 16:13:51 +08:00
Fangjun Kuang
02eab549a3 Fix windows build (#1091) 2024-07-08 16:00:35 +08:00
Manix
d6fbecd947 parse option in64_t (#1089)
Signed-off-by: Manix <manickavela1998@gmail.com>
2024-07-08 15:37:30 +08:00
Fangjun Kuang
5817d6c391 Fix Flutter TTS example for iOS (#1090) 2024-07-08 15:22:09 +08:00
Fangjun Kuang
e832d356c7 Add Flutter text to speech demo (#1087) 2024-07-08 11:23:11 +08:00
Fangjun Kuang
1fe12c5107 Support the platform iOS for Flutter (#1079) 2024-07-06 19:43:37 +08:00
Fangjun Kuang
a25075101c Build sherpa-onnx as a single shared library (#1078)
When `-D BUILD_SHARED_LIBS=ON` is passed to `cmake`, it builds a single shared library.

Specifically, 

- For C APIs, it builds `libsherpa-onnx-c-api.so`
- For Python APIs, it builds `_sherpa_onnx.cpython-xx-xx.so`
- For Kotlin and Java APIs, it builds `libsherpa-onnx-jni.so`

There is no `libsherpa-onnx-core.so` any longer.

Note it affects only shared libraries.
2024-07-06 16:41:54 +08:00
Manix
55decb7bee Add config for TensorRT and CUDA execution provider (#992)
Signed-off-by: manickavela1998@gmail.com <manickavela1998@gmail.com>
Signed-off-by: manickavela1998@gmail.com <manickavela.arumugam@uniphore.com>
2024-07-05 15:18:37 +08:00
Fangjun Kuang
f5e9a162d1 Publish flutter packages for Android (#1074) 2024-07-04 20:07:07 +08:00
Fangjun Kuang
b502116068 Refactor flutter to support Android (#1072) 2024-07-04 10:49:09 +08:00
Fangjun Kuang
125bb9ff99 replace macos-11 with macos-13 for GitHub actions (#1066)
Tests using macos-11 wait a long time before they can start, so we replace macos-11 with macos-13.
2024-07-01 12:37:56 +08:00
Fangjun Kuang
6cb018184e Fix for silero vad v5. (#1065)
The network input is 64 + 512 samples instead of 512 samples for 16kHz.
2024-06-30 08:57:23 +08:00
Fangjun Kuang
61c7eb3063 Support silero_vad version 5 (#1064) 2024-06-29 11:45:04 +08:00
Fangjun Kuang
8c4f576f1b Support .Net framework 2.0 (#1062) 2024-06-28 11:27:19 +08:00
Fangjun Kuang
598c12c4e5 Fix CI tests (#1061) 2024-06-27 18:05:18 +08:00
Fangjun Kuang
03ebdf3fc6 Fix possible segfault in C API. (#1059) 2024-06-26 09:57:19 +08:00
hantengc
dfca4500aa Resolve issues with using pre-installed onnxruntime (#1058)
There is an issue with the directory, ${location_onnxruntime_header_dir}
may be NOT-FOUND
2024-06-25 16:21:36 +08:00
Fangjun Kuang
2f8c489698 Publish pre-built jni libs for windows and osx (#1056) 2024-06-25 11:59:04 +08:00
Fangjun Kuang
5cce159cf3 Fix passing C# string to C++ (#1055) 2024-06-25 10:52:59 +08:00
Fangjun Kuang
a3bac19c54 fix a bug for wenet streaming model. (#1054)
* fix a bug for wenet streaming model.

The chunk shift was wrong.
See
https://github.com/wenet-e2e/wenet/blob/main/runtime/core/decoder/asr_model.cc#L15
and
https://github.com/wenet-e2e/wenet/blob/main/runtime/core/decoder/asr_model.cc#L28
2024-06-24 21:52:54 +08:00
Fangjun Kuang
1f95bff719 Add non-streaming zipformer Android APK (#1052) 2024-06-24 16:22:19 +08:00
Fangjun Kuang
e7a45108ac Remove unused files from .Net examples (#1051) 2024-06-24 10:25:14 +08:00
东风破
00de2bd00b Refactor .Net example project (#1049)
Co-authored-by: 东风破 <birdfishs@163.com>
2024-06-24 10:10:13 +08:00
Fangjun Kuang
6b7ced2317 Fix the alsa-lib version to v1.2.12 (#1048) 2024-06-23 20:20:38 +08:00
Fangjun Kuang
7a663afb76 Add streaming ASR example from a microphone for Java API (#1047) 2024-06-23 19:43:53 +08:00
Fangjun Kuang
29abf242c3 Add VAD + Non-streaming ASR + microphone examples for Java API (#1046) 2024-06-23 19:09:21 +08:00
Fangjun Kuang
757a44b116 Add VAD + microphone example for Java API. (#1045) 2024-06-23 18:34:18 +08:00
Fangjun Kuang
7eae5c0d06 Update READEM to include links to pre-built flutter APPs (#1043) 2024-06-23 14:31:56 +08:00
Fangjun Kuang
169c9bf627 Flutter demo for real-time speech recognition (#1042) 2024-06-23 13:29:13 +08:00
Fangjun Kuang
9dd0e03568 Enable to stop TTS generation (#1041) 2024-06-22 18:18:36 +08:00
彭震东
96ab843173 fix typo (#1038) 2024-06-21 11:15:59 +08:00
Fangjun Kuang
36336b31f4 Build Android APK for Thai (#1036) 2024-06-20 18:05:57 +08:00
Zhong-Yi Li
675fb1574f offline transducer: treat unk as blank (#1005)
Co-authored-by: chungyi.li <chungyi.li@ailabs.tw>
2024-06-19 20:52:42 +08:00
Fangjun Kuang
a11c859971 Support clang-tidy (#1034) 2024-06-19 20:51:57 +08:00
SilverSulfide
656b9fa1c8 Add Python API support for Offline LM rescoring (#1033) 2024-06-19 16:29:37 +08:00
愚者自愚
167bc76db0 fix generate-subtitles.py bug (#1029)
* fix generate-subtitles.py If the audio file is not muted for more than 1 second at the end, it will cause the last segment to be lost
2024-06-18 18:29:39 +08:00