Commit Graph

403 Commits

Author SHA1 Message Date
Fangjun Kuang
d5f486878d Remove libonnxruntime_providers_cuda.so as a dependency. (#1210) 2024-08-03 16:25:23 +08:00
Fangjun Kuang
35c1b4a7a9 Add ReazonSpeech Japanese pre-trained model (#1203) 2024-08-02 10:21:24 +08:00
Fangjun Kuang
53484fcd9b Fix reading non-standard wav files. (#1199) 2024-08-01 17:48:04 +08:00
Fangjun Kuang
86b4c9f535 Fix splitting sentences for MeloTTS (#1186) 2024-07-29 17:04:45 +08:00
Fangjun Kuang
1f8e575133 Add TTS example for Java API. (#1176)
It plays the generated audio as it is still generating.
2024-07-28 12:07:19 +08:00
Fangjun Kuang
4e6aeff07e Refactor C API to prefix each API with SherpaOnnx. (#1171) 2024-07-26 18:47:02 +08:00
Fangjun Kuang
994c3e7c96 Add VAD + Non-streaming ASR example for JavaScript API. (#1170) 2024-07-26 12:42:08 +08:00
Fangjun Kuang
299f1a852b Fix style issues reported by clang-tidy (#1167) 2024-07-23 09:26:36 +08:00
thewh1teagle
d32a46169f feat: add directml support (#1153) 2024-07-22 23:50:48 +08:00
Fangjun Kuang
dd300b1de5 Add Java and Kotlin API for sense voice (#1164) 2024-07-22 14:08:40 +08:00
Fangjun Kuang
1a471595a5 Fix Android build (#1161) 2024-07-22 09:27:30 +08:00
Fangjun Kuang
ffdb23a8ec Add dart API for SenseVoice (#1159) 2024-07-21 21:48:12 +08:00
Fangjun Kuang
25f0a10468 Add C++ runtime for SenseVoice models (#1148) 2024-07-18 22:54:18 +08:00
Wei Kang
5b1fa8750f Fix hotwords OOV log (#1139) 2024-07-16 19:41:31 +08:00
Fangjun Kuang
960eb7529e Add C++ runtime for MeloTTS (#1138) 2024-07-16 15:55:02 +08:00
Fangjun Kuang
fa07bbc176 Add APK for small paraformer (#1133) 2024-07-15 19:44:36 +08:00
Manickavela
11cfd33b10 encoder only trt ep for transducer (#1130) 2024-07-15 14:52:33 +08:00
ivan provalov
de04b3b9bf Allow modify model config at decode time for ASR (#1124) 2024-07-13 22:30:47 +08:00
Fangjun Kuang
b5093e27f9 Fix publishing apks to huggingface (#1121)
Save APKs for each release in a separate directory.

Huggingface requires that each directory cannot contain more than 1000 files.

Since we have so many tts models and for each model we need to build APKs of 4 different ABIs,
it is a workaround for the huggingface's constraint by placing them into separate directories for different releases.
2024-07-13 16:14:00 +08:00
Fangjun Kuang
117cd7bb8c Support whisper large/large-v1/large-v2/large-v3 and distil-large-v2 (#1114) 2024-07-12 23:47:39 +08:00
thewh1teagle
c0eaf86dbd feat: find best embedding matches (#1102) 2024-07-11 09:38:06 +08:00
Fangjun Kuang
08c758520f Add keyword spotting for C# (#1105) 2024-07-10 21:18:46 +08:00
Fangjun Kuang
dd0ff2ca06 Support onnxruntime 1.18.0 (#906) 2024-07-10 17:05:26 +08:00
Fangjun Kuang
c2cc9dec58 Add Flush to VAD so that the last segment can be detected. (#1099) 2024-07-09 16:15:56 +08:00
Manix
3e4307e2fb updating trt workspace int64 (#1094)
Signed-off-by: Manix <manickavela1998@gmail.com>
2024-07-08 20:38:16 +08:00
Manix
d6fbecd947 parse option in64_t (#1089)
Signed-off-by: Manix <manickavela1998@gmail.com>
2024-07-08 15:37:30 +08:00
Fangjun Kuang
a25075101c Build sherpa-onnx as a single shared library (#1078)
When `-D BUILD_SHARED_LIBS=ON` is passed to `cmake`, it builds a single shared library.

Specifically, 

- For C APIs, it builds `libsherpa-onnx-c-api.so`
- For Python APIs, it builds `_sherpa_onnx.cpython-xx-xx.so`
- For Kotlin and Java APIs, it builds `libsherpa-onnx-jni.so`

There is no `libsherpa-onnx-core.so` any longer.

Note it affects only shared libraries.
2024-07-06 16:41:54 +08:00
Manix
55decb7bee Add config for TensorRT and CUDA execution provider (#992)
Signed-off-by: manickavela1998@gmail.com <manickavela1998@gmail.com>
Signed-off-by: manickavela1998@gmail.com <manickavela.arumugam@uniphore.com>
2024-07-05 15:18:37 +08:00
Fangjun Kuang
f5e9a162d1 Publish flutter packages for Android (#1074) 2024-07-04 20:07:07 +08:00
Fangjun Kuang
6cb018184e Fix for silero vad v5. (#1065)
The network input is 64 + 512 samples instead of 512 samples for 16kHz.
2024-06-30 08:57:23 +08:00
Fangjun Kuang
61c7eb3063 Support silero_vad version 5 (#1064) 2024-06-29 11:45:04 +08:00
Fangjun Kuang
8c4f576f1b Support .Net framework 2.0 (#1062) 2024-06-28 11:27:19 +08:00
Fangjun Kuang
598c12c4e5 Fix CI tests (#1061) 2024-06-27 18:05:18 +08:00
Fangjun Kuang
03ebdf3fc6 Fix possible segfault in C API. (#1059) 2024-06-26 09:57:19 +08:00
Fangjun Kuang
2f8c489698 Publish pre-built jni libs for windows and osx (#1056) 2024-06-25 11:59:04 +08:00
Fangjun Kuang
5cce159cf3 Fix passing C# string to C++ (#1055) 2024-06-25 10:52:59 +08:00
Fangjun Kuang
a3bac19c54 fix a bug for wenet streaming model. (#1054)
* fix a bug for wenet streaming model.

The chunk shift was wrong.
See
https://github.com/wenet-e2e/wenet/blob/main/runtime/core/decoder/asr_model.cc#L15
and
https://github.com/wenet-e2e/wenet/blob/main/runtime/core/decoder/asr_model.cc#L28
2024-06-24 21:52:54 +08:00
Fangjun Kuang
1f95bff719 Add non-streaming zipformer Android APK (#1052) 2024-06-24 16:22:19 +08:00
Fangjun Kuang
9dd0e03568 Enable to stop TTS generation (#1041) 2024-06-22 18:18:36 +08:00
Fangjun Kuang
36336b31f4 Build Android APK for Thai (#1036) 2024-06-20 18:05:57 +08:00
Zhong-Yi Li
675fb1574f offline transducer: treat unk as blank (#1005)
Co-authored-by: chungyi.li <chungyi.li@ailabs.tw>
2024-06-19 20:52:42 +08:00
Fangjun Kuang
a11c859971 Support clang-tidy (#1034) 2024-06-19 20:51:57 +08:00
SilverSulfide
656b9fa1c8 Add Python API support for Offline LM rescoring (#1033) 2024-06-19 16:29:37 +08:00
Fangjun Kuang
6789c909d2 Inverse text normalization API of streaming ASR for various programming languages (#1022) 2024-06-18 13:42:17 +08:00
Fangjun Kuang
349d957da2 Add inverse text normalization for online ASR (#1020) 2024-06-17 18:39:23 +08:00
Fangjun Kuang
6e09933d99 Inverse text normalization API for other programming languages (#1019) 2024-06-17 17:02:39 +08:00
Fangjun Kuang
b0f7ed3ee3 Add inverse text normalization for non-streaming ASR (#1017) 2024-06-17 14:28:53 +08:00
Fangjun Kuang
dd69a1b56b Release v1.9.30 (#1016) 2024-06-17 11:57:38 +08:00
Fangjun Kuang
e1201225f2 Add Android APK for Korean (#1015) 2024-06-16 19:17:15 +08:00
Fangjun Kuang
99a9da19de add example description for the dart package (#1011) 2024-06-15 14:52:19 +08:00