Fangjun Kuang
bc08160820
Export Pyannote speaker segmentation models to onnx ( #1382 )
2024-09-29 14:23:56 +08:00
Fangjun Kuang
11f0cb7e1c
Support Parakeet models from NeMo ( #1381 )
2024-09-27 17:12:00 +08:00
Fangjun Kuang
12d04ce8ed
Fix running MeloTTS models on GPU. ( #1379 )
...
We need to use opset 18 to export the model to onnx.
2024-09-26 16:51:43 +08:00
Fangjun Kuang
d8809b520e
Fix CI errors introduced by supporting loading keywords from buffers ( #1366 )
2024-09-20 19:04:21 +08:00
Fangjun Kuang
576a3aa90d
Add non-streaming ONNX models for Russian ASR ( #1358 )
2024-09-18 13:43:49 +08:00
Fangjun Kuang
cddac52780
Support passing utf-8 strings from JavaScript to C++. ( #1355 )
...
We first convert utf-16 strings to Uint8Array and then we
pass the array to C++.
2024-09-18 11:03:42 +08:00
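The conversion described in the commit above (a UTF-16 string turned into a byte array that is then handed to C++) can be illustrated roughly as follows. This is a minimal sketch in Python rather than the project's JavaScript; the helper names `js_string_units` and `utf16_units_to_utf8` are hypothetical and are not taken from the sherpa-onnx source.

```python
import struct


def js_string_units(text):
    """Return the UTF-16 code units of `text`, mimicking how a JavaScript
    string stores its contents (hypothetical helper for illustration)."""
    raw = text.encode("utf-16-le")
    return list(struct.unpack("<%dH" % (len(raw) // 2), raw))


def utf16_units_to_utf8(units):
    """Convert a sequence of UTF-16 code units to UTF-8 bytes.

    The returned bytes play the role of the Uint8Array that is passed
    to C++ (hypothetical helper for illustration).
    """
    raw = struct.pack("<%dH" % len(units), *units)
    return raw.decode("utf-16-le").encode("utf-8")


# A string mixing CJK characters, ASCII, and an emoji outside the BMP.
units = js_string_units("你好 😀")
payload = utf16_units_to_utf8(units)
```

The emoji occupies two UTF-16 code units (a surrogate pair) but four UTF-8 bytes, which is why the code unit count and the byte count differ.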
lllwan
bf06b268d0
Fix sherpa_onnx.go ( #1353 )
2024-09-17 13:39:56 +08:00
Fangjun Kuang
e7ffcbd677
Add APIs about max speech duration in VAD for various programming languages ( #1349 )
2024-09-14 12:30:13 +08:00
Fangjun Kuang
544857b097
Fix building ( #1343 )
2024-09-13 13:33:52 +08:00
Fangjun Kuang
6b8877f185
Downgrade flutter sdk versions. ( #1305 )
2024-08-30 11:47:27 +08:00
Fangjun Kuang
c38634dfcf
two-pass Android APK for SenseVoice ( #1302 )
2024-08-29 12:08:49 +08:00
Fangjun Kuang
9064430c3e
Fix releasing wasm app for vad+asr ( #1300 )
2024-08-29 08:47:38 +08:00
Emmanuel Schmidbauer
a8556e31ba
add Tokens []string, Timestamps []float32, Lang string, Emotion string, Event string ( #1277 )
2024-08-27 06:35:59 +08:00
Fangjun Kuang
17c8237ee4
Fix releasing npm package and fix building Android VAD+ASR example ( #1288 )
2024-08-26 10:18:48 +08:00
Fangjun Kuang
5ed8e31868
Add VAD and keyword spotting for the Node package with WebAssembly ( #1286 )
2024-08-24 23:05:54 +08:00
Fangjun Kuang
537e163dd0
WebAssembly example for VAD + Non-streaming ASR ( #1284 )
2024-08-24 13:24:52 +08:00
Fangjun Kuang
1ef8a7a202
Add WebAssembly for VAD ( #1281 )
2024-08-23 17:08:37 +08:00
Fangjun Kuang
fb09f8fae3
Set batch size to 1 for more streaming ASR models ( #1280 )
2024-08-23 11:06:55 +08:00
Fangjun Kuang
0e0d04a97a
Provide models for mobile-only platforms by fixing batch size to 1 ( #1276 )
2024-08-22 19:36:24 +08:00
Fangjun Kuang
5a2aa110b8
Text to speech API for Object Pascal. ( #1273 )
2024-08-20 20:52:16 +08:00
Fangjun Kuang
63713ecbf0
Build subtitle-generation apps for more models ( #1265 )
2024-08-16 20:11:24 +08:00
Fangjun Kuang
fbe35ba736
Add Lazarus example for generating subtitles using Silero VAD with non-streaming ASR ( #1251 )
2024-08-15 22:19:45 +08:00
Fangjun Kuang
94e256244d
Add blank penalty for various language bindings. ( #1234 )
2024-08-08 10:43:31 +08:00
Parth Khiera
ba4cb6169f
feat: addition of blank_penalty config in online_recognizer ( #1232 )
2024-08-08 09:10:17 +08:00
Fangjun Kuang
561d04dd92
describe how to add new words for MeloTTS models ( #1209 )
2024-08-03 11:19:02 +08:00
Fangjun Kuang
35c1b4a7a9
Add ReazonSpeech Japanese pre-trained model ( #1203 )
2024-08-02 10:21:24 +08:00
Fangjun Kuang
ec98110e11
Add speaker identification and verification example for Dart API ( #1194 )
2024-07-31 13:53:52 +08:00
Fangjun Kuang
646f99c870
Dart API for adding punctuations to text ( #1182 )
2024-07-29 12:41:52 +08:00
Fangjun Kuang
cd1fedaa49
Add Dart API for audio tagging ( #1181 )
2024-07-29 11:15:14 +08:00
Fangjun Kuang
69b6b47d91
Add vad with non-streaming ASR examples for Dart API ( #1180 )
2024-07-28 23:01:03 +08:00
Fangjun Kuang
4e6aeff07e
Refactor C API to prefix each API with SherpaOnnx. ( #1171 )
2024-07-26 18:47:02 +08:00
Fangjun Kuang
994c3e7c96
Add VAD + Non-streaming ASR example for JavaScript API. ( #1170 )
2024-07-26 12:42:08 +08:00
Fangjun Kuang
dd300b1de5
Add Java and Kotlin API for sense voice ( #1164 )
2024-07-22 14:08:40 +08:00
Fangjun Kuang
ac8223bd8a
Add Dart API for keyword spotter ( #1162 )
2024-07-22 10:53:34 +08:00
Fangjun Kuang
ffdb23a8ec
Add dart API for SenseVoice ( #1159 )
2024-07-21 21:48:12 +08:00
Fangjun Kuang
c3260ef842
Add JavaScript API for SenseVoice ( #1157 )
2024-07-21 10:14:14 +08:00
Fangjun Kuang
8f4d332aab
Add Go API for SenseVoice ( #1154 )
2024-07-20 23:41:53 +08:00
Fangjun Kuang
e472180f2c
Add C# API for SenseVoice models ( #1151 )
2024-07-20 17:09:23 +08:00
Fangjun Kuang
25f0a10468
Add C++ runtime for SenseVoice models ( #1148 )
2024-07-18 22:54:18 +08:00
Fangjun Kuang
3bae5c3fe5
test exported sense voice models ( #1147 )
2024-07-18 12:12:44 +08:00
Fangjun Kuang
346f419f39
export sense-voice to onnx ( #1144 )
2024-07-18 00:18:38 +08:00
Fangjun Kuang
9e448d03bc
Provide npm package for 32-bit Windows x86 ( #1141 )
2024-07-17 12:33:15 +08:00
Fangjun Kuang
960eb7529e
Add C++ runtime for MeloTTS ( #1138 )
2024-07-16 15:55:02 +08:00
Fangjun Kuang
95485411fa
Support English for MeloTTS models. ( #1134 )
2024-07-15 19:49:22 +08:00
Fangjun Kuang
fa07bbc176
Add APK for small paraformer ( #1133 )
2024-07-15 19:44:36 +08:00
Fangjun Kuang
c35200dccf
Revert to onnxruntime 1.17.1 ( #1131 )
2024-07-15 14:24:08 +08:00
Fangjun Kuang
04c2319c2c
Export MeloTTS to ONNX ( #1129 )
2024-07-15 10:47:19 +08:00
Fangjun Kuang
ab71c3976d
Add int8 quantized whisper large models ( #1126 )
2024-07-13 22:30:06 +08:00
Fangjun Kuang
3951a12f8d
Add pre-trained models for the Libriheavy dataset ( #1122 )
2024-07-13 19:21:13 +08:00
Fangjun Kuang
b5093e27f9
Fix publishing apks to huggingface ( #1121 )
...
Save the APKs for each release in a separate directory.
Hugging Face does not allow a single directory to contain more than 1000 files.
Because there are many TTS models, and each model needs APKs for 4 different ABIs,
placing each release's APKs in its own directory works around that limit.
2024-07-13 16:14:00 +08:00
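The per-release directory layout described in the commit above could look something like the sketch below. Only the 1000-file limit and the figure of 4 ABIs come from the commit message; the ABI names, the `tts-engine` base directory, the version string, and the file naming scheme are assumptions made for illustration and may not match what the project's CI actually uses.

```python
from pathlib import PurePosixPath

MAX_FILES_PER_DIR = 1000  # limit stated in the commit message

# Assumption: the 4 ABIs are the standard Android ones.
ABIS = ["arm64-v8a", "armeabi-v7a", "x86_64", "x86"]


def apk_paths(version, models, base="tts-engine"):
    """Return upload paths for one release, all under base/version/.

    `base` and the file name pattern are hypothetical.
    """
    release_dir = PurePosixPath(base) / version
    paths = [
        release_dir / f"sherpa-onnx-{version}-{abi}-{model}.apk"
        for model in models
        for abi in ABIS
    ]
    # Fail early if a single release would exceed the per-directory limit.
    if len(paths) > MAX_FILES_PER_DIR:
        raise ValueError(
            f"{len(paths)} files exceed the limit of {MAX_FILES_PER_DIR}"
        )
    return paths


# Two hypothetical models in a hypothetical release "1.10.0".
paths = apk_paths("1.10.0", ["model-a", "model-b"])
```

With 4 APKs per model, a single release directory holds at most 250 models under this scheme, and each new release starts from an empty directory.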