Emmanuel Schmidbauer
8c087d9110
flutter: add lang, emotion, event to OfflineRecognizerResult ( #1268 )
2024-08-17 07:21:59 +08:00
Fangjun Kuang
88809753ab
Release v1.10.22 ( #1267 )
2024-08-16 22:40:49 +08:00
Fangjun Kuang
9dcea49dba
Fix looking up OOVs in lexicon.txt for MeloTTS models. ( #1266 )
...
If an English word does not exist in the lexicon, we split
it into characters. For instance, if the word TTS does not
exist in lexicon.txt, we split it into 3 characters T, T, and S.
2024-08-16 22:10:03 +08:00
Fangjun Kuang
63713ecbf0
Build generating subtitles APPs for more models ( #1265 )
2024-08-16 20:11:24 +08:00
Ikko Eltociear Ashimine
a3e98750e9
chore: update online-stream.h ( #1264 )
...
Fix typos.
2024-08-16 15:17:15 +08:00
Fangjun Kuang
fbe35ba736
Add Lazarus example for generating subtitles using Silero VAD with non-streaming ASR ( #1251 )
2024-08-15 22:19:45 +08:00
Fangjun Kuang
97a6a2a16a
Enable IPO only for Release build. ( #1261 )
2024-08-15 18:16:42 +08:00
Fangjun Kuang
ca729faebf
Support reading multi-channel wave files with 8/16/32-bit encoded samples ( #1258 )
2024-08-15 14:54:43 +08:00
Robin Zhong
62c4d4ab62
Add emotion, event of SenseVoice. ( #1257 )
...
* Add emotion, event of SenseVoice.
* Fix tokens size check and update java api.
https://github.com/k2-fsa/sherpa-onnx/pull/1257
2024-08-14 15:50:13 +08:00
Han Zhu
f300ec0f98
Add more C API examples ( #1255 )
...
C API examples for zipformer, paraformer, and TeleSpeech-ASR CTC models.
2024-08-14 10:52:47 +08:00
ivan provalov
9f06b059d7
Update offline-recognizer.cc ( #1253 )
...
Adding setConfig method to JNI to support setting a config on the previously initialized offline-recognizer.
2024-08-13 23:04:51 +08:00
Fangjun Kuang
619279b162
Pascal API for VAD ( #1249 )
2024-08-13 16:16:51 +08:00
Fangjun Kuang
a7dc6c2c16
Pascal API for non-streaming ASR ( #1247 )
2024-08-12 23:33:35 +08:00
Fangjun Kuang
5791b695ea
Pascal API for streaming ASR ( #1246 )
2024-08-12 19:55:51 +08:00
Fangjun Kuang
65f1c0fab2
Add Pascal API for reading wave files ( #1243 )
2024-08-11 22:43:42 +08:00
Fangjun Kuang
968623a477
Exclude .DS_Store files from flutter tts assets ( #1238 )
2024-08-09 13:19:27 +08:00
Fangjun Kuang
94e256244d
Add blank penalty for various language bindings. ( #1234 )
2024-08-08 10:43:31 +08:00
Parth Khiera
ba4cb6169f
feat: addition of blank_penalty config in online_recognizer ( #1232 )
2024-08-08 09:10:17 +08:00
Fangjun Kuang
8a5f5c1999
Fix python two pass ASR examples ( #1230 )
2024-08-07 18:35:38 +08:00
xsjk
1da75ee3c0
Fix typo in offline-lm-config.cc ( #1229 )
2024-08-07 15:38:34 +08:00
Fangjun Kuang
9ee2943ed4
Add CI tests for online punctuation models ( #1226 )
2024-08-06 18:10:30 +08:00
Fangjun Kuang
375c055ff8
Fix style issues for online punctuation source files ( #1225 )
2024-08-06 17:43:24 +08:00
jianyou
1414e4dc61
Add online punctuation and casing prediction model for English language ( #1224 )
2024-08-06 17:33:38 +08:00
Fangjun Kuang
52830cc910
Add MeloTTS example for ios ( #1223 )
2024-08-06 14:48:54 +08:00
Fangjun Kuang
6422966a7f
Support passing TTS callback in Swift API ( #1218 )
2024-08-05 14:06:21 +08:00
Fangjun Kuang
9caa488019
Fix setting SenseVoice language. ( #1214 )
2024-08-04 19:02:23 +08:00
Fangjun Kuang
c2dce19140
Update README to include Rust. ( #1212 )
2024-08-04 12:20:05 +08:00
Fangjun Kuang
d5f486878d
Remove libonnxruntime_providers_cuda.so as a dependency. ( #1210 )
2024-08-03 16:25:23 +08:00
Fangjun Kuang
561d04dd92
describe how to add new words for MeloTTS models ( #1209 )
2024-08-03 11:19:02 +08:00
Fangjun Kuang
35c1b4a7a9
Add ReazonSpeech Japanese pre-trained model ( #1203 )
2024-08-02 10:21:24 +08:00
Fangjun Kuang
53484fcd9b
Fix reading non-standard wav files. ( #1199 )
2024-08-01 17:48:04 +08:00
Fangjun Kuang
ec98110e11
Add speaker identification and verification exmaple for Dart API ( #1194 )
2024-07-31 13:53:52 +08:00
Fangjun Kuang
963aaba82b
Add Chinese+English tts example for flutter ( #1192 )
2024-07-30 18:38:43 +08:00
Fangjun Kuang
c1b5fce01b
Fix copying asset files for flutter examples. ( #1191 )
...
If the target file exists but has a different file size, we need
to copy the source file to the target file.
2024-07-30 18:24:56 +08:00
Fangjun Kuang
9e02f88dbb
Non-streaming WebSocket client for Java. ( #1190 )
2024-07-30 17:21:33 +08:00
Fangjun Kuang
06fd50f536
Add test about whisper large-v3 for .Net ( #1187 )
2024-07-29 20:49:38 +08:00
Fangjun Kuang
86b4c9f535
Fix splitting sentences for MeloTTS ( #1186 )
2024-07-29 17:04:45 +08:00
Fangjun Kuang
b1711ecaa1
Fix ffmpeg c api example ( #1185 )
2024-07-29 14:27:55 +08:00
Fangjun Kuang
646f99c870
Dart API for adding punctuations to text ( #1182 )
2024-07-29 12:41:52 +08:00
Fangjun Kuang
cd1fedaa49
Add Dart API for audio tagging ( #1181 )
2024-07-29 11:15:14 +08:00
Fangjun Kuang
69b6b47d91
Add vad with non-streaming ASR examples for Dart API ( #1180 )
2024-07-28 23:01:03 +08:00
Fangjun Kuang
d279c8d20e
Add more Python examples for SenseVoice ( #1179 )
2024-07-28 21:54:38 +08:00
Fangjun Kuang
9e005f53c3
fix building MFC examples ( #1178 )
2024-07-28 14:07:25 +08:00
Fangjun Kuang
1f8e575133
Add TTS example for Java API. ( #1176 )
...
It plays the generated audio as it is still generating.
2024-07-28 12:07:19 +08:00
Fangjun Kuang
4e6aeff07e
Refactor C API to prefix each API with SherpaOnnx. ( #1171 )
2024-07-26 18:47:02 +08:00
Fangjun Kuang
994c3e7c96
Add VAD + Non-streaming ASR example for JavaScript API. ( #1170 )
2024-07-26 12:42:08 +08:00
Fangjun Kuang
299f1a852b
Fix style issues reported by clang-tidy ( #1167 )
2024-07-23 09:26:36 +08:00
thewh1teagle
d32a46169f
feat: add directml support ( #1153 )
2024-07-22 23:50:48 +08:00
Fangjun Kuang
ea1d81bdfe
C api example for sense voice ( #1165 )
2024-07-22 16:54:00 +08:00
Fangjun Kuang
dd300b1de5
Add Java and Kotlin API for sense voice ( #1164 )
2024-07-22 14:08:40 +08:00