796 Commits

Author SHA1 Message Date
Fangjun Kuang
ae2bc17168 Build websocket related binaries for embedded systems. (#1327) 2024-09-08 17:16:58 +08:00
Michael Twohey
b409b0a958 Fixed the C api calls and created the TTS project file (#1324)
Co-authored-by: Michael Twohey <mtwohey@americanambulance.com>
2024-09-07 23:25:02 +08:00
SilverSulfide
888f74bf3c Re-implement LM rescore for online transducer (#1231)
Co-authored-by: Martins Kronis <martins.kuznecovs@tilde.lv>
2024-09-06 10:01:25 +08:00
RGdevz
1f29e4a1a9 throw error instead of exit (#1323) 2024-09-06 09:59:21 +08:00
Fangjun Kuang
e66d4c414a Fix releasing dart packages. (#1317) 2024-09-04 12:12:13 +08:00
Fangjun Kuang
cc462316db Release v1.10.24 (#1309) 2024-08-30 17:27:08 +08:00
Fangjun Kuang
d60a4d418e Provide prebuilt .jar files for different java versions. (#1307) 2024-08-30 14:16:31 +08:00
Fangjun Kuang
3687c9f60a Reduce onnxruntime log output. (#1306)
Change the logging level from WARNING to ERROR.
2024-08-30 12:50:34 +08:00
Fangjun Kuang
6b8877f185 Downgrade flutter sdk versions. (#1305) 2024-08-30 11:47:27 +08:00
Fangjun Kuang
c38634dfcf two-pass Android APK for SenseVoice (#1302) 2024-08-29 12:08:49 +08:00
Fangjun Kuang
0ccd3a4c3f remove extra files from linux/macos/windows jni libs (#1301) 2024-08-29 10:45:38 +08:00
Fangjun Kuang
9064430c3e Fix releasing wasm app for vad+asr (#1300) 2024-08-29 08:47:38 +08:00
Fangjun Kuang
ca30d83915 Avoid SherpaOnnxSpeakerEmbeddingManagerFreeBestMatches freeing null. (#1296)
Fixes #1295
2024-08-28 10:42:36 +08:00
Fangjun Kuang
22c6f81393 Fix VAD+ASR example for Dart API. (#1294)
There is no need to invoke vad.isDetected().
2024-08-27 22:15:50 +08:00
Fangjun Kuang
a2a70900d6 ADD VAD+ASR example for dart with CircularBuffer. (#1293) 2024-08-27 19:29:34 +08:00
Fangjun Kuang
6ec57327ce add vad+sense voice example for C API (#1291) 2024-08-27 16:11:24 +08:00
Emmanuel Schmidbauer
a8556e31ba add Tokens []string, Timestamps []float32, Lang string, Emotion string, Event string (#1277) 2024-08-27 06:35:59 +08:00
Fangjun Kuang
17c8237ee4 Fix releasing npm package and fix building Android VAD+ASR example (#1288) 2024-08-26 10:18:48 +08:00
Hán Trung Kiên
452555b218 update generate-asset-list.py (#1287) 2024-08-25 08:57:16 +08:00
Fangjun Kuang
5ed8e31868 Add VAD and keyword spotting for the Node package with WebAssembly (#1286) 2024-08-24 23:05:54 +08:00
Fangjun Kuang
537e163dd0 WebAssembly example for VAD + Non-streaming ASR (#1284) 2024-08-24 13:24:52 +08:00
Fangjun Kuang
1ef8a7a202 Add WebAssembly for VAD (#1281) 2024-08-23 17:08:37 +08:00
Fangjun Kuang
fb09f8fae3 Set batch size to 1 for more streaming ASR models (#1280) 2024-08-23 11:06:55 +08:00
Malcolm Ke Win
c61423ec5a Update wave-reader.cc (#1278)
Add the missing "#include <cstdint>".
2024-08-22 23:22:45 +08:00
Fangjun Kuang
0e0d04a97a Provide models for mobile-only platforms by fixing batch size to 1 (#1276) 2024-08-22 19:36:24 +08:00
Robin Zhong
d8001d6edc Update the Kotlin API to better release native objects and add user-friendly APIs. (#1275) 2024-08-22 19:18:11 +08:00
Fangjun Kuang
5a2aa110b8 Text to speech API for Object Pascal. (#1273) 2024-08-20 20:52:16 +08:00
Fangjun Kuang
e34a1a2aa3 Object pascal examples for recording and playing audio with portaudio. (#1271)
The recording example can be used for speech recognition while the playing example can be used for text to speech.

The portaudio wrapper for object pascal is copied from
https://github.com/UltraStar-Deluxe/USDX/blob/master/src/lib/portaudio/portaudio.pas
2024-08-18 19:51:08 +08:00
Fangjun Kuang
f93f0ca94d Use a separate thread to initialize models for lazarus examples. (#1270)
So that the main thread is not blocked and the user interface is responsive.
2024-08-18 14:59:48 +08:00
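The idea in the commit above (load the model off the main thread so the user interface stays responsive) can be sketched in Python. This is only an analogy: the actual change is in the Lazarus (Object Pascal) examples, and `load_model`, `ModelHolder`, and the returned dictionary are hypothetical stand-ins, not part of sherpa-onnx.

```python
import threading
import time


def load_model():
    """Hypothetical stand-in for a slow model initialization."""
    time.sleep(0.1)  # simulate expensive loading work
    return {"name": "demo-model"}


class ModelHolder:
    """Loads a model on a worker thread and signals when it is ready."""

    def __init__(self):
        self.model = None
        self.ready = threading.Event()

    def start_loading(self):
        # Daemon thread: it does not keep the process alive on exit.
        worker = threading.Thread(target=self._load, daemon=True)
        worker.start()
        return worker

    def _load(self):
        self.model = load_model()
        self.ready.set()  # tell waiters that self.model is usable


holder = ModelHolder()
holder.start_loading()
# The main thread is free here (for example, to run a UI event loop).
# Before using the model, wait for the worker to finish.
holder.ready.wait(timeout=5)
```

A GUI program would typically poll `holder.ready.is_set()` from a timer or have the worker post an event to the UI thread rather than block in `wait()`.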
Emmanuel Schmidbauer
8c087d9110 flutter: add lang, emotion, event to OfflineRecognizerResult (#1268) 2024-08-17 07:21:59 +08:00
Fangjun Kuang
88809753ab Release v1.10.22 (#1267) 2024-08-16 22:40:49 +08:00
Fangjun Kuang
9dcea49dba Fix looking up OOVs in lexicon.txt for MeloTTS models. (#1266)
If an English word does not exist in the lexicon, we split
it into characters. For instance, if the word TTS does not
exist in lexicon.txt, we split it into 3 characters T, T, and S.
2024-08-16 22:10:03 +08:00
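The fallback described in the commit above can be illustrated with a minimal Python sketch. It assumes the lexicon is a dictionary keyed by lowercase entries; the toy `LEXICON`, its token values, and the helper names `split_if_oov` and `words_to_tokens` are hypothetical and do not reflect the actual C++ implementation or the real lexicon.txt contents.

```python
# Toy lexicon: entry -> list of tokens (values are made up for illustration).
LEXICON = {
    "hello": ["h", "e", "l", "o"],
    "t": ["t", "i"],
    "s": ["e", "s"],
}


def split_if_oov(word, lexicon):
    """Return [word] if it is in the lexicon, else its individual characters."""
    if word.lower() in lexicon:
        return [word]
    return list(word)


def words_to_tokens(words, lexicon):
    """Map words to tokens, falling back to per-character lookup for OOVs."""
    tokens = []
    for word in words:
        for piece in split_if_oov(word, lexicon):
            # Assumption: pieces still missing from the lexicon are skipped.
            tokens.extend(lexicon.get(piece.lower(), []))
    return tokens


print(split_if_oov("TTS", LEXICON))  # ['T', 'T', 'S']
print(words_to_tokens(["hello", "TTS"], LEXICON))
```

Here `TTS` is not a lexicon entry, so it is split into `T`, `T`, and `S`, and each character is then looked up on its own.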
Fangjun Kuang
63713ecbf0 Build generating subtitles APPs for more models (#1265) 2024-08-16 20:11:24 +08:00
Ikko Eltociear Ashimine
a3e98750e9 chore: update online-stream.h (#1264)
Fix typos.
2024-08-16 15:17:15 +08:00
Fangjun Kuang
fbe35ba736 Add Lazarus example for generating subtitles using Silero VAD with non-streaming ASR (#1251) 2024-08-15 22:19:45 +08:00
Fangjun Kuang
97a6a2a16a Enable IPO only for Release build. (#1261) 2024-08-15 18:16:42 +08:00
Fangjun Kuang
ca729faebf Support reading multi-channel wave files with 8/16/32-bit encoded samples (#1258) 2024-08-15 14:54:43 +08:00
Robin Zhong
62c4d4ab62 Add emotion, event of SenseVoice. (#1257)
* Add emotion, event of SenseVoice.

* Fix tokens size check and update java api.

https://github.com/k2-fsa/sherpa-onnx/pull/1257
2024-08-14 15:50:13 +08:00
Han Zhu
f300ec0f98 Add more C API examples (#1255)
C API examples for zipformer, paraformer, and TeleSpeech-ASR CTC models.
2024-08-14 10:52:47 +08:00
ivan provalov
9f06b059d7 Update offline-recognizer.cc (#1253)
Add a setConfig method to the JNI binding so that a config can be set on a previously initialized offline recognizer.
2024-08-13 23:04:51 +08:00
Fangjun Kuang
619279b162 Pascal API for VAD (#1249) 2024-08-13 16:16:51 +08:00
Fangjun Kuang
a7dc6c2c16 Pascal API for non-streaming ASR (#1247) 2024-08-12 23:33:35 +08:00
Fangjun Kuang
5791b695ea Pascal API for streaming ASR (#1246) 2024-08-12 19:55:51 +08:00
Fangjun Kuang
65f1c0fab2 Add Pascal API for reading wave files (#1243) 2024-08-11 22:43:42 +08:00
Fangjun Kuang
968623a477 Exclude .DS_Store files from flutter tts assets (#1238) 2024-08-09 13:19:27 +08:00
Fangjun Kuang
94e256244d Add blank penalty for various language bindings. (#1234) 2024-08-08 10:43:31 +08:00
Parth Khiera
ba4cb6169f feat: addition of blank_penalty config in online_recognizer (#1232) 2024-08-08 09:10:17 +08:00
Fangjun Kuang
8a5f5c1999 Fix python two pass ASR examples (#1230) 2024-08-07 18:35:38 +08:00
xsjk
1da75ee3c0 Fix typo in offline-lm-config.cc (#1229) 2024-08-07 15:38:34 +08:00
Fangjun Kuang
9ee2943ed4 Add CI tests for online punctuation models (#1226) 2024-08-06 18:10:30 +08:00