lllwan
bf06b268d0
Fix sherpa_onnx.go ( #1353 )
2024-09-17 13:39:56 +08:00
Fangjun Kuang
9dade25d3e
Release v1.10.26 ( #1350 )
2024-09-14 14:37:42 +08:00
Fangjun Kuang
e7ffcbd677
Add APIs about max speech duration in VAD for various programming languages ( #1349 )
2024-09-14 12:30:13 +08:00
Fangjun Kuang
1423ddb1f0
Support specifying max speech duration for VAD. ( #1348 )
2024-09-14 10:57:46 +08:00
Fangjun Kuang
5d761712db
Support lang/emotion/event results from SenseVoice in Swift API. ( #1346 )
2024-09-13 19:43:46 +08:00
Fangjun Kuang
6bf9310cb4
Add links to projects using sherpa-onnx. ( #1345 )
2024-09-13 19:17:08 +08:00
Fangjun Kuang
211786e798
Release v1.10.25 ( #1344 )
2024-09-13 14:58:38 +08:00
Fangjun Kuang
544857b097
Fix building ( #1343 )
2024-09-13 13:33:52 +08:00
lxiao336
65cfa7548a
Re-pull-request: allow tokens and hotwords to be loaded directly from a buffered string ( #1339 )
Co-authored-by: xiao <shawl336@163.com>
2024-09-13 09:58:17 +08:00
Fangjun Kuang
6b6e7635ed
Fix computing features for CED audio tagging models. ( #1341 )
See also
https://github.com/RicherMans/CED/blob/main/onnx_inference_with_kaldi.py
2024-09-12 19:38:18 +08:00
Askars
fa20ae1552
Preserve previous result as context for next segment ( #1335 )
Co-authored-by: vsd-vector <askars.salimbajevs@tilde.lv>
2024-09-11 10:44:13 +08:00
Fangjun Kuang
ba7f1a7439
Fix building ( #1331 )
2024-09-09 10:29:31 +08:00
Lim Yao Chong
3bffc24d64
Add Python binding for online punctuation models ( #1312 )
2024-09-09 10:26:53 +08:00
Fangjun Kuang
857cb5075c
Fix typos ( #1330 )
2024-09-09 10:22:42 +08:00
Fangjun Kuang
363b8e4c1e
Fix vad.Flush(). ( #1329 )
Fixes #1314
2024-09-08 17:52:53 +08:00
Fangjun Kuang
1977c8d04d
fix wasm app for streaming paraformer ( #1328 )
2024-09-08 17:49:19 +08:00
Fangjun Kuang
ae2bc17168
Build websocket related binaries for embedded systems. ( #1327 )
2024-09-08 17:16:58 +08:00
Michael Twohey
b409b0a958
Fixed the C api calls and created the TTS project file ( #1324 )
Co-authored-by: Michael Twohey <mtwohey@americanambulance.com>
2024-09-07 23:25:02 +08:00
SilverSulfide
888f74bf3c
Re-implement LM rescore for online transducer ( #1231 )
Co-authored-by: Martins Kronis <martins.kuznecovs@tilde.lv>
2024-09-06 10:01:25 +08:00
RGdevz
1f29e4a1a9
Throw an error instead of exiting ( #1323 )
2024-09-06 09:59:21 +08:00
Fangjun Kuang
e66d4c414a
Fix releasing dart packages. ( #1317 )
2024-09-04 12:12:13 +08:00
Fangjun Kuang
cc462316db
Release v1.10.24 ( #1309 )
2024-08-30 17:27:08 +08:00
Fangjun Kuang
d60a4d418e
Provide prebuilt .jar files for different java versions. ( #1307 )
2024-08-30 14:16:31 +08:00
Fangjun Kuang
3687c9f60a
Reduce onnxruntime log output. ( #1306 )
Change the logging level from WARNING to ERROR.
2024-08-30 12:50:34 +08:00
Fangjun Kuang
6b8877f185
Downgrade flutter sdk versions. ( #1305 )
2024-08-30 11:47:27 +08:00
Fangjun Kuang
c38634dfcf
two-pass Android APK for SenseVoice ( #1302 )
2024-08-29 12:08:49 +08:00
Fangjun Kuang
0ccd3a4c3f
remove extra files from linux/macos/windows jni libs ( #1301 )
2024-08-29 10:45:38 +08:00
Fangjun Kuang
9064430c3e
Fix releasing wasm app for vad+asr ( #1300 )
2024-08-29 08:47:38 +08:00
Fangjun Kuang
ca30d83915
Avoid SherpaOnnxSpeakerEmbeddingManagerFreeBestMatches freeing null. ( #1296 )
Fixes #1295
2024-08-28 10:42:36 +08:00
Fangjun Kuang
22c6f81393
Fix VAD+ASR example for Dart API. ( #1294 )
There is no need to invoke vad.isDetected().
2024-08-27 22:15:50 +08:00
Fangjun Kuang
a2a70900d6
Add VAD+ASR example for Dart with CircularBuffer. ( #1293 )
2024-08-27 19:29:34 +08:00
Fangjun Kuang
6ec57327ce
add vad+sense voice example for C API ( #1291 )
2024-08-27 16:11:24 +08:00
Emmanuel Schmidbauer
a8556e31ba
add Tokens []string, Timestamps []float32, Lang string, Emotion string, Event string ( #1277 )
2024-08-27 06:35:59 +08:00
Fangjun Kuang
17c8237ee4
Fix releasing npm package and fix building Android VAD+ASR example ( #1288 )
2024-08-26 10:18:48 +08:00
Hán Trung Kiên
452555b218
update generate-asset-list.py ( #1287 )
2024-08-25 08:57:16 +08:00
Fangjun Kuang
5ed8e31868
Add VAD and keyword spotting for the Node package with WebAssembly ( #1286 )
2024-08-24 23:05:54 +08:00
Fangjun Kuang
537e163dd0
WebAssembly example for VAD + Non-streaming ASR ( #1284 )
2024-08-24 13:24:52 +08:00
Fangjun Kuang
1ef8a7a202
Add WebAssembly for VAD ( #1281 )
2024-08-23 17:08:37 +08:00
Fangjun Kuang
fb09f8fae3
Set batch size to 1 for more streaming ASR models ( #1280 )
2024-08-23 11:06:55 +08:00
Malcolm Ke Win
c61423ec5a
Update wave-reader.cc ( #1278 )
* Update wave-reader.cc
missing "#include <cstdint>"
2024-08-22 23:22:45 +08:00
Fangjun Kuang
0e0d04a97a
Provide models for mobile-only platforms by fixing batch size to 1 ( #1276 )
2024-08-22 19:36:24 +08:00
Robin Zhong
d8001d6edc
Update Kotlin API to better release native objects and add user-friendly APIs. ( #1275 )
2024-08-22 19:18:11 +08:00
Fangjun Kuang
5a2aa110b8
Text to speech API for Object Pascal. ( #1273 )
2024-08-20 20:52:16 +08:00
Fangjun Kuang
e34a1a2aa3
Object pascal examples for recording and playing audio with portaudio. ( #1271 )
The recording example can be used for speech recognition while the playing example can be used for text to speech.
The portaudio wrapper for object pascal is copied from
https://github.com/UltraStar-Deluxe/USDX/blob/master/src/lib/portaudio/portaudio.pas
2024-08-18 19:51:08 +08:00
Fangjun Kuang
f93f0ca94d
Use a separate thread to initialize models for lazarus examples. ( #1270 )
So that the main thread is not blocked and the user interface is responsive.
2024-08-18 14:59:48 +08:00
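The idea in the commit above (load the model on a separate thread so the main thread stays responsive) can be sketched as follows. This is a minimal illustration in Python rather than Object Pascal, and it is not the code from the Lazarus examples: `ModelHolder`, `start_loading`, and `slow_loader` are hypothetical names, and the `time.sleep` call merely stands in for a slow model load.

```python
import threading
import time


class ModelHolder:
    """Holds a model that is loaded in the background (hypothetical helper)."""

    def __init__(self):
        self.model = None
        # Set once loading has finished, so the UI can poll or wait on it.
        self.ready = threading.Event()

    def start_loading(self, loader):
        """Run `loader` on a worker thread and return that thread."""

        def worker():
            self.model = loader()
            self.ready.set()

        thread = threading.Thread(target=worker, daemon=True)
        thread.start()
        return thread


def slow_loader():
    # Stand-in for an expensive model initialization (assumption).
    time.sleep(0.1)
    return "model"


if __name__ == "__main__":
    holder = ModelHolder()
    holder.start_loading(slow_loader)
    # The main thread is free to do other work while the model loads.
    while not holder.ready.wait(timeout=0.02):
        print("still loading; main thread remains responsive")
    print("loaded:", holder.model)
```

A GUI program would typically check `ready` from a timer or post an event back to the main loop instead of polling in a `while` loop.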
Emmanuel Schmidbauer
8c087d9110
flutter: add lang, emotion, event to OfflineRecognizerResult ( #1268 )
2024-08-17 07:21:59 +08:00
Fangjun Kuang
88809753ab
Release v1.10.22 ( #1267 )
2024-08-16 22:40:49 +08:00
Fangjun Kuang
9dcea49dba
Fix looking up OOVs in lexicon.txt for MeloTTS models. ( #1266 )
If an English word does not exist in the lexicon, we split
it into characters. For instance, if the word TTS does not
exist in lexicon.txt, we split it into 3 characters T, T, and S.
2024-08-16 22:10:03 +08:00
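The out-of-vocabulary fallback described in the commit above can be illustrated with a short sketch. It assumes the lexicon is held as a plain dictionary keyed by word; `split_oov` and the toy lexicon entries are hypothetical and are not taken from sherpa-onnx or from a real `lexicon.txt`.

```python
def split_oov(word, lexicon):
    """Return the units to look up for `word`.

    If the word is in the lexicon it is looked up whole; otherwise it is
    split into its individual characters, as the commit message describes.
    """
    if word in lexicon:
        return [word]
    return list(word)


# Toy lexicon with placeholder pronunciations (assumption, for illustration).
toy_lexicon = {
    "hello": ["<placeholder>"],
    "T": ["<placeholder>"],
    "S": ["<placeholder>"],
}

if __name__ == "__main__":
    print(split_oov("hello", toy_lexicon))  # word found: looked up whole
    print(split_oov("TTS", toy_lexicon))    # word missing: split into T, T, S
```

Each returned unit would then be looked up on its own, so a word missing from the lexicon still yields a pronunciation as long as its characters have entries.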
Fangjun Kuang
63713ecbf0
Build subtitle-generation apps for more models ( #1265 )
2024-08-16 20:11:24 +08:00
Ikko Eltociear Ashimine
a3e98750e9
chore: update online-stream.h ( #1264 )
Fix typos.
2024-08-16 15:17:15 +08:00