Fangjun Kuang
|
fbe35ba736
|
Add Lazarus example for generating subtitles using Silero VAD with non-streaming ASR (#1251)
|
2024-08-15 22:19:45 +08:00 |
|
Fangjun Kuang
|
97a6a2a16a
|
Enable IPO only for Release build. (#1261)
|
2024-08-15 18:16:42 +08:00 |
|
Fangjun Kuang
|
ca729faebf
|
Support reading multi-channel wave files with 8/16/32-bit encoded samples (#1258)
|
2024-08-15 14:54:43 +08:00 |
|
Robin Zhong
|
62c4d4ab62
|
Add emotion, event of SenseVoice. (#1257)
* Add emotion, event of SenseVoice.
* Fix tokens size check and update java api.
https://github.com/k2-fsa/sherpa-onnx/pull/1257
|
2024-08-14 15:50:13 +08:00 |
|
Han Zhu
|
f300ec0f98
|
Add more C API examples (#1255)
C API examples for zipformer, paraformer, and TeleSpeech-ASR CTC models.
|
2024-08-14 10:52:47 +08:00 |
|
ivan provalov
|
9f06b059d7
|
Update offline-recognizer.cc (#1253)
Adding setConfig method to JNI to support setting a config on the previously initialized offline-recognizer.
|
2024-08-13 23:04:51 +08:00 |
|
Fangjun Kuang
|
619279b162
|
Pascal API for VAD (#1249)
|
2024-08-13 16:16:51 +08:00 |
|
Fangjun Kuang
|
a7dc6c2c16
|
Pascal API for non-streaming ASR (#1247)
|
2024-08-12 23:33:35 +08:00 |
|
Fangjun Kuang
|
5791b695ea
|
Pascal API for streaming ASR (#1246)
|
2024-08-12 19:55:51 +08:00 |
|
Fangjun Kuang
|
65f1c0fab2
|
Add Pascal API for reading wave files (#1243)
|
2024-08-11 22:43:42 +08:00 |
|
Fangjun Kuang
|
968623a477
|
Exclude .DS_Store files from flutter tts assets (#1238)
|
2024-08-09 13:19:27 +08:00 |
|
Fangjun Kuang
|
94e256244d
|
Add blank penalty for various language bindings. (#1234)
|
2024-08-08 10:43:31 +08:00 |
|
Parth Khiera
|
ba4cb6169f
|
feat: addition of blank_penalty config in online_recognizer (#1232)
|
2024-08-08 09:10:17 +08:00 |
|
Fangjun Kuang
|
8a5f5c1999
|
Fix python two pass ASR examples (#1230)
|
2024-08-07 18:35:38 +08:00 |
|
xsjk
|
1da75ee3c0
|
Fix typo in offline-lm-config.cc (#1229)
|
2024-08-07 15:38:34 +08:00 |
|
Fangjun Kuang
|
9ee2943ed4
|
Add CI tests for online punctuation models (#1226)
|
2024-08-06 18:10:30 +08:00 |
|
Fangjun Kuang
|
375c055ff8
|
Fix style issues for online punctuation source files (#1225)
|
2024-08-06 17:43:24 +08:00 |
|
jianyou
|
1414e4dc61
|
Add online punctuation and casing prediction model for English language (#1224)
|
2024-08-06 17:33:38 +08:00 |
|
Fangjun Kuang
|
52830cc910
|
Add MeloTTS example for ios (#1223)
|
2024-08-06 14:48:54 +08:00 |
|
Fangjun Kuang
|
6422966a7f
|
Support passing TTS callback in Swift API (#1218)
|
2024-08-05 14:06:21 +08:00 |
|
Fangjun Kuang
|
9caa488019
|
Fix setting SenseVoice language. (#1214)
|
2024-08-04 19:02:23 +08:00 |
|
Fangjun Kuang
|
c2dce19140
|
Update README to include Rust. (#1212)
|
2024-08-04 12:20:05 +08:00 |
|
Fangjun Kuang
|
d5f486878d
|
Remove libonnxruntime_providers_cuda.so as a dependency. (#1210)
|
2024-08-03 16:25:23 +08:00 |
|
Fangjun Kuang
|
561d04dd92
|
describe how to add new words for MeloTTS models (#1209)
|
2024-08-03 11:19:02 +08:00 |
|
Fangjun Kuang
|
35c1b4a7a9
|
Add ReazonSpeech Japanese pre-trained model (#1203)
|
2024-08-02 10:21:24 +08:00 |
|
Fangjun Kuang
|
53484fcd9b
|
Fix reading non-standard wav files. (#1199)
|
2024-08-01 17:48:04 +08:00 |
|
Fangjun Kuang
|
ec98110e11
|
Add speaker identification and verification exmaple for Dart API (#1194)
|
2024-07-31 13:53:52 +08:00 |
|
Fangjun Kuang
|
963aaba82b
|
Add Chinese+English tts example for flutter (#1192)
|
2024-07-30 18:38:43 +08:00 |
|
Fangjun Kuang
|
c1b5fce01b
|
Fix copying asset files for flutter examples. (#1191)
If the target file exists but has a different file size, we need
to copy the source file to the target file.
|
2024-07-30 18:24:56 +08:00 |
|
Fangjun Kuang
|
9e02f88dbb
|
Non-streaming WebSocket client for Java. (#1190)
|
2024-07-30 17:21:33 +08:00 |
|
Fangjun Kuang
|
06fd50f536
|
Add test about whisper large-v3 for .Net (#1187)
|
2024-07-29 20:49:38 +08:00 |
|
Fangjun Kuang
|
86b4c9f535
|
Fix splitting sentences for MeloTTS (#1186)
|
2024-07-29 17:04:45 +08:00 |
|
Fangjun Kuang
|
b1711ecaa1
|
Fix ffmpeg c api example (#1185)
|
2024-07-29 14:27:55 +08:00 |
|
Fangjun Kuang
|
646f99c870
|
Dart API for adding punctuations to text (#1182)
|
2024-07-29 12:41:52 +08:00 |
|
Fangjun Kuang
|
cd1fedaa49
|
Add Dart API for audio tagging (#1181)
|
2024-07-29 11:15:14 +08:00 |
|
Fangjun Kuang
|
69b6b47d91
|
Add vad with non-streaming ASR examples for Dart API (#1180)
|
2024-07-28 23:01:03 +08:00 |
|
Fangjun Kuang
|
d279c8d20e
|
Add more Python examples for SenseVoice (#1179)
|
2024-07-28 21:54:38 +08:00 |
|
Fangjun Kuang
|
9e005f53c3
|
fix building MFC examples (#1178)
|
2024-07-28 14:07:25 +08:00 |
|
Fangjun Kuang
|
1f8e575133
|
Add TTS example for Java API. (#1176)
It plays the generated audio as it is still generating.
|
2024-07-28 12:07:19 +08:00 |
|
Fangjun Kuang
|
4e6aeff07e
|
Refactor C API to prefix each API with SherpaOnnx. (#1171)
|
2024-07-26 18:47:02 +08:00 |
|
Fangjun Kuang
|
994c3e7c96
|
Add VAD + Non-streaming ASR example for JavaScript API. (#1170)
|
2024-07-26 12:42:08 +08:00 |
|
Fangjun Kuang
|
299f1a852b
|
Fix style issues reported by clang-tidy (#1167)
|
2024-07-23 09:26:36 +08:00 |
|
thewh1teagle
|
d32a46169f
|
feat: add directml support (#1153)
|
2024-07-22 23:50:48 +08:00 |
|
Fangjun Kuang
|
ea1d81bdfe
|
C api example for sense voice (#1165)
|
2024-07-22 16:54:00 +08:00 |
|
Fangjun Kuang
|
dd300b1de5
|
Add Java and Kotlin API for sense voice (#1164)
|
2024-07-22 14:08:40 +08:00 |
|
Fangjun Kuang
|
ac8223bd8a
|
Add Dart API for keyword spotter (#1162)
|
2024-07-22 10:53:34 +08:00 |
|
thewh1teagle
|
22a262f5e4
|
feat: add stt c api example (#1156)
|
2024-07-22 10:32:12 +08:00 |
|
Fangjun Kuang
|
1a471595a5
|
Fix Android build (#1161)
|
2024-07-22 09:27:30 +08:00 |
|
Fangjun Kuang
|
ffdb23a8ec
|
Add dart API for SenseVoice (#1159)
|
2024-07-21 21:48:12 +08:00 |
|
Fangjun Kuang
|
70d14353bb
|
Add WebAssembly for SenseVoice (#1158)
|
2024-07-21 15:39:55 +08:00 |
|