Fangjun Kuang
e7ffcbd677
Add APIs about max speech duration in VAD for various programming languages ( #1349 )
2024-09-14 12:30:13 +08:00
Fangjun Kuang
544857b097
Fix building ( #1343 )
2024-09-13 13:33:52 +08:00
lxiao336
65cfa7548a
re-pull-request allow tokens and hotwords be loaded from buffered string driectly ( #1339 )
...
Co-authored-by: xiao <shawl336@163.com >
2024-09-13 09:58:17 +08:00
Fangjun Kuang
e66d4c414a
Fix releasing dart packages. ( #1317 )
2024-09-04 12:12:13 +08:00
Fangjun Kuang
d60a4d418e
Provide prebuilt .jar files for different java versions. ( #1307 )
2024-08-30 14:16:31 +08:00
Fangjun Kuang
6b8877f185
Downgrade flutter sdk versions. ( #1305 )
2024-08-30 11:47:27 +08:00
Fangjun Kuang
0ccd3a4c3f
remove extra files from linux/macos/windows jni libs ( #1301 )
2024-08-29 10:45:38 +08:00
Fangjun Kuang
9064430c3e
Fix releasing wasm app for vad+asr ( #1300 )
2024-08-29 08:47:38 +08:00
Fangjun Kuang
6ec57327ce
add vad+sense voice example for C API ( #1291 )
2024-08-27 16:11:24 +08:00
Fangjun Kuang
5ed8e31868
Add VAD and keyword spotting for the Node package with WebAssembly ( #1286 )
2024-08-24 23:05:54 +08:00
Fangjun Kuang
537e163dd0
WebAssembly example for VAD + Non-streaming ASR ( #1284 )
2024-08-24 13:24:52 +08:00
Fangjun Kuang
1ef8a7a202
Add WebAssembly for VAD ( #1281 )
2024-08-23 17:08:37 +08:00
Fangjun Kuang
fb09f8fae3
Set batch size to 1 for more streaming ASR models ( #1280 )
2024-08-23 11:06:55 +08:00
Fangjun Kuang
0e0d04a97a
Provide models for mobile-only platforms by fixing batch size to 1 ( #1276 )
2024-08-22 19:36:24 +08:00
Fangjun Kuang
5a2aa110b8
Text to speech API for Object Pascal. ( #1273 )
2024-08-20 20:52:16 +08:00
Fangjun Kuang
f93f0ca94d
Use a separate thread to initialize models for lazarus examples. ( #1270 )
...
So that the main thread is not blocked and the user interface is responsive.
2024-08-18 14:59:48 +08:00
Fangjun Kuang
63713ecbf0
Build generating subtitles APPs for more models ( #1265 )
2024-08-16 20:11:24 +08:00
Fangjun Kuang
fbe35ba736
Add Lazarus example for generating subtitles using Silero VAD with non-streaming ASR ( #1251 )
2024-08-15 22:19:45 +08:00
Fangjun Kuang
ca729faebf
Support reading multi-channel wave files with 8/16/32-bit encoded samples ( #1258 )
2024-08-15 14:54:43 +08:00
Han Zhu
f300ec0f98
Add more C API examples ( #1255 )
...
C API examples for zipformer, paraformer, and TeleSpeech-ASR CTC models.
2024-08-14 10:52:47 +08:00
Fangjun Kuang
619279b162
Pascal API for VAD ( #1249 )
2024-08-13 16:16:51 +08:00
Fangjun Kuang
a7dc6c2c16
Pascal API for non-streaming ASR ( #1247 )
2024-08-12 23:33:35 +08:00
Fangjun Kuang
5791b695ea
Pascal API for streaming ASR ( #1246 )
2024-08-12 19:55:51 +08:00
Fangjun Kuang
65f1c0fab2
Add Pascal API for reading wave files ( #1243 )
2024-08-11 22:43:42 +08:00
Fangjun Kuang
9ee2943ed4
Add CI tests for online punctuation models ( #1226 )
2024-08-06 18:10:30 +08:00
Fangjun Kuang
561d04dd92
describe how to add new words for MeloTTS models ( #1209 )
2024-08-03 11:19:02 +08:00
Fangjun Kuang
35c1b4a7a9
Add ReazonSpeech Japanese pre-trained model ( #1203 )
2024-08-02 10:21:24 +08:00
Fangjun Kuang
ec98110e11
Add speaker identification and verification exmaple for Dart API ( #1194 )
2024-07-31 13:53:52 +08:00
Fangjun Kuang
06fd50f536
Add test about whisper large-v3 for .Net ( #1187 )
2024-07-29 20:49:38 +08:00
Fangjun Kuang
b1711ecaa1
Fix ffmpeg c api example ( #1185 )
2024-07-29 14:27:55 +08:00
Fangjun Kuang
646f99c870
Dart API for adding punctuations to text ( #1182 )
2024-07-29 12:41:52 +08:00
Fangjun Kuang
cd1fedaa49
Add Dart API for audio tagging ( #1181 )
2024-07-29 11:15:14 +08:00
Fangjun Kuang
69b6b47d91
Add vad with non-streaming ASR examples for Dart API ( #1180 )
2024-07-28 23:01:03 +08:00
Fangjun Kuang
d279c8d20e
Add more Python examples for SenseVoice ( #1179 )
2024-07-28 21:54:38 +08:00
Fangjun Kuang
ea1d81bdfe
C api example for sense voice ( #1165 )
2024-07-22 16:54:00 +08:00
Fangjun Kuang
dd300b1de5
Add Java and Kotlin API for sense voice ( #1164 )
2024-07-22 14:08:40 +08:00
Fangjun Kuang
ac8223bd8a
Add Dart API for keyword spotter ( #1162 )
2024-07-22 10:53:34 +08:00
Fangjun Kuang
70d14353bb
Add WebAssembly for SenseVoice ( #1158 )
2024-07-21 15:39:55 +08:00
Fangjun Kuang
8f4d332aab
Add Go API for SenseVoice ( #1154 )
2024-07-20 23:41:53 +08:00
Fangjun Kuang
25f0a10468
Add C++ runtime for SenseVoice models ( #1148 )
2024-07-18 22:54:18 +08:00
Fangjun Kuang
346f419f39
export sense-voice to onnx ( #1144 )
2024-07-18 00:18:38 +08:00
Fangjun Kuang
4198d9a166
Provide pre-built wheels with CUDA support. ( #1143 )
2024-07-17 22:59:13 +08:00
Fangjun Kuang
803c02db0a
publish all pre-built wheels to huggingface ( #1142 )
...
pypi.org provides only 10GB of free space for open-source projects.
Each new release of sherpa-onnx occupies about 800MB, so we have to delete previous releases otherwise pypi.org refuses to accept new releases due to limited spaces.
To let users install previous versions, we also publish wheels to huggingface and users can find them at
https://k2-fsa.github.io/sherpa/onnx/cpu.html
and
https://k2-fsa.github.io/sherpa/onnx/cpu-cn.html (for users without access to huggingface.co)
2024-07-17 14:41:27 +08:00
Fangjun Kuang
9e448d03bc
Provide npm package for 32-bit Windows x86 ( #1141 )
2024-07-17 12:33:15 +08:00
Fangjun Kuang
960eb7529e
Add C++ runtime for MeloTTS ( #1138 )
2024-07-16 15:55:02 +08:00
Fangjun Kuang
95485411fa
Support English for MeloTTS models. ( #1134 )
2024-07-15 19:49:22 +08:00
Fangjun Kuang
c35200dccf
Revert to onnxruntime 1.17.1 ( #1131 )
2024-07-15 14:24:08 +08:00
Fangjun Kuang
04c2319c2c
Export MeloTTS to ONNX ( #1129 )
2024-07-15 10:47:19 +08:00
Fangjun Kuang
ab71c3976d
Add int8 quantized whisper large models ( #1126 )
2024-07-13 22:30:06 +08:00
Fangjun Kuang
3951a12f8d
Add pre-trained models for the Libriheavy dataset ( #1122 )
2024-07-13 19:21:13 +08:00