Fangjun Kuang
|
70165cb42d
|
Speaker diarization example with onnxruntime Python API (#1395)
|
2024-10-06 16:37:29 +08:00 |
|
Fangjun Kuang
|
66feecb2b5
|
support whisper turbo (#1390)
|
2024-10-02 18:13:34 +08:00 |
|
Fangjun Kuang
|
b965f14cf0
|
Add Python API for clustering (#1385)
|
2024-09-30 11:33:15 +08:00 |
|
Fangjun Kuang
|
bc08160820
|
Export Pyannote speaker segmentation models to onnx (#1382)
|
2024-09-29 14:23:56 +08:00 |
|
Fangjun Kuang
|
11f0cb7e1c
|
Support Parakeet models from NeMo (#1381)
|
2024-09-27 17:12:00 +08:00 |
|
Fangjun Kuang
|
12d04ce8ed
|
Fix running MeloTTS models on GPU. (#1379)
We need to use opset 18 to export the model to onnx.
|
2024-09-26 16:51:43 +08:00 |
|
Fangjun Kuang
|
d8809b520e
|
Fix CI errors introduced by supporting loading keywords from buffers (#1366)
|
2024-09-20 19:04:21 +08:00 |
|
Fangjun Kuang
|
576a3aa90d
|
Add non-streaming ONNX models for Russian ASR (#1358)
|
2024-09-18 13:43:49 +08:00 |
|
Fangjun Kuang
|
cddac52780
|
Support passing utf-8 strings from JavaScript to C++. (#1355)
We first convert utf-16 strings to Uint8Array and then we
pass the array to C++.
|
2024-09-18 11:03:42 +08:00 |
|
lllwan
|
bf06b268d0
|
Fix sherpa_onnx.go (#1353)
|
2024-09-17 13:39:56 +08:00 |
|
Fangjun Kuang
|
e7ffcbd677
|
Add APIs about max speech duration in VAD for various programming languages (#1349)
|
2024-09-14 12:30:13 +08:00 |
|
Fangjun Kuang
|
544857b097
|
Fix building (#1343)
|
2024-09-13 13:33:52 +08:00 |
|
Fangjun Kuang
|
6b8877f185
|
Downgrade flutter sdk versions. (#1305)
|
2024-08-30 11:47:27 +08:00 |
|
Fangjun Kuang
|
c38634dfcf
|
two-pass Android APK for SenseVoice (#1302)
|
2024-08-29 12:08:49 +08:00 |
|
Fangjun Kuang
|
9064430c3e
|
Fix releasing wasm app for vad+asr (#1300)
|
2024-08-29 08:47:38 +08:00 |
|
Emmanuel Schmidbauer
|
a8556e31ba
|
add Tokens []string, Timestamps []float32, Lang string, Emotion string, Event string (#1277)
|
2024-08-27 06:35:59 +08:00 |
|
Fangjun Kuang
|
17c8237ee4
|
Fix releasing npm package and fix building Android VAD+ASR example (#1288)
|
2024-08-26 10:18:48 +08:00 |
|
Fangjun Kuang
|
5ed8e31868
|
Add VAD and keyword spotting for the Node package with WebAssembly (#1286)
|
2024-08-24 23:05:54 +08:00 |
|
Fangjun Kuang
|
537e163dd0
|
WebAssembly example for VAD + Non-streaming ASR (#1284)
|
2024-08-24 13:24:52 +08:00 |
|
Fangjun Kuang
|
1ef8a7a202
|
Add WebAssembly for VAD (#1281)
|
2024-08-23 17:08:37 +08:00 |
|
Fangjun Kuang
|
fb09f8fae3
|
Set batch size to 1 for more streaming ASR models (#1280)
|
2024-08-23 11:06:55 +08:00 |
|
Fangjun Kuang
|
0e0d04a97a
|
Provide models for mobile-only platforms by fixing batch size to 1 (#1276)
|
2024-08-22 19:36:24 +08:00 |
|
Fangjun Kuang
|
5a2aa110b8
|
Text to speech API for Object Pascal. (#1273)
|
2024-08-20 20:52:16 +08:00 |
|
Fangjun Kuang
|
63713ecbf0
|
Build generating subtitles APPs for more models (#1265)
|
2024-08-16 20:11:24 +08:00 |
|
Fangjun Kuang
|
fbe35ba736
|
Add Lazarus example for generating subtitles using Silero VAD with non-streaming ASR (#1251)
|
2024-08-15 22:19:45 +08:00 |
|
Fangjun Kuang
|
94e256244d
|
Add blank penalty for various language bindings. (#1234)
|
2024-08-08 10:43:31 +08:00 |
|
Parth Khiera
|
ba4cb6169f
|
feat: addition of blank_penalty config in online_recognizer (#1232)
|
2024-08-08 09:10:17 +08:00 |
|
Fangjun Kuang
|
561d04dd92
|
describe how to add new words for MeloTTS models (#1209)
|
2024-08-03 11:19:02 +08:00 |
|
Fangjun Kuang
|
35c1b4a7a9
|
Add ReazonSpeech Japanese pre-trained model (#1203)
|
2024-08-02 10:21:24 +08:00 |
|
Fangjun Kuang
|
ec98110e11
|
Add speaker identification and verification example for Dart API (#1194)
|
2024-07-31 13:53:52 +08:00 |
|
Fangjun Kuang
|
646f99c870
|
Dart API for adding punctuations to text (#1182)
|
2024-07-29 12:41:52 +08:00 |
|
Fangjun Kuang
|
cd1fedaa49
|
Add Dart API for audio tagging (#1181)
|
2024-07-29 11:15:14 +08:00 |
|
Fangjun Kuang
|
69b6b47d91
|
Add vad with non-streaming ASR examples for Dart API (#1180)
|
2024-07-28 23:01:03 +08:00 |
|
Fangjun Kuang
|
4e6aeff07e
|
Refactor C API to prefix each API with SherpaOnnx. (#1171)
|
2024-07-26 18:47:02 +08:00 |
|
Fangjun Kuang
|
994c3e7c96
|
Add VAD + Non-streaming ASR example for JavaScript API. (#1170)
|
2024-07-26 12:42:08 +08:00 |
|
Fangjun Kuang
|
dd300b1de5
|
Add Java and Kotlin API for sense voice (#1164)
|
2024-07-22 14:08:40 +08:00 |
|
Fangjun Kuang
|
ac8223bd8a
|
Add Dart API for keyword spotter (#1162)
|
2024-07-22 10:53:34 +08:00 |
|
Fangjun Kuang
|
ffdb23a8ec
|
Add Dart API for SenseVoice (#1159)
|
2024-07-21 21:48:12 +08:00 |
|
Fangjun Kuang
|
c3260ef842
|
Add JavaScript API for SenseVoice (#1157)
|
2024-07-21 10:14:14 +08:00 |
|
Fangjun Kuang
|
8f4d332aab
|
Add Go API for SenseVoice (#1154)
|
2024-07-20 23:41:53 +08:00 |
|
Fangjun Kuang
|
e472180f2c
|
Add C# API for SenseVoice models (#1151)
|
2024-07-20 17:09:23 +08:00 |
|
Fangjun Kuang
|
25f0a10468
|
Add C++ runtime for SenseVoice models (#1148)
|
2024-07-18 22:54:18 +08:00 |
|
Fangjun Kuang
|
3bae5c3fe5
|
test exported sense voice models (#1147)
|
2024-07-18 12:12:44 +08:00 |
|
Fangjun Kuang
|
346f419f39
|
export sense-voice to onnx (#1144)
|
2024-07-18 00:18:38 +08:00 |
|
Fangjun Kuang
|
9e448d03bc
|
Provide npm package for 32-bit Windows x86 (#1141)
|
2024-07-17 12:33:15 +08:00 |
|
Fangjun Kuang
|
960eb7529e
|
Add C++ runtime for MeloTTS (#1138)
|
2024-07-16 15:55:02 +08:00 |
|
Fangjun Kuang
|
95485411fa
|
Support English for MeloTTS models. (#1134)
|
2024-07-15 19:49:22 +08:00 |
|
Fangjun Kuang
|
fa07bbc176
|
Add APK for small paraformer (#1133)
|
2024-07-15 19:44:36 +08:00 |
|
Fangjun Kuang
|
c35200dccf
|
Revert to onnxruntime 1.17.1 (#1131)
|
2024-07-15 14:24:08 +08:00 |
|
Fangjun Kuang
|
04c2319c2c
|
Export MeloTTS to ONNX (#1129)
|
2024-07-15 10:47:19 +08:00 |
|