Fangjun Kuang
b3e05f6dc4
Fix style issues ( #1458 )
2024-10-24 11:15:08 +08:00
Fangjun Kuang
ceb69ebd94
Add C++ API for non-streaming ASR ( #1456 )
2024-10-23 16:40:12 +08:00
Fangjun Kuang
effd5ef2be
Add C++ API for streaming ASR. ( #1455 )
...
It is a wrapper around the C API.
2024-10-23 12:07:43 +08:00
JameWade
3edd8d7cf6
add java android demo ( #1454 )
2024-10-23 11:38:26 +08:00
YeyuchenBa
bcaa91ed36
update java for hotword jar ( #1444 )
...
---------
Co-authored-by: root <1552138571@qq.com >
2024-10-18 18:07:51 +08:00
Fangjun Kuang
1af8ad89e6
Add Java API example for hotwords. ( #1442 )
2024-10-18 16:35:31 +08:00
Fangjun Kuang
e0586f1876
add more models for speaker diarization ( #1440 )
2024-10-17 20:03:09 +08:00
Zazzle516
4783c8f590
fix "log10" compile error by import CMATH lib ( #1438 )
2024-10-17 14:50:04 +08:00
Fangjun Kuang
620597f501
Support https://huggingface.co/Revai/reverb-diarization-v1 ( #1437 )
2024-10-17 11:58:14 +08:00
lxiao336
471cbd83c6
updated onnxruntime-linux-aarch64.cmake so that libonnxruntime.so can be found for specific aarch64 cross-compilation environments ( #1436 )
...
Co-authored-by: xiao <shawl336@163.com >
2024-10-16 22:42:42 +08:00
Fangjun Kuang
593b96758b
Add Go API for offline punctuation models ( #1434 )
...
It is contributed by a community user
from [our QQ group](https://k2-fsa.github.io/sherpa/social-groups.html#qq ).
2024-10-16 17:16:47 +08:00
semxum
77dd5f73fc
Update README.md ( #1431 )
2024-10-14 18:25:27 +08:00
Fangjun Kuang
df4150dc5d
Upload speaker embedding models to huggingface ( #1428 )
...
See also
https://huggingface.co/spaces/k2-fsa/speaker-diarization
2024-10-14 16:20:00 +08:00
Fangjun Kuang
99f320b893
Release v1.10.28 ( #1424 )
2024-10-13 15:27:38 +08:00
Fangjun Kuang
5a22f74b2b
Android demo for speaker diarization ( #1423 )
2024-10-13 14:02:57 +08:00
Fangjun Kuang
94b26ff07c
Android JNI support for speaker diarization ( #1421 )
2024-10-12 13:03:48 +08:00
Fangjun Kuang
5e273c5be4
Pascal API for speaker diarization ( #1420 )
2024-10-12 12:28:38 +08:00
Fangjun Kuang
1ed803adc1
Dart API for speaker diarization ( #1418 )
2024-10-11 21:17:41 +08:00
Fangjun Kuang
1851ff6337
Java API for speaker diarization ( #1416 )
2024-10-11 16:51:40 +08:00
Fangjun Kuang
2d412b1190
Kotlin API for speaker diarization ( #1415 )
2024-10-11 14:41:53 +08:00
Fangjun Kuang
eefc172095
JavaScript API with WebAssembly for speaker diarization ( #1414 )
...
#1408 uses [node-addon-api](https://github.com/nodejs/node-addon-api ) to call C API from JavaScript, whereas this pull request uses WebAssembly to call C API from JavaScript.
2024-10-11 11:40:10 +08:00
Fangjun Kuang
f1b311ee4f
Handle audio files less than 10s long for speaker diarization. ( #1412 )
...
If the input audio file is less than 10 seconds long, there is only
one chunk, and there is no need to compute embeddings or
do clustering.
We can use the segmentation result from the speaker segmentation
model directly.
2024-10-11 10:27:16 +08:00
Fangjun Kuang
1d061df355
WebAssembly exmaple for speaker diarization ( #1411 )
2024-10-10 22:14:45 +08:00
Fangjun Kuang
67349b52f2
JavaScript API (node-addon) for speaker diarization ( #1408 )
2024-10-10 15:51:31 +08:00
Fangjun Kuang
a45e5dba99
C# API for speaker diarization ( #1407 )
2024-10-10 14:29:05 +08:00
Fangjun Kuang
bd50e79590
Update readme to include more external projects using sherpa-onnx ( #1405 )
2024-10-10 10:27:14 +08:00
Fangjun Kuang
1571344509
Swift API for speaker diarization ( #1404 )
2024-10-09 23:25:39 +08:00
Fangjun Kuang
df681e9807
Go API for speaker diarization ( #1403 )
2024-10-09 20:10:44 +08:00
Yongzeng Liu
97654122fa
docs(nodejs-addon-examples): add guide for pnpm user ( #1401 )
2024-10-09 18:12:41 +08:00
Fangjun Kuang
d468527f62
C API for speaker diarization ( #1402 )
2024-10-09 17:10:03 +08:00
Fangjun Kuang
8535b1d3bb
Python API for speaker diarization. ( #1400 )
2024-10-09 14:13:26 +08:00
Fangjun Kuang
59407edcad
C++ API for speaker diarization ( #1396 )
2024-10-09 12:01:20 +08:00
Fangjun Kuang
70165cb42d
Speaker diarization example with onnxruntime Python API ( #1395 )
2024-10-06 16:37:29 +08:00
Askars
5f50cbf65a
context_state is not set correctly when previous context is passed after reset ( #1393 )
...
Co-authored-by: vsd-vector <askars.salimbajevs@tilde.lv >
2024-10-03 16:42:09 +08:00
Fangjun Kuang
66feecb2b5
support whisper turbo ( #1390 )
2024-10-02 18:13:34 +08:00
Fangjun Kuang
b965f14cf0
Add Python API for clustering ( #1385 )
2024-09-30 11:33:15 +08:00
Fangjun Kuang
70568c2df7
Support Agglomerative clustering. ( #1384 )
...
We use the open-source implementation from
https://github.com/cdalitz/hclust-cpp
2024-09-29 23:44:29 +08:00
Fangjun Kuang
bc08160820
Export Pyannote speaker segmentation models to onnx ( #1382 )
2024-09-29 14:23:56 +08:00
Fangjun Kuang
11f0cb7e1c
Support Parakeet models from NeMo ( #1381 )
2024-09-27 17:12:00 +08:00
Fangjun Kuang
12d04ce8ed
Fix running MeloTTS models on GPU. ( #1379 )
...
We need to use opset 18 to export the model to onnx.
2024-09-26 16:51:43 +08:00
Flutter Painter
69c8e7b9d0
Update online_model.dart ( #1375 )
...
fix typo on french joiner file
2024-09-25 06:40:22 +08:00
Fangjun Kuang
d8809b520e
Fix CI errors introduced by supporting loading keywords from buffers ( #1366 )
2024-09-20 19:04:21 +08:00
lxiao336
06b61ccad8
Allow more online models to load tokens file from the memory ( #1352 )
...
Co-authored-by: xiao <shawl336@6163.com >
2024-09-20 16:38:41 +08:00
Fangjun Kuang
73c90ec871
Fix swift example for generating subtitles. ( #1362 )
...
We need to invoke vad.flush() at the end.
2024-09-20 11:44:25 +08:00
Fangjun Kuang
647b63ea44
Release v1.10.27 ( #1359 )
2024-09-19 10:49:29 +08:00
Fangjun Kuang
576a3aa90d
Add non-streaming ONNX models for Russian ASR ( #1358 )
2024-09-18 13:43:49 +08:00
Fangjun Kuang
7e642325f3
Fix building Flutter TTS examples for Linux ( #1356 )
2024-09-18 12:04:02 +08:00
Fangjun Kuang
cddac52780
Support passing utf-8 strings from JavaScript to C++. ( #1355 )
...
We first convert utf-16 strings to Uint8Array and then we
pass the array to C++.
2024-09-18 11:03:42 +08:00
lllwan
bf06b268d0
Fix sherpa_onnx.go ( #1353 )
2024-09-17 13:39:56 +08:00
Fangjun Kuang
9dade25d3e
Release v1.10.26 ( #1350 )
2024-09-14 14:37:42 +08:00