Commit Graph

895 Commits

Author SHA1 Message Date
Fangjun Kuang
f97daed408 Fixes #1512 (#1522) 2024-11-08 21:07:36 +08:00
VEP
f94cca71cf Fix: Reset sample-buffer after processing (#1521)
Co-authored-by: VEP <517138883@qq.com>
2024-11-08 19:04:34 +08:00
張小凡
86b1856c20 Reduce vad-sense-voice example code. (#1510) 2024-11-05 20:34:12 +08:00
Fangjun Kuang
4eeb336f59 Export the English TTS model from MeloTTS (#1509) 2024-11-04 07:54:19 +08:00
Fangjun Kuang
6ee8c99c5d Fix building (#1508) 2024-11-03 19:47:04 +08:00
Fangjun Kuang
f0cced1f37 Publish pre-built wheels with CUDA support for Linux aarch64. (#1507) 2024-11-03 19:15:11 +08:00
Fangjun Kuang
c5205f08bf Add an example for computing RTF about streaming ASR. (#1501) 2024-11-01 11:40:13 +08:00
Fangjun Kuang
9ab89c33bc Support building GPU-capable sherpa-onnx on Linux aarch64. (#1500)
Thanks to @Peakyxh for providing pre-built onnxruntime libraries 
with CUDA support for Linux aarch64.

Tested on Jetson nano b01
2024-11-01 11:16:28 +08:00
Fangjun Kuang
a3c89aa0d8 Add two-pass ASR Android APKs for Moonshine models. (#1499) 2024-10-31 17:54:16 +08:00
Fangjun Kuang
9fa3bc40d7 Fix reading tokens.txt on Windows. (#1497) 2024-10-30 12:13:11 +08:00
Fangjun Kuang
d9f65c9840 Update pybind11 to support numpy 2.0 (#1493) 2024-10-29 17:00:39 +08:00
Fangjun Kuang
d9c586ccf2 Removed unused TTS example code in .Net examples (#1492) 2024-10-29 14:59:12 +08:00
Fangjun Kuang
356da3b54c Publish pre-built macos xcframework (#1490) 2024-10-29 12:26:26 +08:00
彭震东
72dc68c8fa fix typo (#1488) 2024-10-28 21:30:18 +08:00
Fangjun Kuang
36a0e78884 Add some commonly used models to README.md (#1486) 2024-10-28 12:49:47 +08:00
Fangjun Kuang
9eb493f6bf Publish pre-built wheels for Python 3.13 (#1485) 2024-10-28 10:37:50 +08:00
Fangjun Kuang
91e090ff86 Release v1.10.30 (#1484) 2024-10-27 13:45:13 +08:00
Fangjun Kuang
3622104133 Add C# API for Moonshine models. (#1483)
* Also, return timestamps for non-streaming ASR.
2024-10-27 13:14:25 +08:00
Fangjun Kuang
cdd8e1bbcb Add Pascal API for Moonshine models (#1482) 2024-10-27 12:21:16 +08:00
Fangjun Kuang
54468a7370 Add Dart API for Moonshine models. (#1481) 2024-10-27 12:04:12 +08:00
Fangjun Kuang
6f261d39f3 Add JavaScript API for Moonshine models (#1480) 2024-10-27 11:31:01 +08:00
Fangjun Kuang
3d3edabb5f Add Go API for Moonshine models (#1479) 2024-10-27 09:39:09 +08:00
Fangjun Kuang
052b8645ba Add Go API examples for adding punctuations to text. (#1478) 2024-10-27 09:04:05 +08:00
Fangjun Kuang
4a4659aa4f Add Swift API for Moonshine models. (#1477) 2024-10-27 08:19:01 +08:00
Fangjun Kuang
2ca2985d04 Add C and C++ API for Moonshine models (#1476) 2024-10-26 23:24:46 +08:00
Fangjun Kuang
bd4b223920 Add Kotlin and Java API for Moonshine models (#1474) 2024-10-26 22:30:29 +08:00
Fangjun Kuang
669f5ef441 Add C++ runtime and Python APIs for Moonshine models (#1473) 2024-10-26 14:34:07 +08:00
Fangjun Kuang
0f2732e4e8 Publish pre-built JNI libs for Linux aarch64 (#1472) 2024-10-26 09:59:18 +08:00
Fangjun Kuang
b06b460851 Begin to support https://github.com/usefulsensors/moonshine (#1470) 2024-10-26 09:51:16 +08:00
Fangjun Kuang
3d6344ead3 Fix building node-addon for Windows x86. (#1469) 2024-10-25 18:49:33 +08:00
Fangjun Kuang
d5a2f52413 Release v1.10.29 (#1468) 2024-10-25 15:50:42 +08:00
Fangjun Kuang
707cf792c5 Add GigaAM NeMo transducer model for Russian ASR (#1467) 2024-10-25 15:20:13 +08:00
Fangjun Kuang
b41f6d2c94 Support GigaAM CTC models for Russian ASR (#1464)
See also https://github.com/salute-developers/GigaAM
2024-10-25 10:55:16 +08:00
Peakyxh
2b40079faf Add speaker identification with VAD and non-streaming ASR using ALSA (#1463) 2024-10-24 22:04:51 +08:00
Fangjun Kuang
a5295aad10 Handle NaN embeddings in speaker diarization. (#1461)
See also https://github.com/thewh1teagle/sherpa-rs/issues/33
2024-10-24 14:03:09 +08:00
Fangjun Kuang
b3e05f6dc4 Fix style issues (#1458) 2024-10-24 11:15:08 +08:00
Fangjun Kuang
ceb69ebd94 Add C++ API for non-streaming ASR (#1456) 2024-10-23 16:40:12 +08:00
Fangjun Kuang
effd5ef2be Add C++ API for streaming ASR. (#1455)
It is a wrapper around the C API.
2024-10-23 12:07:43 +08:00
JameWade
3edd8d7cf6 add java android demo (#1454) 2024-10-23 11:38:26 +08:00
YeyuchenBa
bcaa91ed36 update java for hotword jar (#1444)
---------

Co-authored-by: root <1552138571@qq.com>
2024-10-18 18:07:51 +08:00
Fangjun Kuang
1af8ad89e6 Add Java API example for hotwords. (#1442) 2024-10-18 16:35:31 +08:00
Fangjun Kuang
e0586f1876 add more models for speaker diarization (#1440) 2024-10-17 20:03:09 +08:00
Zazzle516
4783c8f590 fix "log10" compile error by import CMATH lib (#1438) 2024-10-17 14:50:04 +08:00
Fangjun Kuang
620597f501 Support https://huggingface.co/Revai/reverb-diarization-v1 (#1437) 2024-10-17 11:58:14 +08:00
lxiao336
471cbd83c6 updated onnxruntime-linux-aarch64.cmake so that libonnxruntime.so can be found for specific aarch64 cross-compilation environments (#1436)
Co-authored-by: xiao <shawl336@163.com>
2024-10-16 22:42:42 +08:00
Fangjun Kuang
593b96758b Add Go API for offline punctuation models (#1434)
It is contributed by a community user 
from [our QQ group](https://k2-fsa.github.io/sherpa/social-groups.html#qq).
2024-10-16 17:16:47 +08:00
semxum
77dd5f73fc Update README.md (#1431) 2024-10-14 18:25:27 +08:00
Fangjun Kuang
df4150dc5d Upload speaker embedding models to huggingface (#1428)
See also
https://huggingface.co/spaces/k2-fsa/speaker-diarization
2024-10-14 16:20:00 +08:00
Fangjun Kuang
99f320b893 Release v1.10.28 (#1424) 2024-10-13 15:27:38 +08:00
Fangjun Kuang
5a22f74b2b Android demo for speaker diarization (#1423) 2024-10-13 14:02:57 +08:00