Commit Graph

604 Commits

Author SHA1 Message Date
Fangjun Kuang
648903834b Add CXX API for MatchaTTS models (#1676) 2025-01-03 14:16:36 +08:00
Fangjun Kuang
9aa4897a9e Add C API for MatchaTTS models (#1675) 2025-01-03 12:17:26 +08:00
Fangjun Kuang
a00d3b4821 Add Java API for Matcha-TTS models. (#1673) 2025-01-02 15:15:30 +08:00
Fangjun Kuang
f457baea42 Support Matcha-TTS models using espeak-ng (#1672) 2025-01-02 13:46:43 +08:00
Fangjun Kuang
3422b9388d Add Kotlin API for Matcha-TTS models. (#1668) 2024-12-31 19:20:52 +08:00
Fangjun Kuang
ebe92e523d Remove spaces after punctuations for TTS (#1666) 2024-12-31 16:06:27 +08:00
Fangjun Kuang
d3538531c4 Fix initialize TTS in Python. (#1664) 2024-12-31 15:14:56 +08:00
Fangjun Kuang
2c2926af7d Add C++ runtime for Matcha-TTS (#1627) 2024-12-31 12:44:14 +08:00
yujinqiu
5c2cc48f50 Add swift online punctuation (#1661) 2024-12-31 11:26:32 +08:00
Fangjun Kuang
b6f0f5fc2e Support removing invalid utf-8 sequences. (#1648) 2024-12-25 19:32:13 +08:00
Fangjun Kuang
08d771337b Add a byte-level BPE Chinese+English non-streaming zipformer model (#1645) 2024-12-24 16:56:49 +08:00
Fangjun Kuang
d00d1c6298 Fix GitHub actions. (#1642) 2024-12-24 11:34:35 +08:00
Fangjun Kuang
b76cd9033a Support decoding with byte-level BPE (bbpe) models. (#1633) 2024-12-20 19:21:32 +08:00
Fangjun Kuang
e639c70d78 Support linking onnxruntime statically for Android (#1619) 2024-12-14 09:53:44 +08:00
Fangjun Kuang
be87f866f3 Use aar in Android Java demo. (#1616) 2024-12-12 18:26:54 +08:00
Fangjun Kuang
4dc4f1a708 Provide sherpa-onnx.aar for Android (#1615) 2024-12-12 16:59:00 +08:00
Fangjun Kuang
914cbad6a3 Add speaker diarization demo for HarmonyOS (#1610) 2024-12-10 20:11:44 +08:00
Fangjun Kuang
1bae4085ca Add speaker diarization API for HarmonyOS. (#1609) 2024-12-10 16:03:03 +08:00
Fangjun Kuang
314545f938 Add speaker identification APIs for HarmonyOS (#1607)
* Add speaker embedding extractor API for HarmonyOS

* Add ArkTS API for speaker identification
2024-12-09 19:23:18 +08:00
Fangjun Kuang
a743a4400f Add on-device real-time ASR demo for HarmonyOS (#1606) 2024-12-09 16:40:15 +08:00
Fangjun Kuang
74a8735f7a Add on-device tex-to-speech (TTS) demo for HarmonyOS (#1590) 2024-12-04 14:27:12 +08:00
Fangjun Kuang
dc3287f3a8 Add HarmonyOS support for text-to-speech. (#1584) 2024-12-01 21:43:34 +08:00
Fangjun Kuang
315d8e2a47 Publish sherpa_onnx.har for HarmonyOS (#1572) 2024-11-28 17:30:16 +08:00
Fangjun Kuang
109fb799ca fix building for Android (#1568) 2024-11-27 10:36:16 +08:00
Fangjun Kuang
2101227269 Add streaming ASR support for HarmonyOS. (#1565) 2024-11-26 18:36:56 +08:00
Fangjun Kuang
298b6b6fda Add non-streaming ASR support for HarmonyOS. (#1564) 2024-11-26 16:38:35 +08:00
Fangjun Kuang
31d6206fde HarmonyOS support for VAD. (#1561) 2024-11-24 16:29:24 +08:00
Fangjun Kuang
f97daed408 Fixes #1512 (#1522) 2024-11-08 21:07:36 +08:00
Fangjun Kuang
4eeb336f59 Export the English TTS model from MeloTTS (#1509) 2024-11-04 07:54:19 +08:00
Fangjun Kuang
6ee8c99c5d Fix building (#1508) 2024-11-03 19:47:04 +08:00
Fangjun Kuang
9ab89c33bc Support building GPU-capable sherpa-onnx on Linux aarch64. (#1500)
Thanks to @Peakyxh for providing pre-built onnxruntime libraries 
with CUDA support for Linux aarch64.

Tested on Jetson nano b01
2024-11-01 11:16:28 +08:00
Fangjun Kuang
a3c89aa0d8 Add two-pass ASR Android APKs for Moonshine models. (#1499) 2024-10-31 17:54:16 +08:00
Fangjun Kuang
9fa3bc40d7 Fix reading tokens.txt on Windows. (#1497) 2024-10-30 12:13:11 +08:00
Fangjun Kuang
356da3b54c Publish pre-built macos xcframework (#1490) 2024-10-29 12:26:26 +08:00
Fangjun Kuang
cdd8e1bbcb Add Pascal API for Moonshine models (#1482) 2024-10-27 12:21:16 +08:00
Fangjun Kuang
2ca2985d04 Add C and C++ API for Moonshine models (#1476) 2024-10-26 23:24:46 +08:00
Fangjun Kuang
bd4b223920 Add Kotlin and Java API for Moonshine models (#1474) 2024-10-26 22:30:29 +08:00
Fangjun Kuang
669f5ef441 Add C++ runtime and Python APIs for Moonshine models (#1473) 2024-10-26 14:34:07 +08:00
Fangjun Kuang
707cf792c5 Add GigaAM NeMo transducer model for Russian ASR (#1467) 2024-10-25 15:20:13 +08:00
Fangjun Kuang
b41f6d2c94 Support GigaAM CTC models for Russian ASR (#1464)
See also https://github.com/salute-developers/GigaAM
2024-10-25 10:55:16 +08:00
Fangjun Kuang
a5295aad10 Handle NaN embeddings in speaker diarization. (#1461)
See also https://github.com/thewh1teagle/sherpa-rs/issues/33
2024-10-24 14:03:09 +08:00
Fangjun Kuang
b3e05f6dc4 Fix style issues (#1458) 2024-10-24 11:15:08 +08:00
Fangjun Kuang
ceb69ebd94 Add C++ API for non-streaming ASR (#1456) 2024-10-23 16:40:12 +08:00
Fangjun Kuang
effd5ef2be Add C++ API for streaming ASR. (#1455)
It is a wrapper around the C API.
2024-10-23 12:07:43 +08:00
YeyuchenBa
bcaa91ed36 update java for hotword jar (#1444)
---------

Co-authored-by: root <1552138571@qq.com>
2024-10-18 18:07:51 +08:00
Zazzle516
4783c8f590 fix "log10" compile error by import CMATH lib (#1438) 2024-10-17 14:50:04 +08:00
Fangjun Kuang
5a22f74b2b Android demo for speaker diarization (#1423) 2024-10-13 14:02:57 +08:00
Fangjun Kuang
94b26ff07c Android JNI support for speaker diarization (#1421) 2024-10-12 13:03:48 +08:00
Fangjun Kuang
5e273c5be4 Pascal API for speaker diarization (#1420) 2024-10-12 12:28:38 +08:00
Fangjun Kuang
1ed803adc1 Dart API for speaker diarization (#1418) 2024-10-11 21:17:41 +08:00