Fangjun Kuang
316424b382
Add C++ and Python API for FireRedASR AED models ( #1867 )
2025-02-16 22:45:24 +08:00
Fangjun Kuang
944400e399
Fix spliting text by languages for kokoro tts. ( #1849 )
2025-02-13 18:19:34 +08:00
ahadjawaid
73d7c25233
Fix: made print sherpa_onnx_loge when it is in debug mode ( #1838 )
...
Currently, during normal use you may get a lot of print statements such as: `Use espeak-ng to handle the OOV: 'ipsum'` which may not be relevant unless you are debugging.
2025-02-11 00:22:50 +08:00
Fangjun Kuang
ad883d44fe
Support specifying voice in espeak-ng for kokoro tts models. ( #1836 )
2025-02-10 19:05:53 +08:00
Fangjun Kuang
d5da9430e8
Add PengChengStarling models to sherpa-onnx ( #1835 )
2025-02-10 18:23:40 +08:00
Fangjun Kuang
9559a10bd3
Add C++ support for MatchaTTS models not from icefall. ( #1834 )
2025-02-10 15:38:29 +08:00
Fangjun Kuang
69f489f0cd
Support scaling the duration of a pause in TTS. ( #1820 )
2025-02-08 12:47:26 +08:00
Fangjun Kuang
d38cb81014
Fix passing gb2312 encoded strings to tts on Windows ( #1819 )
2025-02-08 09:48:58 +08:00
Fangjun Kuang
7330f7519a
Add C API for Kokoro TTS 1.0 ( #1801 )
2025-02-07 14:30:40 +08:00
Fangjun Kuang
c84a833863
Add C++ and Python API for Kokoro 1.0 multilingual TTS model ( #1795 )
2025-02-06 22:57:13 +08:00
ahadjawaid
8677d83efc
Fix: Prepend 0 to tokenization to prevent word skipping for Kokoro. ( #1787 )
...
Addressed issue Skipping words #1777
2025-02-03 13:49:42 +08:00
Fangjun Kuang
f178e96bf0
Add keyword spotter C API for HarmonyOS ( #1769 )
2025-01-26 14:12:30 +08:00
Fangjun Kuang
8b989a851c
Fix keyword spotting. ( #1689 )
...
Reset the stream right after detecting a keyword
2025-01-20 16:41:10 +08:00
Fangjun Kuang
2d0869c709
Fix style issues ( #1718 )
2025-01-16 15:43:51 +08:00
Fangjun Kuang
ffc6b480a0
Add C++ and Python API for Kokoro TTS models. ( #1715 )
2025-01-16 14:24:51 +08:00
Fangjun Kuang
cbe07ac1b6
Release v1.10.39 ( #1702 )
2025-01-13 10:28:05 +08:00
Fangjun Kuang
1fe5fe495f
Add Android demo for MatchaTTS models. ( #1683 )
2025-01-06 06:44:09 +08:00
Fangjun Kuang
bf3330c906
Add HarmonyOS examples for MatchaTTS. ( #1678 )
2025-01-03 17:09:29 +08:00
Fangjun Kuang
9aa4897a9e
Add C API for MatchaTTS models ( #1675 )
2025-01-03 12:17:26 +08:00
Fangjun Kuang
a00d3b4821
Add Java API for Matcha-TTS models. ( #1673 )
2025-01-02 15:15:30 +08:00
Fangjun Kuang
f457baea42
Support Matcha-TTS models using espeak-ng ( #1672 )
2025-01-02 13:46:43 +08:00
Fangjun Kuang
3422b9388d
Add Kotlin API for Matcha-TTS models. ( #1668 )
2024-12-31 19:20:52 +08:00
Fangjun Kuang
ebe92e523d
Remove spaces after punctuations for TTS ( #1666 )
2024-12-31 16:06:27 +08:00
Fangjun Kuang
2c2926af7d
Add C++ runtime for Matcha-TTS ( #1627 )
2024-12-31 12:44:14 +08:00
Fangjun Kuang
b6f0f5fc2e
Support removing invalid utf-8 sequences. ( #1648 )
2024-12-25 19:32:13 +08:00
Fangjun Kuang
d00d1c6298
Fix GitHub actions. ( #1642 )
2024-12-24 11:34:35 +08:00
Fangjun Kuang
b76cd9033a
Support decoding with byte-level BPE (bbpe) models. ( #1633 )
2024-12-20 19:21:32 +08:00
Fangjun Kuang
1bae4085ca
Add speaker diarization API for HarmonyOS. ( #1609 )
2024-12-10 16:03:03 +08:00
Fangjun Kuang
314545f938
Add speaker identification APIs for HarmonyOS ( #1607 )
...
* Add speaker embedding extractor API for HarmonyOS
* Add ArkTS API for speaker identification
2024-12-09 19:23:18 +08:00
Fangjun Kuang
a743a4400f
Add on-device real-time ASR demo for HarmonyOS ( #1606 )
2024-12-09 16:40:15 +08:00
Fangjun Kuang
74a8735f7a
Add on-device tex-to-speech (TTS) demo for HarmonyOS ( #1590 )
2024-12-04 14:27:12 +08:00
Fangjun Kuang
dc3287f3a8
Add HarmonyOS support for text-to-speech. ( #1584 )
2024-12-01 21:43:34 +08:00
Fangjun Kuang
109fb799ca
fix building for Android ( #1568 )
2024-11-27 10:36:16 +08:00
Fangjun Kuang
2101227269
Add streaming ASR support for HarmonyOS. ( #1565 )
2024-11-26 18:36:56 +08:00
Fangjun Kuang
298b6b6fda
Add non-streaming ASR support for HarmonyOS. ( #1564 )
2024-11-26 16:38:35 +08:00
Fangjun Kuang
31d6206fde
HarmonyOS support for VAD. ( #1561 )
2024-11-24 16:29:24 +08:00
Fangjun Kuang
f97daed408
Fixes #1512 ( #1522 )
2024-11-08 21:07:36 +08:00
Fangjun Kuang
4eeb336f59
Export the English TTS model from MeloTTS ( #1509 )
2024-11-04 07:54:19 +08:00
Fangjun Kuang
6ee8c99c5d
Fix building ( #1508 )
2024-11-03 19:47:04 +08:00
Fangjun Kuang
9ab89c33bc
Support building GPU-capable sherpa-onnx on Linux aarch64. ( #1500 )
...
Thanks to @Peakyxh for providing pre-built onnxruntime libraries
with CUDA support for Linux aarch64.
Tested on Jetson nano b01
2024-11-01 11:16:28 +08:00
Fangjun Kuang
9fa3bc40d7
Fix reading tokens.txt on Windows. ( #1497 )
2024-10-30 12:13:11 +08:00
Fangjun Kuang
669f5ef441
Add C++ runtime and Python APIs for Moonshine models ( #1473 )
2024-10-26 14:34:07 +08:00
Fangjun Kuang
707cf792c5
Add GigaAM NeMo transducer model for Russian ASR ( #1467 )
2024-10-25 15:20:13 +08:00
Fangjun Kuang
b41f6d2c94
Support GigaAM CTC models for Russian ASR ( #1464 )
...
See also https://github.com/salute-developers/GigaAM
2024-10-25 10:55:16 +08:00
Fangjun Kuang
a5295aad10
Handle NaN embeddings in speaker diarization. ( #1461 )
...
See also https://github.com/thewh1teagle/sherpa-rs/issues/33
2024-10-24 14:03:09 +08:00
Fangjun Kuang
b3e05f6dc4
Fix style issues ( #1458 )
2024-10-24 11:15:08 +08:00
Fangjun Kuang
ceb69ebd94
Add C++ API for non-streaming ASR ( #1456 )
2024-10-23 16:40:12 +08:00
Zazzle516
4783c8f590
fix "log10" compile error by import CMATH lib ( #1438 )
2024-10-17 14:50:04 +08:00
Fangjun Kuang
94b26ff07c
Android JNI support for speaker diarization ( #1421 )
2024-10-12 13:03:48 +08:00
Fangjun Kuang
1ed803adc1
Dart API for speaker diarization ( #1418 )
2024-10-11 21:17:41 +08:00