130 Commits

Author SHA1 Message Date
Fangjun Kuang
2b2788332e Add C++ support for UVR models (#2269) 2025-06-01 17:22:08 +08:00
Fangjun Kuang
2e9e0b4e9e Add Android demo for real-time ASR with non-streaming ASR models. (#2214) 2025-05-14 19:10:44 +08:00
Fangjun Kuang
028b8f2718 Add C++ example for streaming ASR with SenseVoice. (#2199) 2025-05-11 00:23:32 +08:00
Fangjun Kuang
e51c37eb2f Add C and CXX API for homophone replacer (#2156) 2025-04-27 22:09:13 +08:00
Fangjun Kuang
8137ac9f0b Add Pascal API for Dolphin CTC models (#2096) 2025-04-03 16:00:22 +08:00
Fangjun Kuang
da4aad1189 Add C and CXX API for Dolphin CTC models (#2088) 2025-04-02 21:54:20 +08:00
Fangjun Kuang
0703bc1b86 Add CXX API for VAD (#2077) 2025-04-01 14:51:43 +08:00
niansa/tuxifan
9d23606ee6 Allow building repository as CMake subdirectory (#2059)
* Use PROJECT_SOURCE_DIR rather than CMAKE_SOURCE_DIR to allow building as subdirectory

* Also use PROJECT_SOURCE_DIR instead of CMAKE_SOURCE_DIR in c/cxx api examples

* Only build examples by default when not building as subdirectory

* Do not suggest building binaries either

---------

Co-authored-by: user <user@mail.tld>
2025-03-29 06:27:59 +08:00
Fangjun Kuang
6a97f8adcf Add JavaScript (node-addon) API for speech enhancement GTCRN models (#1996) 2025-03-12 15:52:01 +08:00
Fangjun Kuang
802119db17 Add CXX API for speech enhancement GTCRN models (#1986) 2025-03-11 17:07:52 +08:00
Fangjun Kuang
c5dbf1177c Add C API for speech enhancement GTCRN models (#1984) 2025-03-11 15:50:04 +08:00
cjsdurj
b87fce9a7f c-api add wave write to buffer. (#1962)
Co-authored-by: jian.chen03 <jian.chen03@transwarp.io>
2025-03-10 17:21:23 +08:00
Fangjun Kuang
1d49dd2fb0 Add CXX API for FireRedAsr (#1872) 2025-02-17 11:46:13 +08:00
Fangjun Kuang
193d31333c Add C API for FireRedAsr AED model. (#1871) 2025-02-17 11:22:17 +08:00
Fangjun Kuang
944400e399 Fix splitting text by languages for kokoro tts. (#1849) 2025-02-13 18:19:34 +08:00
Kell
2ac41d3d85 OfflineRecognizer supports create stream with hotwords (#1833)
Co-authored-by: Wangkai <kell.wang@huawei.com>
2025-02-10 16:26:56 +08:00
Fangjun Kuang
69f489f0cd Support scaling the duration of a pause in TTS. (#1820) 2025-02-08 12:47:26 +08:00
Fangjun Kuang
d815204774 Add CXX API for Kokoro TTS 1.0 (#1802) 2025-02-07 14:51:49 +08:00
Fangjun Kuang
7330f7519a Add C API for Kokoro TTS 1.0 (#1801) 2025-02-07 14:30:40 +08:00
Fangjun Kuang
f178e96bf0 Add keyword spotter C API for HarmonyOS (#1769) 2025-01-26 14:12:30 +08:00
Fangjun Kuang
8b989a851c Fix keyword spotting. (#1689)
Reset the stream right after detecting a keyword
2025-01-20 16:41:10 +08:00
Fangjun Kuang
af671e2b63 Add C API for Kokoro TTS models (#1717) 2025-01-16 15:07:26 +08:00
Fangjun Kuang
6d18430dbf Fix building without TTS (#1691) 2025-01-07 09:59:20 +08:00
Fangjun Kuang
bf3330c906 Add HarmonyOS examples for MatchaTTS. (#1678) 2025-01-03 17:09:29 +08:00
Fangjun Kuang
648903834b Add CXX API for MatchaTTS models (#1676) 2025-01-03 14:16:36 +08:00
Fangjun Kuang
9aa4897a9e Add C API for MatchaTTS models (#1675) 2025-01-03 12:17:26 +08:00
Fangjun Kuang
3422b9388d Add Kotlin API for Matcha-TTS models. (#1668) 2024-12-31 19:20:52 +08:00
yujinqiu
5c2cc48f50 Add swift online punctuation (#1661) 2024-12-31 11:26:32 +08:00
Fangjun Kuang
914cbad6a3 Add speaker diarization demo for HarmonyOS (#1610) 2024-12-10 20:11:44 +08:00
Fangjun Kuang
1bae4085ca Add speaker diarization API for HarmonyOS. (#1609) 2024-12-10 16:03:03 +08:00
Fangjun Kuang
314545f938 Add speaker identification APIs for HarmonyOS (#1607)
* Add speaker embedding extractor API for HarmonyOS

* Add ArkTS API for speaker identification
2024-12-09 19:23:18 +08:00
Fangjun Kuang
74a8735f7a Add on-device text-to-speech (TTS) demo for HarmonyOS (#1590) 2024-12-04 14:27:12 +08:00
Fangjun Kuang
dc3287f3a8 Add HarmonyOS support for text-to-speech. (#1584) 2024-12-01 21:43:34 +08:00
Fangjun Kuang
315d8e2a47 Publish sherpa_onnx.har for HarmonyOS (#1572) 2024-11-28 17:30:16 +08:00
Fangjun Kuang
2101227269 Add streaming ASR support for HarmonyOS. (#1565) 2024-11-26 18:36:56 +08:00
Fangjun Kuang
298b6b6fda Add non-streaming ASR support for HarmonyOS. (#1564) 2024-11-26 16:38:35 +08:00
Fangjun Kuang
31d6206fde HarmonyOS support for VAD. (#1561) 2024-11-24 16:29:24 +08:00
Fangjun Kuang
356da3b54c Publish pre-built macos xcframework (#1490) 2024-10-29 12:26:26 +08:00
Fangjun Kuang
2ca2985d04 Add C and C++ API for Moonshine models (#1476) 2024-10-26 23:24:46 +08:00
Fangjun Kuang
b3e05f6dc4 Fix style issues (#1458) 2024-10-24 11:15:08 +08:00
Fangjun Kuang
ceb69ebd94 Add C++ API for non-streaming ASR (#1456) 2024-10-23 16:40:12 +08:00
Fangjun Kuang
effd5ef2be Add C++ API for streaming ASR. (#1455)
It is a wrapper around the C API.
2024-10-23 12:07:43 +08:00
Fangjun Kuang
1ed803adc1 Dart API for speaker diarization (#1418) 2024-10-11 21:17:41 +08:00
Fangjun Kuang
1d061df355 WebAssembly example for speaker diarization (#1411) 2024-10-10 22:14:45 +08:00
Fangjun Kuang
d468527f62 C API for speaker diarization (#1402) 2024-10-09 17:10:03 +08:00
lxiao336
06b61ccad8 Allow more online models to load tokens file from the memory (#1352)
Co-authored-by: xiao <shawl336@6163.com>
2024-09-20 16:38:41 +08:00
Fangjun Kuang
e7ffcbd677 Add APIs about max speech duration in VAD for various programming languages (#1349) 2024-09-14 12:30:13 +08:00
Fangjun Kuang
544857b097 Fix building (#1343) 2024-09-13 13:33:52 +08:00
lxiao336
65cfa7548a re-pull-request allow tokens and hotwords to be loaded from buffered string directly (#1339)
Co-authored-by: xiao <shawl336@163.com>
2024-09-13 09:58:17 +08:00
Fangjun Kuang
ca30d83915 Avoid SherpaOnnxSpeakerEmbeddingManagerFreeBestMatches freeing null. (#1296)
Fixes #1295
2024-08-28 10:42:36 +08:00