Commit Graph

150 Commits

Author SHA1 Message Date
Fangjun Kuang
4335e2accd Add Dart API for Kokoro TTS models (#1723) 2025-01-16 17:58:19 +08:00
Fangjun Kuang
ad61ad6ff5 Add Swift API for Kokoro TTS models (#1721) 2025-01-16 16:47:37 +08:00
Fangjun Kuang
cc812e6237 Add C# API for Kokoro TTS models (#1720) 2025-01-16 16:30:10 +08:00
Fangjun Kuang
ffc6b480a0 Add C++ and Python API for Kokoro TTS models. (#1715) 2025-01-16 14:24:51 +08:00
Fangjun Kuang
0cb2db385a Add README for android libs. (#1693) 2025-01-07 11:04:10 +08:00
Fangjun Kuang
d7c95d33a3 Add Dart API for MatchaTTS models (#1687) 2025-01-06 11:03:31 +08:00
Fangjun Kuang
6f085babcc Add Swift API for MatchaTTS models. (#1684) 2025-01-06 07:23:45 +08:00
Fangjun Kuang
3eced3e7ee Add C# and JavaScript (wasm) API for MatchaTTS models (#1682) 2025-01-05 15:08:19 +08:00
Fangjun Kuang
0e299f30f5 Add JavaScript API (node-addon-api) for MatchaTTS models. (#1677) 2025-01-03 15:14:28 +08:00
Fangjun Kuang
f457baea42 Support Matcha-TTS models using espeak-ng (#1672) 2025-01-02 13:46:43 +08:00
Fangjun Kuang
2c2926af7d Add C++ runtime for Matcha-TTS (#1627) 2024-12-31 12:44:14 +08:00
Fangjun Kuang
08d771337b Add a byte-level BPE Chinese+English non-streaming zipformer model (#1645) 2024-12-24 16:56:49 +08:00
Fangjun Kuang
3622104133 Add C# API for Moonshine models. (#1483)
* Also, return timestamps for non-streaming ASR.
2024-10-27 13:14:25 +08:00
Fangjun Kuang
54468a7370 Add Dart API for Moonshine models. (#1481) 2024-10-27 12:04:12 +08:00
Fangjun Kuang
6f261d39f3 Add JavaScript API for Moonshine models (#1480) 2024-10-27 11:31:01 +08:00
Fangjun Kuang
669f5ef441 Add C++ runtime and Python APIs for Moonshine models (#1473) 2024-10-26 14:34:07 +08:00
Fangjun Kuang
b41f6d2c94 Support GigaAM CTC models for Russian ASR (#1464)
See also https://github.com/salute-developers/GigaAM
2024-10-25 10:55:16 +08:00
Fangjun Kuang
ceb69ebd94 Add C++ API for non-streaming ASR (#1456) 2024-10-23 16:40:12 +08:00
Fangjun Kuang
effd5ef2be Add C++ API for streaming ASR. (#1455)
It is a wrapper around the C API.
2024-10-23 12:07:43 +08:00
Fangjun Kuang
1ed803adc1 Dart API for speaker diarization (#1418) 2024-10-11 21:17:41 +08:00
Fangjun Kuang
eefc172095 JavaScript API with WebAssembly for speaker diarization (#1414)
#1408 uses [node-addon-api](https://github.com/nodejs/node-addon-api) to call C API from JavaScript, whereas this pull request uses WebAssembly to call C API from JavaScript.
2024-10-11 11:40:10 +08:00
Fangjun Kuang
67349b52f2 JavaScript API (node-addon) for speaker diarization (#1408) 2024-10-10 15:51:31 +08:00
Fangjun Kuang
a45e5dba99 C# API for speaker diarization (#1407) 2024-10-10 14:29:05 +08:00
Fangjun Kuang
1571344509 Swift API for speaker diarization (#1404) 2024-10-09 23:25:39 +08:00
Fangjun Kuang
8535b1d3bb Python API for speaker diarization. (#1400) 2024-10-09 14:13:26 +08:00
Fangjun Kuang
59407edcad C++ API for speaker diarization (#1396) 2024-10-09 12:01:20 +08:00
Fangjun Kuang
b965f14cf0 Add Python API for clustering (#1385) 2024-09-30 11:33:15 +08:00
Fangjun Kuang
576a3aa90d Add non-streaming ONNX models for Russian ASR (#1358) 2024-09-18 13:43:49 +08:00
Lim Yao Chong
3bffc24d64 Add Python binding for online punctuation models (#1312) 2024-09-09 10:26:53 +08:00
Fangjun Kuang
a2a70900d6 Add VAD+ASR example for Dart with CircularBuffer. (#1293) 2024-08-27 19:29:34 +08:00
Fangjun Kuang
5ed8e31868 Add VAD and keyword spotting for the Node package with WebAssembly (#1286) 2024-08-24 23:05:54 +08:00
Fangjun Kuang
ca729faebf Support reading multi-channel wave files with 8/16/32-bit encoded samples (#1258) 2024-08-15 14:54:43 +08:00
Fangjun Kuang
9ee2943ed4 Add CI tests for online punctuation models (#1226) 2024-08-06 18:10:30 +08:00
Fangjun Kuang
35c1b4a7a9 Add ReazonSpeech Japanese pre-trained model (#1203) 2024-08-02 10:21:24 +08:00
Fangjun Kuang
53484fcd9b Fix reading non-standard wav files. (#1199) 2024-08-01 17:48:04 +08:00
Fangjun Kuang
ec98110e11 Add speaker identification and verification example for Dart API (#1194) 2024-07-31 13:53:52 +08:00
Fangjun Kuang
06fd50f536 Add test about whisper large-v3 for .NET (#1187) 2024-07-29 20:49:38 +08:00
Fangjun Kuang
646f99c870 Dart API for adding punctuations to text (#1182) 2024-07-29 12:41:52 +08:00
Fangjun Kuang
cd1fedaa49 Add Dart API for audio tagging (#1181) 2024-07-29 11:15:14 +08:00
Fangjun Kuang
69b6b47d91 Add VAD with non-streaming ASR examples for Dart API (#1180) 2024-07-28 23:01:03 +08:00
Fangjun Kuang
d279c8d20e Add more Python examples for SenseVoice (#1179) 2024-07-28 21:54:38 +08:00
Fangjun Kuang
994c3e7c96 Add VAD + Non-streaming ASR example for JavaScript API. (#1170) 2024-07-26 12:42:08 +08:00
Fangjun Kuang
ac8223bd8a Add Dart API for keyword spotter (#1162) 2024-07-22 10:53:34 +08:00
Fangjun Kuang
ffdb23a8ec Add Dart API for SenseVoice (#1159) 2024-07-21 21:48:12 +08:00
Fangjun Kuang
70d14353bb Add WebAssembly for SenseVoice (#1158) 2024-07-21 15:39:55 +08:00
Fangjun Kuang
c3260ef842 Add JavaScript API for SenseVoice (#1157) 2024-07-21 10:14:14 +08:00
Fangjun Kuang
e472180f2c Add C# API for SenseVoice models (#1151) 2024-07-20 17:09:23 +08:00
Fangjun Kuang
25f0a10468 Add C++ runtime for SenseVoice models (#1148) 2024-07-18 22:54:18 +08:00
Fangjun Kuang
9e448d03bc Provide npm package for 32-bit Windows x86 (#1141) 2024-07-17 12:33:15 +08:00
Fangjun Kuang
b2c283fa2b Add Swift API for adding punctuations to text. (#1132) 2024-07-15 15:30:40 +08:00