Fangjun Kuang
16a3449945
Build APK with replace.fst ( #2254 )
2025-05-28 12:19:29 +08:00
Fangjun Kuang
d8bb20710d
Add script to build APK for simulated-streaming-asr. ( #2220 )
2025-05-15 15:40:22 +08:00
Fangjun Kuang
7cbb1bc433
Upload more onnx ASR models ( #2141 )
2025-04-21 18:57:41 +08:00
Fangjun Kuang
da4aad1189
Add C and CXX API for Dolphin CTC models ( #2088 )
2025-04-02 21:54:20 +08:00
Fangjun Kuang
eee5575836
Add Kotlin and Java API for Dolphin CTC models ( #2086 )
2025-04-02 21:16:14 +08:00
Fangjun Kuang
0aacf02dd8
Add C++ runtime for vocos ( #2014 )
2025-03-17 17:05:15 +08:00
Fangjun Kuang
dfcbc8d40b
Add Kokoro v1.1-zh ( #1942 )
2025-02-28 15:47:59 +08:00
Fangjun Kuang
8b8ef1090b
Fix CI ( #1841 )
2025-02-11 12:27:09 +08:00
Fangjun Kuang
9559a10bd3
Add C++ support for MatchaTTS models not from icefall. ( #1834 )
2025-02-10 15:38:29 +08:00
Fangjun Kuang
a52b819fb5
Add Android demo for Kokoro TTS 1.0 ( #1799 )
2025-02-07 13:07:30 +08:00
Fangjun Kuang
99cef4198b
Add Kotlin and Java API for Kokoro TTS models ( #1728 )
2025-01-17 17:36:13 +08:00
Fangjun Kuang
1fe5fe495f
Add Android demo for MatchaTTS models. ( #1683 )
2025-01-06 06:44:09 +08:00
Fangjun Kuang
08d771337b
Add a byte-level BPE Chinese+English non-streaming zipformer model ( #1645 )
2024-12-24 16:56:49 +08:00
Fangjun Kuang
fe3265aa25
Add new TTS models for Latvian and Persian+English ( #1644 )
2024-12-24 15:16:02 +08:00
Fangjun Kuang
299f2392e2
Add CI to build HAPs for HarmonyOS ( #1578 )
2024-11-29 21:13:01 +08:00
Fangjun Kuang
c34ab35591
Add Android APK for streaming Paraformer ASR ( #1538 )
2024-11-14 20:57:35 +08:00
Fangjun Kuang
a3c89aa0d8
Add two-pass ASR Android APKs for Moonshine models. ( #1499 )
2024-10-31 17:54:16 +08:00
Fangjun Kuang
bd4b223920
Add Kotlin and Java API for Moonshine models ( #1474 )
2024-10-26 22:30:29 +08:00
Fangjun Kuang
707cf792c5
Add GigaAM NeMo transducer model for Russian ASR ( #1467 )
2024-10-25 15:20:13 +08:00
Fangjun Kuang
b41f6d2c94
Support GigaAM CTC models for Russian ASR ( #1464 )
See also https://github.com/salute-developers/GigaAM
2024-10-25 10:55:16 +08:00
Fangjun Kuang
e0586f1876
add more models for speaker diarization ( #1440 )
2024-10-17 20:03:09 +08:00
Fangjun Kuang
620597f501
Support https://huggingface.co/Revai/reverb-diarization-v1 ( #1437 )
2024-10-17 11:58:14 +08:00
Fangjun Kuang
5a22f74b2b
Android demo for speaker diarization ( #1423 )
2024-10-13 14:02:57 +08:00
Fangjun Kuang
b965f14cf0
Add Python API for clustering ( #1385 )
2024-09-30 11:33:15 +08:00
Fangjun Kuang
576a3aa90d
Add non-streaming ONNX models for Russian ASR ( #1358 )
2024-09-18 13:43:49 +08:00
Fangjun Kuang
c38634dfcf
two-pass Android APK for SenseVoice ( #1302 )
2024-08-29 12:08:49 +08:00
Fangjun Kuang
fb09f8fae3
Set batch size to 1 for more streaming ASR models ( #1280 )
2024-08-23 11:06:55 +08:00
Fangjun Kuang
5a2aa110b8
Text to speech API for Object Pascal. ( #1273 )
2024-08-20 20:52:16 +08:00
Fangjun Kuang
fbe35ba736
Add Lazarus example for generating subtitles using Silero VAD with non-streaming ASR ( #1251 )
2024-08-15 22:19:45 +08:00
Fangjun Kuang
35c1b4a7a9
Add ReazonSpeech Japanese pre-trained model ( #1203 )
2024-08-02 10:21:24 +08:00
Fangjun Kuang
dd300b1de5
Add Java and Kotlin API for SenseVoice ( #1164 )
2024-07-22 14:08:40 +08:00
Fangjun Kuang
960eb7529e
Add C++ runtime for MeloTTS ( #1138 )
2024-07-16 15:55:02 +08:00
Fangjun Kuang
fa07bbc176
Add APK for small paraformer ( #1133 )
2024-07-15 19:44:36 +08:00
Fangjun Kuang
3951a12f8d
Add pre-trained models for the Libriheavy dataset ( #1122 )
2024-07-13 19:21:13 +08:00
Fangjun Kuang
b5093e27f9
Fix publishing apks to huggingface ( #1121 )
Save the APKs for each release in a separate directory.
Huggingface does not allow a directory to contain more than 1000 files.
Because there are many TTS models and each model needs APKs for 4 different ABIs,
placing each release's APKs in its own directory works around this limit.
2024-07-13 16:14:00 +08:00
Fangjun Kuang
dd0ff2ca06
Support onnxruntime 1.18.0 ( #906 )
2024-07-10 17:05:26 +08:00
Fangjun Kuang
9e446b8501
Fix typos ( #1101 )
2024-07-09 20:08:47 +08:00
Fangjun Kuang
1f95bff719
Add non-streaming zipformer Android APK ( #1052 )
2024-06-24 16:22:19 +08:00
Fangjun Kuang
36336b31f4
Build Android APK for Thai ( #1036 )
2024-06-20 18:05:57 +08:00
Fangjun Kuang
6789c909d2
Inverse text normalization API of streaming ASR for various programming languages ( #1022 )
2024-06-18 13:42:17 +08:00
Fangjun Kuang
e1201225f2
Add Android APK for Korean ( #1015 )
2024-06-16 19:17:15 +08:00
Fangjun Kuang
09efe54808
add more text-to-speech models from piper ( #988 )
2024-06-11 15:22:48 +08:00
Fangjun Kuang
fd5a0d1e00
Add C++ runtime for Tele-AI/TeleSpeech-ASR ( #970 )
2024-06-05 00:26:40 +08:00
Fangjun Kuang
cd65e7627d
add a new tts piper model ( #927 )
2024-05-28 10:43:13 +08:00
Fangjun Kuang
384f96c40f
Add streaming CTC ASR APIs for node-addon-api ( #867 )
2024-05-13 11:58:25 +08:00
Fangjun Kuang
db85b2c1d8
Add Android APKs for NeMo CTC models. ( #866 )
2024-05-12 14:58:36 +08:00
Fangjun Kuang
7322f4e0a3
Fix node addon tests ( #865 )
* Install naudiodon2 manually.
It is needed only when using a microphone. The CI tests don't need it.
2024-05-12 12:03:43 +08:00
Fangjun Kuang
d2e86b0415
Add links to pre-built APKs and pre-trained models to README. ( #840 )
2024-05-07 12:28:42 +08:00
Fangjun Kuang
9b67a476e6
Refactor the JNI interface to make it more modular and maintainable ( #802 )
2024-04-24 09:48:42 +08:00
Fangjun Kuang
7f3b9ffe5d
Refactor TTS Android code to support jieba for Chinese TTS models ( #800 )
2024-04-22 17:21:05 +08:00