Fangjun Kuang
|
ab21131f7f
|
Swift API for keyword spotting. (#1027)
|
2024-06-18 16:51:30 +08:00 |
|
Fangjun Kuang
|
6789c909d2
|
Inverse text normalization API of streaming ASR for various programming languages (#1022)
|
2024-06-18 13:42:17 +08:00 |
|
Fangjun Kuang
|
349d957da2
|
Add inverse text normalization for online ASR (#1020)
|
2024-06-17 18:39:23 +08:00 |
|
Fangjun Kuang
|
6e09933d99
|
Inverse text normalization API for other programming languages (#1019)
|
2024-06-17 17:02:39 +08:00 |
|
Fangjun Kuang
|
b0f7ed3ee3
|
Add inverse text normalization for non-streaming ASR (#1017)
|
2024-06-17 14:28:53 +08:00 |
|
Fangjun Kuang
|
e52d32b95b
|
Add TTS API and examples for Dart (#1010)
|
2024-06-15 14:30:36 +08:00 |
|
Fangjun Kuang
|
e3077670c6
|
Add streaming ASR examples for Dart API (#1009)
|
2024-06-15 11:48:54 +08:00 |
|
Fangjun Kuang
|
d94506698d
|
Add non-streaming ASR examples for Dart API (#1007)
|
2024-06-14 18:40:16 +08:00 |
|
Fangjun Kuang
|
fd5a0d1e00
|
Add C++ runtime for Tele-AI/TeleSpeech-ASR (#970)
|
2024-06-05 00:26:40 +08:00 |
|
Fangjun Kuang
|
0196f1d546
|
Wrap VAD APIs to C# (#946)
|
2024-05-30 18:29:38 +08:00 |
|
Fangjun Kuang
|
a99c7cb35c
|
Wrap punctuation APIs to C#. (#945)
|
2024-05-30 16:19:56 +08:00 |
|
Fangjun Kuang
|
082f230dfb
|
Fix nemo streaming transducer greedy search (#944)
|
2024-05-30 15:31:10 +08:00 |
|
Fangjun Kuang
|
b4f1006864
|
Fix node-addon-api for VAD (#932)
|
2024-05-28 18:46:43 +08:00 |
|
Fangjun Kuang
|
b1c7d04ce2
|
Support not using external buffers for node-addon (#925)
|
2024-05-28 11:50:23 +08:00 |
|
Fangjun Kuang
|
e7b7348102
|
fix detecting node-addon packages (#908)
|
2024-05-23 19:52:19 +08:00 |
|
Fangjun Kuang
|
4f21aabd3c
|
Fix CI for JavaScript and Python APIs. (#901)
|
2024-05-22 13:57:00 +08:00 |
|
Fangjun Kuang
|
b445956675
|
Fix CI tests. (#898)
|
2024-05-21 20:37:29 +08:00 |
|
Wei Kang
|
b012b78ceb
|
Encode hotwords in C++ side (#828)
|
2024-05-20 19:41:36 +08:00 |
|
Fangjun Kuang
|
c2dcdabab1
|
Fix sherpa-onnx-node-version in node examples (#879)
|
2024-05-15 14:32:30 +08:00 |
|
Fangjun Kuang
|
03c956a317
|
Add keyword spotting API for node-addon-api (#877)
|
2024-05-14 20:26:48 +08:00 |
|
Fangjun Kuang
|
75630b986b
|
Support adding punctuations to text for node-addon-api (#876)
|
2024-05-14 19:28:56 +08:00 |
|
Fangjun Kuang
|
d19f50b799
|
Add audio tagging APIs for node-addon-api (#875)
|
2024-05-14 17:32:30 +08:00 |
|
Fangjun Kuang
|
388e6a98fc
|
Add speaker identification APIs for node-addon-api (#874)
|
2024-05-14 13:28:50 +08:00 |
|
Fangjun Kuang
|
939fdd942c
|
Add spoken language identification for node-addon-api (#872)
|
2024-05-13 20:26:11 +08:00 |
|
Fangjun Kuang
|
031134b4d4
|
Add TTS for node-addon-api (#871)
|
2024-05-13 19:24:09 +08:00 |
|
Fangjun Kuang
|
697b960768
|
Add non-streaming ASR APIs for node-addon-api (#868)
|
2024-05-13 16:03:34 +08:00 |
|
Fangjun Kuang
|
384f96c40f
|
Add streaming CTC ASR APIs for node-addon-api (#867)
|
2024-05-13 11:58:25 +08:00 |
|
Fangjun Kuang
|
677bc1da3e
|
Add Speaker ID demo for C# (#862)
|
2024-05-11 13:27:33 +08:00 |
|
Fangjun Kuang
|
46e4e5b7ac
|
Add C++ support for streaming NeMo CTC models. (#857)
|
2024-05-10 16:26:43 +08:00 |
|
Fangjun Kuang
|
17cd3a5f01
|
Add C++ runtime for non-streaming faster conformer transducer from NeMo. (#854)
|
2024-05-10 12:15:39 +08:00 |
|
Fangjun Kuang
|
5d8c35e44e
|
Add C++ support for non-streaming NeMo fast conformer hybrid transducer ctc (the ctc branch) (#848)
|
2024-05-09 15:32:22 +08:00 |
|
Fangjun Kuang
|
dbaa26ff4b
|
Publish node-addon-api npm package for linux arm64 (#841)
|
2024-05-07 23:05:40 +08:00 |
|
Fangjun Kuang
|
37a4135dd7
|
Publish npm package with node-addon-api for Windows (#838)
|
2024-05-06 16:21:29 +08:00 |
|
Fangjun Kuang
|
4f758e6cd3
|
Publish node-addon-api wrapper for sherpa-onnx as npm packages (#829)
|
2024-05-04 13:27:39 +08:00 |
|
Fangjun Kuang
|
612002da57
|
Fix C# to support Chinese tts models using jieba (#815)
|
2024-04-26 11:50:07 +08:00 |
|
Fangjun Kuang
|
13730ecbd8
|
Add C API for punctuation (#768)
|
2024-04-14 19:02:34 +08:00 |
|
Fangjun Kuang
|
68b8b88b5a
|
Add Python API for punctuation models. (#762)
|
2024-04-13 13:28:17 +08:00 |
|
Fangjun Kuang
|
329fe1aa8b
|
Support adding punctuations to the speech recognition result (#761)
|
2024-04-13 12:15:57 +08:00 |
|
Fangjun Kuang
|
f204e62b44
|
Add C API for audio tagging (#754)
|
2024-04-11 14:18:43 +08:00 |
|
Fangjun Kuang
|
34d70a259f
|
Add Python API and Python examples for audio tagging (#753)
|
2024-04-11 11:12:48 +08:00 |
|
Fangjun Kuang
|
f20291cadc
|
Support audio tagging using zipformer (#747)
|
2024-04-10 14:47:06 +08:00 |
|
Fangjun Kuang
|
6fb8ceda57
|
Add VAD examples using ALSA for recording (#739)
|
2024-04-08 16:41:01 +08:00 |
|
Fangjun Kuang
|
a5f8fbc83f
|
Support heteronyms in Chinese TTS (#738)
|
2024-04-08 11:01:30 +08:00 |
|
Fangjun Kuang
|
dbff2eaadb
|
Add C API for streaming HLG decoding (#734)
|
2024-04-05 10:31:20 +08:00 |
|
Fangjun Kuang
|
db67e00c77
|
Add HLG decoding for streaming CTC models (#731)
|
2024-04-03 21:31:42 +08:00 |
|
Fangjun Kuang
|
2e0bccad36
|
Add C API for speaker embedding extractor. (#711)
|
2024-03-28 18:05:40 +08:00 |
|
Fangjun Kuang
|
305c373107
|
Add C# API for spoken language identification (#697)
|
2024-03-25 18:45:09 +08:00 |
|
Fangjun Kuang
|
83a10a55a5
|
Add Swift API for spoken language identification. (#696)
|
2024-03-25 16:22:25 +08:00 |
|
Fangjun Kuang
|
ab7cff2513
|
Add C API for spoken language identification. (#695)
|
2024-03-25 15:16:47 +08:00 |
|
Fangjun Kuang
|
0d258dd150
|
Support spoken language identification with whisper (#694)
|
2024-03-24 22:57:00 +08:00 |
|