Fangjun Kuang
|
677bc1da3e
|
Add Speaker ID demo for C# (#862)
|
2024-05-11 13:27:33 +08:00 |
|
Fangjun Kuang
|
46e4e5b7ac
|
Add C++ support for streaming NeMo CTC models. (#857)
|
2024-05-10 16:26:43 +08:00 |
|
Fangjun Kuang
|
17cd3a5f01
|
Add C++ runtime for non-streaming faster conformer transducer from NeMo. (#854)
|
2024-05-10 12:15:39 +08:00 |
|
Fangjun Kuang
|
5d8c35e44e
|
Add C++ support for non-streaming NeMo fast conformer hybrid transducer ctc (the ctc branch) (#848)
|
2024-05-09 15:32:22 +08:00 |
|
Fangjun Kuang
|
5ed3ec1c04
|
Export non-streaming NeMo faster conformer hybrid transducer and ctc to sherpa-onnx (#847)
|
2024-05-09 13:59:47 +08:00 |
|
Fangjun Kuang
|
68b25abf27
|
Export NeMo FastConformer Hybrid Transducer Large Streaming to ONNX (#844)
|
2024-05-08 19:07:49 +08:00 |
|
Fangjun Kuang
|
a9f936e92b
|
Export NeMo FastConformer Hybrid Transducer-CTC Large Streaming to ONNX. (#843)
|
2024-05-08 12:33:46 +08:00 |
|
Fangjun Kuang
|
dbaa26ff4b
|
Publish node-addon-api npm package for linux arm64 (#841)
|
2024-05-07 23:05:40 +08:00 |
|
Fangjun Kuang
|
d2e86b0415
|
Add links to pre-built APKs and pre-trained models to README. (#840)
|
2024-05-07 12:28:42 +08:00 |
|
Fangjun Kuang
|
37a4135dd7
|
Publish npm package with node-addon-api for Windows (#838)
|
2024-05-06 16:21:29 +08:00 |
|
Fangjun Kuang
|
4f758e6cd3
|
Publish node-addon-api wrapper for sherpa-onnx as npm packages (#829)
|
2024-05-04 13:27:39 +08:00 |
|
Fangjun Kuang
|
2f9553d838
|
Begin to add node-addon-api for sherpa-onnx (#826)
|
2024-05-03 14:47:40 +08:00 |
|
Fangjun Kuang
|
fcd6024200
|
Fix typos in JNI TTS (#824)
|
2024-05-01 14:14:24 +08:00 |
|
Fangjun Kuang
|
cff207623e
|
Add Java API for speaker identification (#822)
|
2024-04-29 21:23:56 +08:00 |
|
Fangjun Kuang
|
88202f05bb
|
Add Java API for audio tagging (#820)
|
2024-04-28 22:26:04 +08:00 |
|
Fangjun Kuang
|
5407f880c0
|
Add Java and Kotlin API for punctuation models (#818)
|
2024-04-26 22:06:48 +08:00 |
|
Fangjun Kuang
|
db25986240
|
Add Java API for spoken language identification with whisper multilingual models (#817)
|
2024-04-26 19:05:39 +08:00 |
|
Fangjun Kuang
|
612002da57
|
Fix C# to support Chinese tts models using jieba (#815)
|
2024-04-26 11:50:07 +08:00 |
|
Fangjun Kuang
|
c693676d20
|
Fix building wheels for macOS (#814)
|
2024-04-26 10:05:39 +08:00 |
|
Fangjun Kuang
|
15772d2150
|
Add Java API for text-to-speech (#811)
|
2024-04-26 09:26:39 +08:00 |
|
Fangjun Kuang
|
f7b3735621
|
Add CTC HLG decoding for JNI (#810)
|
2024-04-25 17:20:02 +08:00 |
|
Fangjun Kuang
|
83cd533f67
|
Add Java API for non-streaming ASR (#807)
|
2024-04-24 21:03:26 +08:00 |
|
Fangjun Kuang
|
c3a2e8a67c
|
Refactor Java API (#806)
|
2024-04-24 18:41:48 +08:00 |
|
Fangjun Kuang
|
c7691650d7
|
Fix CI tests (#804)
|
2024-04-24 13:01:06 +08:00 |
|
Fangjun Kuang
|
9b67a476e6
|
Refactor the JNI interface to make it more modular and maintainable (#802)
|
2024-04-24 09:48:42 +08:00 |
|
Fangjun Kuang
|
c1608b3524
|
Support CED models (#792)
|
2024-04-19 15:20:37 +08:00 |
|
Fangjun Kuang
|
d97a283dbb
|
Add Android demo for spoken language identification using Whisper multilingual models (#783)
|
2024-04-18 14:33:59 +08:00 |
|
Fangjun Kuang
|
3a43049ba1
|
Add JNI support for spoken language identification (#782)
|
2024-04-17 19:27:15 +08:00 |
|
Fangjun Kuang
|
69440e481f
|
Add WearOS demo for audio tagging (#777)
|
2024-04-17 12:22:17 +08:00 |
|
Fangjun Kuang
|
bcd9e48150
|
Add Android demo for audio tagging (#776)
See https://k2-fsa.github.io/sherpa/onnx/audio-tagging/apk.html
|
2024-04-16 20:47:16 +08:00 |
|
Fangjun Kuang
|
13730ecbd8
|
Add C API for punctuation (#768)
|
2024-04-14 19:02:34 +08:00 |
|
Fangjun Kuang
|
329fe1aa8b
|
Support adding punctuations to the speech recognition result (#761)
|
2024-04-13 12:15:57 +08:00 |
|
Fangjun Kuang
|
f204e62b44
|
Add C API for audio tagging (#754)
|
2024-04-11 14:18:43 +08:00 |
|
Fangjun Kuang
|
042976ea6e
|
Add C++ microphone examples for audio tagging (#749)
|
2024-04-10 21:00:35 +08:00 |
|
Fangjun Kuang
|
f20291cadc
|
Support audio tagging using zipformer (#747)
|
2024-04-10 14:47:06 +08:00 |
|
Fangjun Kuang
|
6fb8ceda57
|
Add VAD examples using ALSA for recording (#739)
|
2024-04-08 16:41:01 +08:00 |
|
Fangjun Kuang
|
a5f8fbc83f
|
Support heteronyms in Chinese TTS (#738)
|
2024-04-08 11:01:30 +08:00 |
|
Fangjun Kuang
|
dbff2eaadb
|
Add C API for streaming HLG decoding (#734)
|
2024-04-05 10:31:20 +08:00 |
|
Fangjun Kuang
|
db67e00c77
|
Add HLG decoding for streaming CTC models (#731)
|
2024-04-03 21:31:42 +08:00 |
|
Fangjun Kuang
|
2ededa7e98
|
Fix building wasm in CI (#720)
|
2024-03-31 20:50:56 +08:00 |
|
Fangjun Kuang
|
6da4a1c12f
|
Add Go API for speaker identification (#718)
|
2024-03-29 19:25:55 +08:00 |
|
Fangjun Kuang
|
2e0bccad36
|
Add C API for speaker embedding extractor. (#711)
|
2024-03-28 18:05:40 +08:00 |
|
Fangjun Kuang
|
12efbf7397
|
Sign released TTS APKs (#710)
|
2024-03-27 19:34:37 +08:00 |
|
Fangjun Kuang
|
4e040c596e
|
Support including TTS conditionally. (#699)
|
2024-03-26 17:21:35 +08:00 |
|
Fangjun Kuang
|
305c373107
|
Add C# API for spoken language identification (#697)
|
2024-03-25 18:45:09 +08:00 |
|
Fangjun Kuang
|
ab7cff2513
|
Add C API for spoken language identification. (#695)
|
2024-03-25 15:16:47 +08:00 |
|
Fangjun Kuang
|
0d258dd150
|
Support spoken language identification with whisper (#694)
|
2024-03-24 22:57:00 +08:00 |
|
Fangjun Kuang
|
3cdad9b5d1
|
Use manylinux in CI test (#692)
|
2024-03-24 07:54:32 +08:00 |
|
Fangjun Kuang
|
1952772654
|
Add timestamps and tokens for .Net's online models. (#690)
|
2024-03-23 18:51:56 +08:00 |
|
Fangjun Kuang
|
2fc1201924
|
Add hotwords support to .Net (#689)
|
2024-03-22 21:40:42 +08:00 |
|