Fangjun Kuang
|
4f758e6cd3
|
Publish node-addon-api wrapper for sherpa-onnx as npm packages (#829)
|
2024-05-04 13:27:39 +08:00 |
|
Fangjun Kuang
|
2f9553d838
|
Begin to add node-addon-api for sherpa-onnx (#826)
|
2024-05-03 14:47:40 +08:00 |
|
Fangjun Kuang
|
fcd6024200
|
Fix typos in JNI TTS (#824)
|
2024-05-01 14:14:24 +08:00 |
|
Fangjun Kuang
|
cff207623e
|
Add Java API for speaker identification (#822)
|
2024-04-29 21:23:56 +08:00 |
|
Fangjun Kuang
|
88202f05bb
|
Add Java API for audio tagging (#820)
|
2024-04-28 22:26:04 +08:00 |
|
Fangjun Kuang
|
5407f880c0
|
Add Java and Kotlin API for punctuation models (#818)
|
2024-04-26 22:06:48 +08:00 |
|
Fangjun Kuang
|
db25986240
|
Add Java API for spoken language identification with whisper multilingual models (#817)
|
2024-04-26 19:05:39 +08:00 |
|
Fangjun Kuang
|
612002da57
|
Fix C# to support Chinese tts models using jieba (#815)
|
2024-04-26 11:50:07 +08:00 |
|
Fangjun Kuang
|
c693676d20
|
Fix building wheels for macOS (#814)
|
2024-04-26 10:05:39 +08:00 |
|
Fangjun Kuang
|
15772d2150
|
Add Java API for text-to-speech (#811)
|
2024-04-26 09:26:39 +08:00 |
|
Fangjun Kuang
|
f7b3735621
|
Add CTC HLG decoding for JNI (#810)
|
2024-04-25 17:20:02 +08:00 |
|
Fangjun Kuang
|
83cd533f67
|
Add Java API for non-streaming ASR (#807)
|
2024-04-24 21:03:26 +08:00 |
|
Fangjun Kuang
|
c3a2e8a67c
|
Refactor Java API (#806)
|
2024-04-24 18:41:48 +08:00 |
|
Fangjun Kuang
|
c7691650d7
|
Fix CI tests (#804)
|
2024-04-24 13:01:06 +08:00 |
|
Fangjun Kuang
|
9b67a476e6
|
Refactor the JNI interface to make it more modular and maintainable (#802)
|
2024-04-24 09:48:42 +08:00 |
|
Fangjun Kuang
|
c1608b3524
|
Support CED models (#792)
|
2024-04-19 15:20:37 +08:00 |
|
Fangjun Kuang
|
d97a283dbb
|
Add Android demo for spoken language identification using Whisper multilingual models (#783)
|
2024-04-18 14:33:59 +08:00 |
|
Fangjun Kuang
|
3a43049ba1
|
Add JNI support for spoken language identification (#782)
|
2024-04-17 19:27:15 +08:00 |
|
Fangjun Kuang
|
69440e481f
|
Add WearOS demo for audio tagging (#777)
|
2024-04-17 12:22:17 +08:00 |
|
Fangjun Kuang
|
bcd9e48150
|
Add Android demo for audio tagging (#776)
See https://k2-fsa.github.io/sherpa/onnx/audio-tagging/apk.html
|
2024-04-16 20:47:16 +08:00 |
|
Fangjun Kuang
|
13730ecbd8
|
Add C API for punctuation (#768)
|
2024-04-14 19:02:34 +08:00 |
|
Fangjun Kuang
|
329fe1aa8b
|
Support adding punctuation to the speech recognition result (#761)
|
2024-04-13 12:15:57 +08:00 |
|
Fangjun Kuang
|
f204e62b44
|
Add C API for audio tagging (#754)
|
2024-04-11 14:18:43 +08:00 |
|
Fangjun Kuang
|
042976ea6e
|
Add C++ microphone examples for audio tagging (#749)
|
2024-04-10 21:00:35 +08:00 |
|
Fangjun Kuang
|
f20291cadc
|
Support audio tagging using zipformer (#747)
|
2024-04-10 14:47:06 +08:00 |
|
Fangjun Kuang
|
6fb8ceda57
|
Add VAD examples using ALSA for recording (#739)
|
2024-04-08 16:41:01 +08:00 |
|
Fangjun Kuang
|
a5f8fbc83f
|
Support heteronyms in Chinese TTS (#738)
|
2024-04-08 11:01:30 +08:00 |
|
Fangjun Kuang
|
dbff2eaadb
|
Add C API for streaming HLG decoding (#734)
|
2024-04-05 10:31:20 +08:00 |
|
Fangjun Kuang
|
db67e00c77
|
Add HLG decoding for streaming CTC models (#731)
|
2024-04-03 21:31:42 +08:00 |
|
Fangjun Kuang
|
2ededa7e98
|
Fix building wasm in CI (#720)
|
2024-03-31 20:50:56 +08:00 |
|
Fangjun Kuang
|
6da4a1c12f
|
Add Go API for speaker identification (#718)
|
2024-03-29 19:25:55 +08:00 |
|
Fangjun Kuang
|
2e0bccad36
|
Add C API for speaker embedding extractor. (#711)
|
2024-03-28 18:05:40 +08:00 |
|
Fangjun Kuang
|
12efbf7397
|
Sign released TTS APKs (#710)
|
2024-03-27 19:34:37 +08:00 |
|
Fangjun Kuang
|
4e040c596e
|
Support including TTS conditionally. (#699)
|
2024-03-26 17:21:35 +08:00 |
|
Fangjun Kuang
|
305c373107
|
Add C# API for spoken language identification (#697)
|
2024-03-25 18:45:09 +08:00 |
|
Fangjun Kuang
|
ab7cff2513
|
Add C API for spoken language identification. (#695)
|
2024-03-25 15:16:47 +08:00 |
|
Fangjun Kuang
|
0d258dd150
|
Support spoken language identification with whisper (#694)
|
2024-03-24 22:57:00 +08:00 |
|
Fangjun Kuang
|
3cdad9b5d1
|
Use manylinux in CI test (#692)
|
2024-03-24 07:54:32 +08:00 |
|
Fangjun Kuang
|
1952772654
|
Add timestamps and tokens for .Net's online models. (#690)
|
2024-03-23 18:51:56 +08:00 |
|
Fangjun Kuang
|
2fc1201924
|
Add hotwords support to .Net (#689)
|
2024-03-22 21:40:42 +08:00 |
|
Fangjun Kuang
|
24f437a6f1
|
Refactor github actions tests (#688)
|
2024-03-22 21:22:42 +08:00 |
|
Fangjun Kuang
|
c8770aec20
|
Add nuget package for Windows x86 (#683)
|
2024-03-21 14:57:01 +08:00 |
|
Fangjun Kuang
|
6571fc9552
|
Add tts play example for .Net. (#676)
It plays the generated audio through a speaker while the audio is still being generated.
|
2024-03-19 17:33:15 +08:00 |
|
Fangjun Kuang
|
f70fdd156c
|
Support using T-head-Semi/csi-nn2 for RISC-V (#637)
|
2024-03-06 18:21:50 +08:00 |
|
Fangjun Kuang
|
13260cdf49
|
Use self-compiled onnxruntime shared lib. (#635)
|
2024-03-06 11:03:24 +08:00 |
|
Fangjun Kuang
|
ed06ced16f
|
Add WebAssembly for NodeJS. (#628)
|
2024-03-03 20:00:36 +08:00 |
|
Fangjun Kuang
|
ac6825ff11
|
Refactor WebAssembly for nodejs (#626)
|
2024-03-02 12:31:36 +08:00 |
|
Fangjun Kuang
|
a65643b594
|
support onnxruntime v1.17.1 (#624)
|
2024-03-02 11:44:59 +08:00 |
|
Fangjun Kuang
|
f9db33c926
|
Add WebAssembly demo for streaming trilingual Paraformer (Chinese+Cantonese+English) (#618)
|
2024-03-01 15:20:56 +08:00 |
|
Fangjun Kuang
|
c093880d7c
|
Fix building wheels (#620)
|
2024-03-01 15:20:06 +08:00 |
|