Fangjun Kuang
|
c3a2e8a67c
|
Refactor Java API (#806)
|
2024-04-24 18:41:48 +08:00 |
|
Fangjun Kuang
|
c7691650d7
|
Fix CI tests (#804)
|
2024-04-24 13:01:06 +08:00 |
|
Fangjun Kuang
|
9b67a476e6
|
Refactor the JNI interface to make it more modular and maintainable (#802)
|
2024-04-24 09:48:42 +08:00 |
|
Fangjun Kuang
|
c1608b3524
|
Support CED models (#792)
|
2024-04-19 15:20:37 +08:00 |
|
Fangjun Kuang
|
d97a283dbb
|
Add Android demo for spoken language identification using Whisper multilingual models (#783)
|
2024-04-18 14:33:59 +08:00 |
|
Fangjun Kuang
|
3a43049ba1
|
Add JNI support for spoken language identification (#782)
|
2024-04-17 19:27:15 +08:00 |
|
Fangjun Kuang
|
69440e481f
|
Add WearOS demo for audio tagging (#777)
|
2024-04-17 12:22:17 +08:00 |
|
Fangjun Kuang
|
bcd9e48150
|
Add Android demo for audio tagging (#776)
See https://k2-fsa.github.io/sherpa/onnx/audio-tagging/apk.html
|
2024-04-16 20:47:16 +08:00 |
|
Fangjun Kuang
|
13730ecbd8
|
Add C API for punctuation (#768)
|
2024-04-14 19:02:34 +08:00 |
|
Fangjun Kuang
|
329fe1aa8b
|
Support adding punctuations to the speech recogntion result (#761)
|
2024-04-13 12:15:57 +08:00 |
|
Fangjun Kuang
|
f204e62b44
|
Add C API for audio tagging (#754)
|
2024-04-11 14:18:43 +08:00 |
|
Fangjun Kuang
|
042976ea6e
|
Add C++ microphone examples for audio tagging (#749)
|
2024-04-10 21:00:35 +08:00 |
|
Fangjun Kuang
|
f20291cadc
|
Support audio tagging using zipformer (#747)
|
2024-04-10 14:47:06 +08:00 |
|
Fangjun Kuang
|
6fb8ceda57
|
Add VAD examples using ALSA for recording (#739)
|
2024-04-08 16:41:01 +08:00 |
|
Fangjun Kuang
|
a5f8fbc83f
|
Support heteronyms in Chinese TTS (#738)
|
2024-04-08 11:01:30 +08:00 |
|
Fangjun Kuang
|
dbff2eaadb
|
Add C API for streaming HLG decoding (#734)
|
2024-04-05 10:31:20 +08:00 |
|
Fangjun Kuang
|
db67e00c77
|
Add HLG decoding for streaming CTC models (#731)
|
2024-04-03 21:31:42 +08:00 |
|
Fangjun Kuang
|
2ededa7e98
|
Fix building wasm in CI (#720)
|
2024-03-31 20:50:56 +08:00 |
|
Fangjun Kuang
|
6da4a1c12f
|
Add Go API for speaker identification (#718)
|
2024-03-29 19:25:55 +08:00 |
|
Fangjun Kuang
|
2e0bccad36
|
Add C API for speaker embedding extractor. (#711)
|
2024-03-28 18:05:40 +08:00 |
|
Fangjun Kuang
|
12efbf7397
|
Sign released TTS APKs (#710)
|
2024-03-27 19:34:37 +08:00 |
|
Fangjun Kuang
|
4e040c596e
|
Support including TTS conditionally. (#699)
|
2024-03-26 17:21:35 +08:00 |
|
Fangjun Kuang
|
305c373107
|
Add C# API for spoken language identification (#697)
|
2024-03-25 18:45:09 +08:00 |
|
Fangjun Kuang
|
ab7cff2513
|
Add C API for spoken language identification. (#695)
|
2024-03-25 15:16:47 +08:00 |
|
Fangjun Kuang
|
0d258dd150
|
Support spoken language identification with whisper (#694)
|
2024-03-24 22:57:00 +08:00 |
|
Fangjun Kuang
|
3cdad9b5d1
|
Use manylinux in CI test (#692)
|
2024-03-24 07:54:32 +08:00 |
|
Fangjun Kuang
|
1952772654
|
Add timestamps and tokens for .Net's online models. (#690)
|
2024-03-23 18:51:56 +08:00 |
|
Fangjun Kuang
|
2fc1201924
|
Add hotwords support to .Net (#689)
|
2024-03-22 21:40:42 +08:00 |
|
Fangjun Kuang
|
24f437a6f1
|
Refactor github actions tests (#688)
|
2024-03-22 21:22:42 +08:00 |
|
Fangjun Kuang
|
c8770aec20
|
Add nuget package for Windows x86 (#683)
|
2024-03-21 14:57:01 +08:00 |
|
Fangjun Kuang
|
6571fc9552
|
Add tts play example for .Net. (#676)
It plays the generated audio via a speaker as it is generating.
|
2024-03-19 17:33:15 +08:00 |
|
Fangjun Kuang
|
f70fdd156c
|
Support using T-head-Semi/csi-nn2 for RISC-V (#637)
|
2024-03-06 18:21:50 +08:00 |
|
Fangjun Kuang
|
13260cdf49
|
Use self-compiled onnxruntime shared lib. (#635)
|
2024-03-06 11:03:24 +08:00 |
|
Fangjun Kuang
|
ed06ced16f
|
Add WebAssembly for NodeJS. (#628)
|
2024-03-03 20:00:36 +08:00 |
|
Fangjun Kuang
|
ac6825ff11
|
Refactor WebAssembly for nodejs (#626)
|
2024-03-02 12:31:36 +08:00 |
|
Fangjun Kuang
|
a65643b594
|
support onnxruntime v1.17.1 (#624)
|
2024-03-02 11:44:59 +08:00 |
|
Fangjun Kuang
|
f9db33c926
|
Add WebAssembly demo for streaming trilingual Paraformer (Chinese+Cantonese+English) (#618)
|
2024-03-01 15:20:56 +08:00 |
|
Fangjun Kuang
|
c093880d7c
|
Fix building wheels (#620)
|
2024-03-01 15:20:06 +08:00 |
|
Fangjun Kuang
|
ee37d9bd92
|
Support RISC-V (#609)
|
2024-02-26 06:57:18 +08:00 |
|
Fangjun Kuang
|
16ba7e274a
|
Add WebAssembly for ASR (#604)
|
2024-02-23 17:39:11 +08:00 |
|
Fangjun Kuang
|
a2df3535b7
|
Install wasm tts in a separate directory (#600)
|
2024-02-22 11:30:08 +08:00 |
|
Fangjun Kuang
|
7c22398dd8
|
Publish wasm tts to model scope. (#599)
|
2024-02-22 09:57:05 +08:00 |
|
Fangjun Kuang
|
7c4b59932a
|
Refactor WebAssembly build script. (#598)
Make it easier to build WebAssembly for ASR.
|
2024-02-21 16:51:15 +08:00 |
|
Fangjun Kuang
|
25079b5c05
|
Fix CI tests. (#596)
|
2024-02-21 15:37:27 +08:00 |
|
Fangjun Kuang
|
12e5225401
|
Fix CI warnings (#590)
|
2024-02-20 15:28:47 +08:00 |
|
Fangjun Kuang
|
d2cc48ded5
|
Add more Chinese TTS models (Mandarin and Cantonese) (#589)
|
2024-02-20 15:05:35 +08:00 |
|
Fangjun Kuang
|
5f075d0fce
|
Support MinSizeRel and RelWithDebInfo build on Windows. (#586)
|
2024-02-20 10:22:02 +08:00 |
|
Fangjun Kuang
|
c68f39bd3c
|
Use onnxruntime static lib compiled with gcc8 on ubuntu 20.04 (#587)
|
2024-02-20 09:31:37 +08:00 |
|
Fangjun Kuang
|
64007a6193
|
Support building debug version on Windows (#583)
|
2024-02-18 10:39:55 +08:00 |
|
Fangjun Kuang
|
81da0fb7a6
|
Update onnxruntime from 1.16.3 to 1.17.0 (#581)
|
2024-02-17 12:43:42 +08:00 |
|