Commit Graph

166 Commits

Author SHA1 Message Date
Fangjun Kuang
c3a2e8a67c Refactor Java API (#806) 2024-04-24 18:41:48 +08:00
Fangjun Kuang
c7691650d7 Fix CI tests (#804) 2024-04-24 13:01:06 +08:00
Fangjun Kuang
9b67a476e6 Refactor the JNI interface to make it more modular and maintainable (#802) 2024-04-24 09:48:42 +08:00
Fangjun Kuang
c1608b3524 Support CED models (#792) 2024-04-19 15:20:37 +08:00
Fangjun Kuang
d97a283dbb Add Android demo for spoken language identification using Whisper multilingual models (#783) 2024-04-18 14:33:59 +08:00
Fangjun Kuang
3a43049ba1 Add JNI support for spoken language identification (#782) 2024-04-17 19:27:15 +08:00
Fangjun Kuang
69440e481f Add WearOS demo for audio tagging (#777) 2024-04-17 12:22:17 +08:00
Fangjun Kuang
bcd9e48150 Add Android demo for audio tagging (#776)
See https://k2-fsa.github.io/sherpa/onnx/audio-tagging/apk.html
2024-04-16 20:47:16 +08:00
Fangjun Kuang
13730ecbd8 Add C API for punctuation (#768) 2024-04-14 19:02:34 +08:00
Fangjun Kuang
329fe1aa8b Support adding punctuations to the speech recogntion result (#761) 2024-04-13 12:15:57 +08:00
Fangjun Kuang
f204e62b44 Add C API for audio tagging (#754) 2024-04-11 14:18:43 +08:00
Fangjun Kuang
042976ea6e Add C++ microphone examples for audio tagging (#749) 2024-04-10 21:00:35 +08:00
Fangjun Kuang
f20291cadc Support audio tagging using zipformer (#747) 2024-04-10 14:47:06 +08:00
Fangjun Kuang
6fb8ceda57 Add VAD examples using ALSA for recording (#739) 2024-04-08 16:41:01 +08:00
Fangjun Kuang
a5f8fbc83f Support heteronyms in Chinese TTS (#738) 2024-04-08 11:01:30 +08:00
Fangjun Kuang
dbff2eaadb Add C API for streaming HLG decoding (#734) 2024-04-05 10:31:20 +08:00
Fangjun Kuang
db67e00c77 Add HLG decoding for streaming CTC models (#731) 2024-04-03 21:31:42 +08:00
Fangjun Kuang
2ededa7e98 Fix building wasm in CI (#720) 2024-03-31 20:50:56 +08:00
Fangjun Kuang
6da4a1c12f Add Go API for speaker identification (#718) 2024-03-29 19:25:55 +08:00
Fangjun Kuang
2e0bccad36 Add C API for speaker embedding extractor. (#711) 2024-03-28 18:05:40 +08:00
Fangjun Kuang
12efbf7397 Sign released TTS APKs (#710) 2024-03-27 19:34:37 +08:00
Fangjun Kuang
4e040c596e Support including TTS conditionally. (#699) 2024-03-26 17:21:35 +08:00
Fangjun Kuang
305c373107 Add C# API for spoken language identification (#697) 2024-03-25 18:45:09 +08:00
Fangjun Kuang
ab7cff2513 Add C API for spoken language identification. (#695) 2024-03-25 15:16:47 +08:00
Fangjun Kuang
0d258dd150 Support spoken language identification with whisper (#694) 2024-03-24 22:57:00 +08:00
Fangjun Kuang
3cdad9b5d1 Use manylinux in CI test (#692) 2024-03-24 07:54:32 +08:00
Fangjun Kuang
1952772654 Add timestamps and tokens for .Net's online models. (#690) 2024-03-23 18:51:56 +08:00
Fangjun Kuang
2fc1201924 Add hotwords support to .Net (#689) 2024-03-22 21:40:42 +08:00
Fangjun Kuang
24f437a6f1 Refactor github actions tests (#688) 2024-03-22 21:22:42 +08:00
Fangjun Kuang
c8770aec20 Add nuget package for Windows x86 (#683) 2024-03-21 14:57:01 +08:00
Fangjun Kuang
6571fc9552 Add tts play example for .Net. (#676)
It plays the generated audio via a speaker as it is generating.
2024-03-19 17:33:15 +08:00
Fangjun Kuang
f70fdd156c Support using T-head-Semi/csi-nn2 for RISC-V (#637) 2024-03-06 18:21:50 +08:00
Fangjun Kuang
13260cdf49 Use self-compiled onnxruntime shared lib. (#635) 2024-03-06 11:03:24 +08:00
Fangjun Kuang
ed06ced16f Add WebAssembly for NodeJS. (#628) 2024-03-03 20:00:36 +08:00
Fangjun Kuang
ac6825ff11 Refactor WebAssembly for nodejs (#626) 2024-03-02 12:31:36 +08:00
Fangjun Kuang
a65643b594 support onnxruntime v1.17.1 (#624) 2024-03-02 11:44:59 +08:00
Fangjun Kuang
f9db33c926 Add WebAssembly demo for streaming trilingual Paraformer (Chinese+Cantonese+English) (#618) 2024-03-01 15:20:56 +08:00
Fangjun Kuang
c093880d7c Fix building wheels (#620) 2024-03-01 15:20:06 +08:00
Fangjun Kuang
ee37d9bd92 Support RISC-V (#609) 2024-02-26 06:57:18 +08:00
Fangjun Kuang
16ba7e274a Add WebAssembly for ASR (#604) 2024-02-23 17:39:11 +08:00
Fangjun Kuang
a2df3535b7 Install wasm tts in a separate directory (#600) 2024-02-22 11:30:08 +08:00
Fangjun Kuang
7c22398dd8 Publish wasm tts to model scope. (#599) 2024-02-22 09:57:05 +08:00
Fangjun Kuang
7c4b59932a Refactor WebAssembly build script. (#598)
Make it easier to build WebAssembly for ASR.
2024-02-21 16:51:15 +08:00
Fangjun Kuang
25079b5c05 Fix CI tests. (#596) 2024-02-21 15:37:27 +08:00
Fangjun Kuang
12e5225401 Fix CI warnings (#590) 2024-02-20 15:28:47 +08:00
Fangjun Kuang
d2cc48ded5 Add more Chinese TTS models (Mandarin and Cantonese) (#589) 2024-02-20 15:05:35 +08:00
Fangjun Kuang
5f075d0fce Support MinSizeRel and RelWithDebInfo build on Windows. (#586) 2024-02-20 10:22:02 +08:00
Fangjun Kuang
c68f39bd3c Use onnxruntime static lib compiled with gcc8 on ubuntu 20.04 (#587) 2024-02-20 09:31:37 +08:00
Fangjun Kuang
64007a6193 Support building debug version on Windows (#583) 2024-02-18 10:39:55 +08:00
Fangjun Kuang
81da0fb7a6 Update onnxruntime from 1.16.3 to 1.17.0 (#581) 2024-02-17 12:43:42 +08:00