Commit Graph

188 Commits

Author SHA1 Message Date
Fangjun Kuang
677bc1da3e Add Speaker ID demo for C# (#862) 2024-05-11 13:27:33 +08:00
Fangjun Kuang
46e4e5b7ac Add C++ support for streaming NeMo CTC models. (#857) 2024-05-10 16:26:43 +08:00
Fangjun Kuang
17cd3a5f01 Add C++ runtime for non-streaming faster conformer transducer from NeMo. (#854) 2024-05-10 12:15:39 +08:00
Fangjun Kuang
5d8c35e44e Add C++ support for non-streaming NeMo fast conformer hybrid transducer ctc (the ctc branch) (#848) 2024-05-09 15:32:22 +08:00
Fangjun Kuang
5ed3ec1c04 Export non-streaming NeMo faster conformer hybrid transducer and ctc to sherpa-onnx (#847) 2024-05-09 13:59:47 +08:00
Fangjun Kuang
68b25abf27 Export NeMo FastConformer Hybrid Transducer Large Streaming to ONNX (#844) 2024-05-08 19:07:49 +08:00
Fangjun Kuang
a9f936e92b Export NeMo FastConformer Hybrid Transducer-CTC Large Streaming to ONNX. (#843) 2024-05-08 12:33:46 +08:00
Fangjun Kuang
dbaa26ff4b Publish node-addon-api npm package for linux arm64 (#841) 2024-05-07 23:05:40 +08:00
Fangjun Kuang
d2e86b0415 Add links to pre-built APKs and pre-trained models to README. (#840) 2024-05-07 12:28:42 +08:00
Fangjun Kuang
37a4135dd7 Publish npm package with node-addon-api for Windows (#838) 2024-05-06 16:21:29 +08:00
Fangjun Kuang
4f758e6cd3 Publish node-addon-api wrapper for sherpa-onnx as npm packages (#829) 2024-05-04 13:27:39 +08:00
Fangjun Kuang
2f9553d838 Begin to add node-addon-api for sherpa-onnx (#826) 2024-05-03 14:47:40 +08:00
Fangjun Kuang
fcd6024200 Fix typos in JNI TTS (#824) 2024-05-01 14:14:24 +08:00
Fangjun Kuang
cff207623e Add Java API for speaker identification (#822) 2024-04-29 21:23:56 +08:00
Fangjun Kuang
88202f05bb Add Java API for audio tagging (#820) 2024-04-28 22:26:04 +08:00
Fangjun Kuang
5407f880c0 Add Java and Kotlin API for punctuation models (#818) 2024-04-26 22:06:48 +08:00
Fangjun Kuang
db25986240 Add Java API for spoken language identification with whisper multilingual models (#817) 2024-04-26 19:05:39 +08:00
Fangjun Kuang
612002da57 Fix C# to support Chinese tts models using jieba (#815) 2024-04-26 11:50:07 +08:00
Fangjun Kuang
c693676d20 Fix building wheels for macOS (#814) 2024-04-26 10:05:39 +08:00
Fangjun Kuang
15772d2150 Add Java API for text-to-speech (#811) 2024-04-26 09:26:39 +08:00
Fangjun Kuang
f7b3735621 Add CTC HLG decoding for JNI (#810) 2024-04-25 17:20:02 +08:00
Fangjun Kuang
83cd533f67 Add Java API for non-streaming ASR (#807) 2024-04-24 21:03:26 +08:00
Fangjun Kuang
c3a2e8a67c Refactor Java API (#806) 2024-04-24 18:41:48 +08:00
Fangjun Kuang
c7691650d7 Fix CI tests (#804) 2024-04-24 13:01:06 +08:00
Fangjun Kuang
9b67a476e6 Refactor the JNI interface to make it more modular and maintainable (#802) 2024-04-24 09:48:42 +08:00
Fangjun Kuang
c1608b3524 Support CED models (#792) 2024-04-19 15:20:37 +08:00
Fangjun Kuang
d97a283dbb Add Android demo for spoken language identification using Whisper multilingual models (#783) 2024-04-18 14:33:59 +08:00
Fangjun Kuang
3a43049ba1 Add JNI support for spoken language identification (#782) 2024-04-17 19:27:15 +08:00
Fangjun Kuang
69440e481f Add WearOS demo for audio tagging (#777) 2024-04-17 12:22:17 +08:00
Fangjun Kuang
bcd9e48150 Add Android demo for audio tagging (#776)
See https://k2-fsa.github.io/sherpa/onnx/audio-tagging/apk.html
2024-04-16 20:47:16 +08:00
Fangjun Kuang
13730ecbd8 Add C API for punctuation (#768) 2024-04-14 19:02:34 +08:00
Fangjun Kuang
329fe1aa8b Support adding punctuations to the speech recogntion result (#761) 2024-04-13 12:15:57 +08:00
Fangjun Kuang
f204e62b44 Add C API for audio tagging (#754) 2024-04-11 14:18:43 +08:00
Fangjun Kuang
042976ea6e Add C++ microphone examples for audio tagging (#749) 2024-04-10 21:00:35 +08:00
Fangjun Kuang
f20291cadc Support audio tagging using zipformer (#747) 2024-04-10 14:47:06 +08:00
Fangjun Kuang
6fb8ceda57 Add VAD examples using ALSA for recording (#739) 2024-04-08 16:41:01 +08:00
Fangjun Kuang
a5f8fbc83f Support heteronyms in Chinese TTS (#738) 2024-04-08 11:01:30 +08:00
Fangjun Kuang
dbff2eaadb Add C API for streaming HLG decoding (#734) 2024-04-05 10:31:20 +08:00
Fangjun Kuang
db67e00c77 Add HLG decoding for streaming CTC models (#731) 2024-04-03 21:31:42 +08:00
Fangjun Kuang
2ededa7e98 Fix building wasm in CI (#720) 2024-03-31 20:50:56 +08:00
Fangjun Kuang
6da4a1c12f Add Go API for speaker identification (#718) 2024-03-29 19:25:55 +08:00
Fangjun Kuang
2e0bccad36 Add C API for speaker embedding extractor. (#711) 2024-03-28 18:05:40 +08:00
Fangjun Kuang
12efbf7397 Sign released TTS APKs (#710) 2024-03-27 19:34:37 +08:00
Fangjun Kuang
4e040c596e Support including TTS conditionally. (#699) 2024-03-26 17:21:35 +08:00
Fangjun Kuang
305c373107 Add C# API for spoken language identification (#697) 2024-03-25 18:45:09 +08:00
Fangjun Kuang
ab7cff2513 Add C API for spoken language identification. (#695) 2024-03-25 15:16:47 +08:00
Fangjun Kuang
0d258dd150 Support spoken language identification with whisper (#694) 2024-03-24 22:57:00 +08:00
Fangjun Kuang
3cdad9b5d1 Use manylinux in CI test (#692) 2024-03-24 07:54:32 +08:00
Fangjun Kuang
1952772654 Add timestamps and tokens for .Net's online models. (#690) 2024-03-23 18:51:56 +08:00
Fangjun Kuang
2fc1201924 Add hotwords support to .Net (#689) 2024-03-22 21:40:42 +08:00