Commit Graph

728 Commits

Author SHA1 Message Date
Fangjun Kuang
cd1fedaa49 Add Dart API for audio tagging (#1181) 2024-07-29 11:15:14 +08:00
Fangjun Kuang
69b6b47d91 Add vad with non-streaming ASR examples for Dart API (#1180) 2024-07-28 23:01:03 +08:00
Fangjun Kuang
d279c8d20e Add more Python examples for SenseVoice (#1179) 2024-07-28 21:54:38 +08:00
Fangjun Kuang
9e005f53c3 fix building MFC examples (#1178) 2024-07-28 14:07:25 +08:00
Fangjun Kuang
1f8e575133 Add TTS example for Java API. (#1176)
It plays the generated audio as it is still generating.
2024-07-28 12:07:19 +08:00
Fangjun Kuang
4e6aeff07e Refactor C API to prefix each API with SherpaOnnx. (#1171) 2024-07-26 18:47:02 +08:00
Fangjun Kuang
994c3e7c96 Add VAD + Non-streaming ASR example for JavaScript API. (#1170) 2024-07-26 12:42:08 +08:00
Fangjun Kuang
299f1a852b Fix style issues reported by clang-tidy (#1167) 2024-07-23 09:26:36 +08:00
thewh1teagle
d32a46169f feat: add directml support (#1153) 2024-07-22 23:50:48 +08:00
Fangjun Kuang
ea1d81bdfe C api example for sense voice (#1165) 2024-07-22 16:54:00 +08:00
Fangjun Kuang
dd300b1de5 Add Java and Kotlin API for sense voice (#1164) 2024-07-22 14:08:40 +08:00
Fangjun Kuang
ac8223bd8a Add Dart API for keyword spotter (#1162) 2024-07-22 10:53:34 +08:00
thewh1teagle
22a262f5e4 feat: add stt c api example (#1156) 2024-07-22 10:32:12 +08:00
Fangjun Kuang
1a471595a5 Fix Android build (#1161) 2024-07-22 09:27:30 +08:00
Fangjun Kuang
ffdb23a8ec Add dart API for SenseVoice (#1159) 2024-07-21 21:48:12 +08:00
Fangjun Kuang
70d14353bb Add WebAssembly for SenseVoice (#1158) 2024-07-21 15:39:55 +08:00
Fangjun Kuang
c3260ef842 Add JavaScript API for SenseVoice (#1157) 2024-07-21 10:14:14 +08:00
Fangjun Kuang
8f4d332aab Add Go API for SenseVoice (#1154) 2024-07-20 23:41:53 +08:00
Fangjun Kuang
e472180f2c Add C# API for SenseVoice models (#1151) 2024-07-20 17:09:23 +08:00
Fangjun Kuang
25f0a10468 Add C++ runtime for SenseVoice models (#1148) 2024-07-18 22:54:18 +08:00
Fangjun Kuang
3bae5c3fe5 test exported sense voice models (#1147) 2024-07-18 12:12:44 +08:00
Fangjun Kuang
346f419f39 export sense-voice to onnx (#1144) 2024-07-18 00:18:38 +08:00
Fangjun Kuang
4198d9a166 Provide pre-built wheels with CUDA support. (#1143) 2024-07-17 22:59:13 +08:00
Fangjun Kuang
803c02db0a publish all pre-built wheels to huggingface (#1142)
pypi.org provides only 10GB of free space for open-source projects.

Each new release of sherpa-onnx occupies about 800MB, so we have to delete previous releases otherwise pypi.org refuses to accept new releases due to limited spaces.

To let users install previous versions, we also publish wheels to huggingface and users can find them at

https://k2-fsa.github.io/sherpa/onnx/cpu.html
and
https://k2-fsa.github.io/sherpa/onnx/cpu-cn.html (for users without access to huggingface.co)
2024-07-17 14:41:27 +08:00
Fangjun Kuang
9e448d03bc Provide npm package for 32-bit Windows x86 (#1141) 2024-07-17 12:33:15 +08:00
Wei Kang
5b1fa8750f Fix hotwords OOV log (#1139) 2024-07-16 19:41:31 +08:00
Fangjun Kuang
960eb7529e Add C++ runtime for MeloTTS (#1138) 2024-07-16 15:55:02 +08:00
Fangjun Kuang
95485411fa Support English for MeloTTS models. (#1134) 2024-07-15 19:49:22 +08:00
Fangjun Kuang
fa07bbc176 Add APK for small paraformer (#1133) 2024-07-15 19:44:36 +08:00
Fangjun Kuang
b2c283fa2b Add Swift API for adding punctuations to text. (#1132) 2024-07-15 15:30:40 +08:00
Manickavela
11cfd33b10 encoder only trt ep for transducer (#1130) 2024-07-15 14:52:33 +08:00
Fangjun Kuang
c35200dccf Revert to onnxruntime 1.17.1 (#1131) 2024-07-15 14:24:08 +08:00
Fangjun Kuang
04c2319c2c Export MeloTTS to ONNX (#1129) 2024-07-15 10:47:19 +08:00
ivan provalov
de04b3b9bf Allow modify model config at decode time for ASR (#1124) 2024-07-13 22:30:47 +08:00
Fangjun Kuang
ab71c3976d Add int8 quantized whisper large models (#1126) 2024-07-13 22:30:06 +08:00
Fangjun Kuang
3951a12f8d Add pre-trained models for the Libriheavy dataset (#1122) 2024-07-13 19:21:13 +08:00
Fangjun Kuang
b5093e27f9 Fix publishing apks to huggingface (#1121)
Save APKs for each release in a separate directory.

Huggingface requires that each directory cannot contain more than 1000 files.

Since we have so many tts models and for each model we need to build APKs of 4 different ABIs,
it is a workaround for the huggingface's constraint by placing them into separate directories for different releases.
2024-07-13 16:14:00 +08:00
Fangjun Kuang
54e6e962bf Add microphone example for .Net keyword spotting (#1120) 2024-07-13 14:56:39 +08:00
Fangjun Kuang
4c626e89cf Fix releasing dart packages (#1117) 2024-07-13 10:51:54 +08:00
Fangjun Kuang
117cd7bb8c Support whisper large/large-v1/large-v2/large-v3 and distil-large-v2 (#1114) 2024-07-12 23:47:39 +08:00
Fangjun Kuang
d928f77d0e Add timestamps about streaming models for Swift API (#1113) 2024-07-12 17:39:46 +08:00
thewh1teagle
c0eaf86dbd feat: find best embedding matches (#1102) 2024-07-11 09:38:06 +08:00
Fangjun Kuang
1c104ea847 Update onnxruntime from v1.18.0 to v1.18.1 (#1107) 2024-07-11 09:35:28 +08:00
Fangjun Kuang
5a2603ff5c Handle invalid utf8 sequence from Whisper for Dart API. (#1106)
Fixes #1104
2024-07-10 21:48:23 +08:00
Fangjun Kuang
08c758520f Add keyword spotting for C# (#1105) 2024-07-10 21:18:46 +08:00
Fangjun Kuang
dd0ff2ca06 Support onnxruntime 1.18.0 (#906) 2024-07-10 17:05:26 +08:00
Fangjun Kuang
9e446b8501 Fix typos (#1101) 2024-07-09 20:08:47 +08:00
Fangjun Kuang
c2cc9dec58 Add Flush to VAD so that the last segment can be detected. (#1099) 2024-07-09 16:15:56 +08:00
Manix
3e4307e2fb updating trt workspace int64 (#1094)
Signed-off-by: Manix <manickavela1998@gmail.com>
2024-07-08 20:38:16 +08:00
Fangjun Kuang
4fd0493037 fix ios demo image for flutter tts (#1093) 2024-07-08 17:07:01 +08:00