Commit Graph

747 Commits

Author SHA1 Message Date
Fangjun Kuang
9ee2943ed4 Add CI tests for online punctuation models (#1226) 2024-08-06 18:10:30 +08:00
Fangjun Kuang
375c055ff8 Fix style issues for online punctuation source files (#1225) 2024-08-06 17:43:24 +08:00
jianyou
1414e4dc61 Add online punctuation and casing prediction model for English language (#1224) 2024-08-06 17:33:38 +08:00
Fangjun Kuang
52830cc910 Add MeloTTS example for ios (#1223) 2024-08-06 14:48:54 +08:00
Fangjun Kuang
6422966a7f Support passing TTS callback in Swift API (#1218) 2024-08-05 14:06:21 +08:00
Fangjun Kuang
9caa488019 Fix setting SenseVoice language. (#1214) 2024-08-04 19:02:23 +08:00
Fangjun Kuang
c2dce19140 Update README to include Rust. (#1212) 2024-08-04 12:20:05 +08:00
Fangjun Kuang
d5f486878d Remove libonnxruntime_providers_cuda.so as a dependency. (#1210) 2024-08-03 16:25:23 +08:00
Fangjun Kuang
561d04dd92 describe how to add new words for MeloTTS models (#1209) 2024-08-03 11:19:02 +08:00
Fangjun Kuang
35c1b4a7a9 Add ReazonSpeech Japanese pre-trained model (#1203) 2024-08-02 10:21:24 +08:00
Fangjun Kuang
53484fcd9b Fix reading non-standard wav files. (#1199) 2024-08-01 17:48:04 +08:00
Fangjun Kuang
ec98110e11 Add speaker identification and verification example for Dart API (#1194) 2024-07-31 13:53:52 +08:00
Fangjun Kuang
963aaba82b Add Chinese+English tts example for flutter (#1192) 2024-07-30 18:38:43 +08:00
Fangjun Kuang
c1b5fce01b Fix copying asset files for flutter examples. (#1191)
If the target file exists but has a different file size, we need
to copy the source file to the target file.
2024-07-30 18:24:56 +08:00
Fangjun Kuang
9e02f88dbb Non-streaming WebSocket client for Java. (#1190) 2024-07-30 17:21:33 +08:00
Fangjun Kuang
06fd50f536 Add test about whisper large-v3 for .Net (#1187) 2024-07-29 20:49:38 +08:00
Fangjun Kuang
86b4c9f535 Fix splitting sentences for MeloTTS (#1186) 2024-07-29 17:04:45 +08:00
Fangjun Kuang
b1711ecaa1 Fix ffmpeg c api example (#1185) 2024-07-29 14:27:55 +08:00
Fangjun Kuang
646f99c870 Dart API for adding punctuations to text (#1182) 2024-07-29 12:41:52 +08:00
Fangjun Kuang
cd1fedaa49 Add Dart API for audio tagging (#1181) 2024-07-29 11:15:14 +08:00
Fangjun Kuang
69b6b47d91 Add vad with non-streaming ASR examples for Dart API (#1180) 2024-07-28 23:01:03 +08:00
Fangjun Kuang
d279c8d20e Add more Python examples for SenseVoice (#1179) 2024-07-28 21:54:38 +08:00
Fangjun Kuang
9e005f53c3 fix building MFC examples (#1178) 2024-07-28 14:07:25 +08:00
Fangjun Kuang
1f8e575133 Add TTS example for Java API. (#1176)
It plays the generated audio while the rest is still being generated.
2024-07-28 12:07:19 +08:00
Fangjun Kuang
4e6aeff07e Refactor C API to prefix each API with SherpaOnnx. (#1171) 2024-07-26 18:47:02 +08:00
Fangjun Kuang
994c3e7c96 Add VAD + Non-streaming ASR example for JavaScript API. (#1170) 2024-07-26 12:42:08 +08:00
Fangjun Kuang
299f1a852b Fix style issues reported by clang-tidy (#1167) 2024-07-23 09:26:36 +08:00
thewh1teagle
d32a46169f feat: add directml support (#1153) 2024-07-22 23:50:48 +08:00
Fangjun Kuang
ea1d81bdfe C api example for sense voice (#1165) 2024-07-22 16:54:00 +08:00
Fangjun Kuang
dd300b1de5 Add Java and Kotlin API for sense voice (#1164) 2024-07-22 14:08:40 +08:00
Fangjun Kuang
ac8223bd8a Add Dart API for keyword spotter (#1162) 2024-07-22 10:53:34 +08:00
thewh1teagle
22a262f5e4 feat: add stt c api example (#1156) 2024-07-22 10:32:12 +08:00
Fangjun Kuang
1a471595a5 Fix Android build (#1161) 2024-07-22 09:27:30 +08:00
Fangjun Kuang
ffdb23a8ec Add dart API for SenseVoice (#1159) 2024-07-21 21:48:12 +08:00
Fangjun Kuang
70d14353bb Add WebAssembly for SenseVoice (#1158) 2024-07-21 15:39:55 +08:00
Fangjun Kuang
c3260ef842 Add JavaScript API for SenseVoice (#1157) 2024-07-21 10:14:14 +08:00
Fangjun Kuang
8f4d332aab Add Go API for SenseVoice (#1154) 2024-07-20 23:41:53 +08:00
Fangjun Kuang
e472180f2c Add C# API for SenseVoice models (#1151) 2024-07-20 17:09:23 +08:00
Fangjun Kuang
25f0a10468 Add C++ runtime for SenseVoice models (#1148) 2024-07-18 22:54:18 +08:00
Fangjun Kuang
3bae5c3fe5 test exported sense voice models (#1147) 2024-07-18 12:12:44 +08:00
Fangjun Kuang
346f419f39 export sense-voice to onnx (#1144) 2024-07-18 00:18:38 +08:00
Fangjun Kuang
4198d9a166 Provide pre-built wheels with CUDA support. (#1143) 2024-07-17 22:59:13 +08:00
Fangjun Kuang
803c02db0a publish all pre-built wheels to huggingface (#1142)
pypi.org provides only 10GB of free space for open-source projects.

Each new release of sherpa-onnx occupies about 800MB, so we have to delete previous releases; otherwise pypi.org refuses to accept new releases due to limited space.

To let users install previous versions, we also publish wheels to huggingface and users can find them at

https://k2-fsa.github.io/sherpa/onnx/cpu.html
and
https://k2-fsa.github.io/sherpa/onnx/cpu-cn.html (for users without access to huggingface.co)
2024-07-17 14:41:27 +08:00
Fangjun Kuang
9e448d03bc Provide npm package for 32-bit Windows x86 (#1141) 2024-07-17 12:33:15 +08:00
Wei Kang
5b1fa8750f Fix hotwords OOV log (#1139) 2024-07-16 19:41:31 +08:00
Fangjun Kuang
960eb7529e Add C++ runtime for MeloTTS (#1138) 2024-07-16 15:55:02 +08:00
Fangjun Kuang
95485411fa Support English for MeloTTS models. (#1134) 2024-07-15 19:49:22 +08:00
Fangjun Kuang
fa07bbc176 Add APK for small paraformer (#1133) 2024-07-15 19:44:36 +08:00
Fangjun Kuang
b2c283fa2b Add Swift API for adding punctuations to text. (#1132) 2024-07-15 15:30:40 +08:00
Manickavela
11cfd33b10 encoder only trt ep for transducer (#1130) 2024-07-15 14:52:33 +08:00