Fangjun Kuang
cd1fedaa49
Add Dart API for audio tagging ( #1181 )
2024-07-29 11:15:14 +08:00
Fangjun Kuang
69b6b47d91
Add vad with non-streaming ASR examples for Dart API ( #1180 )
2024-07-28 23:01:03 +08:00
Fangjun Kuang
d279c8d20e
Add more Python examples for SenseVoice ( #1179 )
2024-07-28 21:54:38 +08:00
Fangjun Kuang
9e005f53c3
fix building MFC examples ( #1178 )
2024-07-28 14:07:25 +08:00
Fangjun Kuang
1f8e575133
Add TTS example for Java API. ( #1176 )
...
It plays the generated audio while generation is still in progress.
2024-07-28 12:07:19 +08:00
Fangjun Kuang
4e6aeff07e
Refactor C API to prefix each API with SherpaOnnx. ( #1171 )
2024-07-26 18:47:02 +08:00
Fangjun Kuang
994c3e7c96
Add VAD + Non-streaming ASR example for JavaScript API. ( #1170 )
2024-07-26 12:42:08 +08:00
Fangjun Kuang
299f1a852b
Fix style issues reported by clang-tidy ( #1167 )
2024-07-23 09:26:36 +08:00
thewh1teagle
d32a46169f
feat: add directml support ( #1153 )
2024-07-22 23:50:48 +08:00
Fangjun Kuang
ea1d81bdfe
C api example for sense voice ( #1165 )
2024-07-22 16:54:00 +08:00
Fangjun Kuang
dd300b1de5
Add Java and Kotlin API for sense voice ( #1164 )
2024-07-22 14:08:40 +08:00
Fangjun Kuang
ac8223bd8a
Add Dart API for keyword spotter ( #1162 )
2024-07-22 10:53:34 +08:00
thewh1teagle
22a262f5e4
feat: add stt c api example ( #1156 )
2024-07-22 10:32:12 +08:00
Fangjun Kuang
1a471595a5
Fix Android build ( #1161 )
2024-07-22 09:27:30 +08:00
Fangjun Kuang
ffdb23a8ec
Add dart API for SenseVoice ( #1159 )
2024-07-21 21:48:12 +08:00
Fangjun Kuang
70d14353bb
Add WebAssembly for SenseVoice ( #1158 )
2024-07-21 15:39:55 +08:00
Fangjun Kuang
c3260ef842
Add JavaScript API for SenseVoice ( #1157 )
2024-07-21 10:14:14 +08:00
Fangjun Kuang
8f4d332aab
Add Go API for SenseVoice ( #1154 )
2024-07-20 23:41:53 +08:00
Fangjun Kuang
e472180f2c
Add C# API for SenseVoice models ( #1151 )
2024-07-20 17:09:23 +08:00
Fangjun Kuang
25f0a10468
Add C++ runtime for SenseVoice models ( #1148 )
2024-07-18 22:54:18 +08:00
Fangjun Kuang
3bae5c3fe5
test exported sense voice models ( #1147 )
2024-07-18 12:12:44 +08:00
Fangjun Kuang
346f419f39
export sense-voice to onnx ( #1144 )
2024-07-18 00:18:38 +08:00
Fangjun Kuang
4198d9a166
Provide pre-built wheels with CUDA support. ( #1143 )
2024-07-17 22:59:13 +08:00
Fangjun Kuang
803c02db0a
publish all pre-built wheels to huggingface ( #1142 )
...
pypi.org provides only 10GB of free space for open-source projects.
Each new release of sherpa-onnx occupies about 800MB, so we have to delete previous releases; otherwise, pypi.org refuses to accept new releases due to limited space.
To let users install previous versions, we also publish wheels to huggingface. Users can find them at
https://k2-fsa.github.io/sherpa/onnx/cpu.html
and
https://k2-fsa.github.io/sherpa/onnx/cpu-cn.html (for users without access to huggingface.co)
2024-07-17 14:41:27 +08:00
Fangjun Kuang
9e448d03bc
Provide npm package for 32-bit Windows x86 ( #1141 )
2024-07-17 12:33:15 +08:00
Wei Kang
5b1fa8750f
Fix hotwords OOV log ( #1139 )
2024-07-16 19:41:31 +08:00
Fangjun Kuang
960eb7529e
Add C++ runtime for MeloTTS ( #1138 )
2024-07-16 15:55:02 +08:00
Fangjun Kuang
95485411fa
Support English for MeloTTS models. ( #1134 )
2024-07-15 19:49:22 +08:00
Fangjun Kuang
fa07bbc176
Add APK for small paraformer ( #1133 )
2024-07-15 19:44:36 +08:00
Fangjun Kuang
b2c283fa2b
Add Swift API for adding punctuations to text. ( #1132 )
2024-07-15 15:30:40 +08:00
Manickavela
11cfd33b10
encoder only trt ep for transducer ( #1130 )
2024-07-15 14:52:33 +08:00
Fangjun Kuang
c35200dccf
Revert to onnxruntime 1.17.1 ( #1131 )
2024-07-15 14:24:08 +08:00
Fangjun Kuang
04c2319c2c
Export MeloTTS to ONNX ( #1129 )
2024-07-15 10:47:19 +08:00
ivan provalov
de04b3b9bf
Allow modify model config at decode time for ASR ( #1124 )
2024-07-13 22:30:47 +08:00
Fangjun Kuang
ab71c3976d
Add int8 quantized whisper large models ( #1126 )
2024-07-13 22:30:06 +08:00
Fangjun Kuang
3951a12f8d
Add pre-trained models for the Libriheavy dataset ( #1122 )
2024-07-13 19:21:13 +08:00
Fangjun Kuang
b5093e27f9
Fix publishing apks to huggingface ( #1121 )
...
Save APKs for each release in a separate directory.
Huggingface does not allow a directory to contain more than 1000 files.
Since we have many TTS models, and each model requires APKs for 4 different ABIs,
we work around this constraint by placing the APKs for different releases in separate directories.
2024-07-13 16:14:00 +08:00
Fangjun Kuang
54e6e962bf
Add microphone example for .Net keyword spotting ( #1120 )
2024-07-13 14:56:39 +08:00
Fangjun Kuang
4c626e89cf
Fix releasing dart packages ( #1117 )
2024-07-13 10:51:54 +08:00
Fangjun Kuang
117cd7bb8c
Support whisper large/large-v1/large-v2/large-v3 and distil-large-v2 ( #1114 )
2024-07-12 23:47:39 +08:00
Fangjun Kuang
d928f77d0e
Add timestamps about streaming models for Swift API ( #1113 )
2024-07-12 17:39:46 +08:00
thewh1teagle
c0eaf86dbd
feat: find best embedding matches ( #1102 )
2024-07-11 09:38:06 +08:00
Fangjun Kuang
1c104ea847
Update onnxruntime from v1.18.0 to v1.18.1 ( #1107 )
2024-07-11 09:35:28 +08:00
Fangjun Kuang
5a2603ff5c
Handle invalid utf8 sequence from Whisper for Dart API. ( #1106 )
...
Fixes #1104
2024-07-10 21:48:23 +08:00
Fangjun Kuang
08c758520f
Add keyword spotting for C# ( #1105 )
2024-07-10 21:18:46 +08:00
Fangjun Kuang
dd0ff2ca06
Support onnxruntime 1.18.0 ( #906 )
2024-07-10 17:05:26 +08:00
Fangjun Kuang
9e446b8501
Fix typos ( #1101 )
2024-07-09 20:08:47 +08:00
Fangjun Kuang
c2cc9dec58
Add Flush to VAD so that the last segment can be detected. ( #1099 )
2024-07-09 16:15:56 +08:00
Manix
3e4307e2fb
updating trt workspace int64 ( #1094 )
...
Signed-off-by: Manix <manickavela1998@gmail.com>
2024-07-08 20:38:16 +08:00
Fangjun Kuang
4fd0493037
fix ios demo image for flutter tts ( #1093 )
2024-07-08 17:07:01 +08:00