EngineX-Iluvatar/enginex_bi_series-sherpa-onnx

Archived

Author	SHA1	Message	Date
Fangjun Kuang	cd1fedaa49	Add Dart API for audio tagging (#1181)	2024-07-29 11:15:14 +08:00
Fangjun Kuang	69b6b47d91	Add vad with non-streaming ASR examples for Dart API (#1180)	2024-07-28 23:01:03 +08:00
Fangjun Kuang	d279c8d20e	Add more Python examples for SenseVoice (#1179)	2024-07-28 21:54:38 +08:00
Fangjun Kuang	9e005f53c3	fix building MFC examples (#1178)	2024-07-28 14:07:25 +08:00
Fangjun Kuang	1f8e575133	Add TTS example for Java API. (#1176) It plays the generated audio while the rest is still being generated.	2024-07-28 12:07:19 +08:00
Fangjun Kuang	4e6aeff07e	Refactor C API to prefix each API with SherpaOnnx. (#1171)	2024-07-26 18:47:02 +08:00
Fangjun Kuang	994c3e7c96	Add VAD + Non-streaming ASR example for JavaScript API. (#1170)	2024-07-26 12:42:08 +08:00
Fangjun Kuang	299f1a852b	Fix style issues reported by clang-tidy (#1167)	2024-07-23 09:26:36 +08:00
thewh1teagle	d32a46169f	feat: add directml support (#1153)	2024-07-22 23:50:48 +08:00
Fangjun Kuang	ea1d81bdfe	C api example for sense voice (#1165)	2024-07-22 16:54:00 +08:00
Fangjun Kuang	dd300b1de5	Add Java and Kotlin API for sense voice (#1164)	2024-07-22 14:08:40 +08:00
Fangjun Kuang	ac8223bd8a	Add Dart API for keyword spotter (#1162)	2024-07-22 10:53:34 +08:00
thewh1teagle	22a262f5e4	feat: add stt c api example (#1156)	2024-07-22 10:32:12 +08:00
Fangjun Kuang	1a471595a5	Fix Android build (#1161)	2024-07-22 09:27:30 +08:00
Fangjun Kuang	ffdb23a8ec	Add dart API for SenseVoice (#1159)	2024-07-21 21:48:12 +08:00
Fangjun Kuang	70d14353bb	Add WebAssembly for SenseVoice (#1158)	2024-07-21 15:39:55 +08:00
Fangjun Kuang	c3260ef842	Add JavaScript API for SenseVoice (#1157)	2024-07-21 10:14:14 +08:00
Fangjun Kuang	8f4d332aab	Add Go API for SenseVoice (#1154)	2024-07-20 23:41:53 +08:00
Fangjun Kuang	e472180f2c	Add C# API for SenseVoice models (#1151)	2024-07-20 17:09:23 +08:00
Fangjun Kuang	25f0a10468	Add C++ runtime for SenseVoice models (#1148)	2024-07-18 22:54:18 +08:00
Fangjun Kuang	3bae5c3fe5	test exported sense voice models (#1147)	2024-07-18 12:12:44 +08:00
Fangjun Kuang	346f419f39	export sense-voice to onnx (#1144)	2024-07-18 00:18:38 +08:00
Fangjun Kuang	4198d9a166	Provide pre-built wheels with CUDA support. (#1143)	2024-07-17 22:59:13 +08:00
Fangjun Kuang	803c02db0a	publish all pre-built wheels to huggingface (#1142) pypi.org provides only 10GB of free space for open-source projects. Each new release of sherpa-onnx occupies about 800MB, so we have to delete previous releases; otherwise pypi.org refuses to accept new releases due to limited space. To let users install previous versions, we also publish wheels to huggingface, and users can find them at https://k2-fsa.github.io/sherpa/onnx/cpu.html and https://k2-fsa.github.io/sherpa/onnx/cpu-cn.html (for users without access to huggingface.co)	2024-07-17 14:41:27 +08:00
Fangjun Kuang	9e448d03bc	Provide npm package for 32-bit Windows x86 (#1141)	2024-07-17 12:33:15 +08:00
Wei Kang	5b1fa8750f	Fix hotwords OOV log (#1139)	2024-07-16 19:41:31 +08:00
Fangjun Kuang	960eb7529e	Add C++ runtime for MeloTTS (#1138)	2024-07-16 15:55:02 +08:00
Fangjun Kuang	95485411fa	Support English for MeloTTS models. (#1134)	2024-07-15 19:49:22 +08:00
Fangjun Kuang	fa07bbc176	Add APK for small paraformer (#1133)	2024-07-15 19:44:36 +08:00
Fangjun Kuang	b2c283fa2b	Add Swift API for adding punctuations to text. (#1132)	2024-07-15 15:30:40 +08:00
Manickavela	11cfd33b10	encoder only trt ep for transducer (#1130)	2024-07-15 14:52:33 +08:00
Fangjun Kuang	c35200dccf	Revert to onnxruntime 1.17.1 (#1131)	2024-07-15 14:24:08 +08:00
Fangjun Kuang	04c2319c2c	Export MeloTTS to ONNX (#1129)	2024-07-15 10:47:19 +08:00
ivan provalov	de04b3b9bf	Allow modify model config at decode time for ASR (#1124)	2024-07-13 22:30:47 +08:00
Fangjun Kuang	ab71c3976d	Add int8 quantized whisper large models (#1126)	2024-07-13 22:30:06 +08:00
Fangjun Kuang	3951a12f8d	Add pre-trained models for the Libriheavy dataset (#1122)	2024-07-13 19:21:13 +08:00
Fangjun Kuang	b5093e27f9	Fix publishing apks to huggingface (#1121) Save APKs for each release in a separate directory. Huggingface does not allow a directory to contain more than 1000 files. Since we have so many TTS models, and for each model we need to build APKs for 4 different ABIs, we work around this constraint by placing the APKs into separate directories for different releases.	2024-07-13 16:14:00 +08:00
Fangjun Kuang	54e6e962bf	Add microphone example for .Net keyword spotting (#1120)	2024-07-13 14:56:39 +08:00
Fangjun Kuang	4c626e89cf	Fix releasing dart packages (#1117)	2024-07-13 10:51:54 +08:00
Fangjun Kuang	117cd7bb8c	Support whisper large/large-v1/large-v2/large-v3 and distil-large-v2 (#1114)	2024-07-12 23:47:39 +08:00
Fangjun Kuang	d928f77d0e	Add timestamps about streaming models for Swift API (#1113)	2024-07-12 17:39:46 +08:00
thewh1teagle	c0eaf86dbd	feat: find best embedding matches (#1102)	2024-07-11 09:38:06 +08:00
Fangjun Kuang	1c104ea847	Update onnxruntime from v1.18.0 to v1.18.1 (#1107)	2024-07-11 09:35:28 +08:00
Fangjun Kuang	5a2603ff5c	Handle invalid utf8 sequence from Whisper for Dart API. (#1106) Fixes #1104	2024-07-10 21:48:23 +08:00
Fangjun Kuang	08c758520f	Add keyword spotting for C# (#1105)	2024-07-10 21:18:46 +08:00
Fangjun Kuang	dd0ff2ca06	Support onnxruntime 1.18.0 (#906)	2024-07-10 17:05:26 +08:00
Fangjun Kuang	9e446b8501	Fix typos (#1101)	2024-07-09 20:08:47 +08:00
Fangjun Kuang	c2cc9dec58	Add Flush to VAD so that the last segment can be detected. (#1099)	2024-07-09 16:15:56 +08:00
Manix	3e4307e2fb	updating trt workspace int64 (#1094) Signed-off-by: Manix <manickavela1998@gmail.com>	2024-07-08 20:38:16 +08:00
Fangjun Kuang	4fd0493037	fix ios demo image for flutter tts (#1093)	2024-07-08 17:07:01 +08:00

728 Commits