enginex_bi_series-sherpa-onnx

EngineX-Iluvatar/enginex_bi_series-sherpa-onnx

Archived

Author	SHA1	Message	Date
Fangjun Kuang	31d6206fde	HarmonyOS support for VAD. (#1561 )	2024-11-24 16:29:24 +08:00
Fangjun Kuang	2ca2985d04	Add C and C++ API for Moonshine models (#1476 )	2024-10-26 23:24:46 +08:00
Fangjun Kuang	ceb69ebd94	Add C++ API for non-streaming ASR (#1456 )	2024-10-23 16:40:12 +08:00
Fangjun Kuang	effd5ef2be	Add C++ API for streaming ASR. (#1455 ) It is a wrapper around the C API.	2024-10-23 12:07:43 +08:00
Fangjun Kuang	1ed803adc1	Dart API for speaker diarization (#1418 )	2024-10-11 21:17:41 +08:00
Fangjun Kuang	1d061df355	WebAssembly exmaple for speaker diarization (#1411 )	2024-10-10 22:14:45 +08:00
Fangjun Kuang	d468527f62	C API for speaker diarization (#1402 )	2024-10-09 17:10:03 +08:00
lxiao336	06b61ccad8	Allow more online models to load tokens file from the memory (#1352 ) Co-authored-by: xiao <shawl336@6163.com>	2024-09-20 16:38:41 +08:00
Fangjun Kuang	e7ffcbd677	Add APIs about max speech duration in VAD for various programming languages (#1349 )	2024-09-14 12:30:13 +08:00
Fangjun Kuang	544857b097	Fix building (#1343 )	2024-09-13 13:33:52 +08:00
lxiao336	65cfa7548a	re-pull-request allow tokens and hotwords be loaded from buffered string driectly (#1339 ) Co-authored-by: xiao <shawl336@163.com>	2024-09-13 09:58:17 +08:00
Fangjun Kuang	537e163dd0	WebAssembly example for VAD + Non-streaming ASR (#1284 )	2024-08-24 13:24:52 +08:00
Fangjun Kuang	5a2aa110b8	Text to speech API for Object Pascal. (#1273 )	2024-08-20 20:52:16 +08:00
Robin Zhong	62c4d4ab62	Add emotion, event of SenseVoice. (#1257 ) * Add emotion, event of SenseVoice. * Fix tokens size check and update java api. https://github.com/k2-fsa/sherpa-onnx/pull/1257	2024-08-14 15:50:13 +08:00
Fangjun Kuang	94e256244d	Add blank penalty for various language bindings. (#1234 )	2024-08-08 10:43:31 +08:00
Parth Khiera	ba4cb6169f	feat: addition of blank_penalty config in online_recognizer (#1232 )	2024-08-08 09:10:17 +08:00
Fangjun Kuang	4e6aeff07e	Refactor C API to prefix each API with SherpaOnnx. (#1171 )	2024-07-26 18:47:02 +08:00
Fangjun Kuang	25f0a10468	Add C++ runtime for SenseVoice models (#1148 )	2024-07-18 22:54:18 +08:00
Fangjun Kuang	960eb7529e	Add C++ runtime for MeloTTS (#1138 )	2024-07-16 15:55:02 +08:00
ivan provalov	de04b3b9bf	Allow modify model config at decode time for ASR (#1124 )	2024-07-13 22:30:47 +08:00
thewh1teagle	c0eaf86dbd	feat: find best embedding matches (#1102 )	2024-07-11 09:38:06 +08:00
Fangjun Kuang	c2cc9dec58	Add Flush to VAD so that the last segment can be detected. (#1099 )	2024-07-09 16:15:56 +08:00
Fangjun Kuang	9dd0e03568	Enable to stop TTS generation (#1041 )	2024-06-22 18:18:36 +08:00
Fangjun Kuang	6789c909d2	Inverse text normalization API of streaming ASR for various programming languages (#1022 )	2024-06-18 13:42:17 +08:00
Fangjun Kuang	6e09933d99	Inverse text normalization API for other programming languages (#1019 )	2024-06-17 17:02:39 +08:00
Fangjun Kuang	fd5a0d1e00	Add C++ runtime for Tele-AI/TeleSpeech-ASR (#970 )	2024-06-05 00:26:40 +08:00
9728Lin	9edb78e21b	Update c-api.h to hotwords (#962 )	2024-06-03 16:26:12 +08:00
Leo Huang	d45223034c	Added tokens, tokens_arr and json for offline recongnizer result (#936 ) Co-authored-by: leo <webmaster@360converter.com>	2024-05-29 12:53:28 +08:00
FakeEnd	a6c9b7986f	Changed the comment to the API GetKeywordResult input parameter description (#937 )	2024-05-29 12:45:58 +08:00
hantengc	1371c6b3f0	提供设置关键词的api，方便动态调整关键词来进行识别 (#923 )	2024-05-27 19:07:26 +08:00
Fangjun Kuang	8af2af8466	Add tail_paddings to Whisper C API. (#886 )	2024-05-17 09:20:07 +08:00
Fangjun Kuang	03c956a317	Add keyword spotting API for node-addon-api (#877 )	2024-05-14 20:26:48 +08:00
Fangjun Kuang	031134b4d4	Add TTS for node-addon-api (#871 )	2024-05-13 19:24:09 +08:00
Fangjun Kuang	6686c7d3e6	Add dict_dir arg to c api to support Chinese TTS models using jieba (#809 )	2024-04-25 12:28:31 +08:00
Fangjun Kuang	c1608b3524	Support CED models (#792 )	2024-04-19 15:20:37 +08:00
Fangjun Kuang	13730ecbd8	Add C API for punctuation (#768 )	2024-04-14 19:02:34 +08:00
Fangjun Kuang	f204e62b44	Add C API for audio tagging (#754 )	2024-04-11 14:18:43 +08:00
Fangjun Kuang	a5f8fbc83f	Support heteronyms in Chinese TTS (#738 )	2024-04-08 11:01:30 +08:00
Fangjun Kuang	c1c0f5bafd	return timestamps for WebAssembly (#737 )	2024-04-05 20:24:27 +08:00
Fangjun Kuang	dbff2eaadb	Add C API for streaming HLG decoding (#734 )	2024-04-05 10:31:20 +08:00
Fangjun Kuang	2e0bccad36	Add C API for speaker embedding extractor. (#711 )	2024-03-28 18:05:40 +08:00
Leo Huang	638f48f47a	Added progress for callback of tts generator (#712 ) Co-authored-by: leohwang <leohwang@360converter.com>	2024-03-28 17:12:20 +08:00
Fangjun Kuang	ab7cff2513	Add C API for spoken language identification. (#695 )	2024-03-25 15:16:47 +08:00
Fangjun Kuang	1952772654	Add timestamps and tokens for .Net's online models. (#690 )	2024-03-23 18:51:56 +08:00
Fangjun Kuang	acf0975153	Support whisper language/task in various language bindings. (#679 )	2024-03-20 16:43:35 +08:00
Viggo	842d04d7ae	support whisper language (#678 )	2024-03-20 10:16:22 +08:00
xinhecuican	f43139e803	c++ api for keyword spotter (#642 )	2024-03-11 10:23:46 +08:00
Fangjun Kuang	3232dff2cf	Support user provided data in tts callback. (#653 )	2024-03-09 18:15:03 +08:00
Fangjun Kuang	d771762868	Support WebAssembly for text-to-speech (#577 )	2024-02-08 23:39:12 +08:00
Fangjun Kuang	e475e750ac	Support streaming zipformer CTC (#496 ) * Support streaming zipformer CTC * test online zipformer2 CTC * Update doc of sherpa-onnx.cc * Add Python APIs for streaming zipformer2 ctc * Add Python API examples for streaming zipformer2 ctc * Swift API for streaming zipformer2 CTC * NodeJS API for streaming zipformer2 CTC * Kotlin API for streaming zipformer2 CTC * Golang API for streaming zipformer2 CTC * C# API for streaming zipformer2 CTC * Release v1.9.6	2023-12-22 13:46:33 +08:00

1 2

79 Commits