enginex-mr_series-sherpa-onnx

EngineX-Iluvatar/enginex-mr_series-sherpa-onnx

Archived

Author	SHA1	Message	Date
Fangjun Kuang	6686c7d3e6	Add dict_dir arg to c api to support Chinese TTS models using jieba (#809 )	2024-04-25 12:28:31 +08:00
Fangjun Kuang	c1608b3524	Support CED models (#792 )	2024-04-19 15:20:37 +08:00
Fangjun Kuang	13730ecbd8	Add C API for punctuation (#768 )	2024-04-14 19:02:34 +08:00
Fangjun Kuang	f204e62b44	Add C API for audio tagging (#754 )	2024-04-11 14:18:43 +08:00
Fangjun Kuang	a5f8fbc83f	Support heteronyms in Chinese TTS (#738 )	2024-04-08 11:01:30 +08:00
Fangjun Kuang	c1c0f5bafd	return timestamps for WebAssembly (#737 )	2024-04-05 20:24:27 +08:00
Fangjun Kuang	dbff2eaadb	Add C API for streaming HLG decoding (#734 )	2024-04-05 10:31:20 +08:00
Fangjun Kuang	2e0bccad36	Add C API for speaker embedding extractor. (#711 )	2024-03-28 18:05:40 +08:00
Leo Huang	638f48f47a	Added progress for callback of tts generator (#712 ) Co-authored-by: leohwang <leohwang@360converter.com>	2024-03-28 17:12:20 +08:00
Fangjun Kuang	69c7880c4d	Add Golang API for VAD (#708 )	2024-03-27 12:09:39 +08:00
Fangjun Kuang	4e040c596e	Support including TTS conditionally. (#699 )	2024-03-26 17:21:35 +08:00
Fangjun Kuang	ab7cff2513	Add C API for spoken language identification. (#695 )	2024-03-25 15:16:47 +08:00
Fangjun Kuang	1952772654	Add timestamps and tokens for .Net's online models. (#690 )	2024-03-23 18:51:56 +08:00
Fangjun Kuang	acf0975153	Support whisper language/task in various language bindings. (#679 )	2024-03-20 16:43:35 +08:00
Viggo	842d04d7ae	support whisper language (#678 )	2024-03-20 10:16:22 +08:00
Lovemefan	009ed2cd30	add WebAssembly for Kws (#648 )	2024-03-11 21:02:31 +08:00
xinhecuican	f43139e803	c++ api for keyword spotter (#642 )	2024-03-11 10:23:46 +08:00
Fangjun Kuang	3232dff2cf	Support user provided data in tts callback. (#653 )	2024-03-09 18:15:03 +08:00
Fangjun Kuang	ed06ced16f	Add WebAssembly for NodeJS. (#628 )	2024-03-03 20:00:36 +08:00
Fangjun Kuang	d771762868	Support WebAssembly for text-to-speech (#577 )	2024-02-08 23:39:12 +08:00
Fangjun Kuang	e475e750ac	Support streaming zipformer CTC (#496 ) * Support streaming zipformer CTC * test online zipformer2 CTC * Update doc of sherpa-onnx.cc * Add Python APIs for streaming zipformer2 ctc * Add Python API examples for streaming zipformer2 ctc * Swift API for streaming zipformer2 CTC * NodeJS API for streaming zipformer2 CTC * Kotlin API for streaming zipformer2 CTC * Golang API for streaming zipformer2 CTC * C# API for streaming zipformer2 CTC * Release v1.9.6	2023-12-22 13:46:33 +08:00
Fangjun Kuang	99ff6a834c	Play generated audio as it is generating. (#457 )	2023-12-02 15:35:11 +08:00
Fangjun Kuang	62dc3c3e46	Use piper-phonemize to convert text to token IDs (#453 )	2023-11-30 23:57:43 +08:00
Fangjun Kuang	fe977b8e8e	support nodejs (#438 )	2023-11-21 23:20:08 +08:00
Fangjun Kuang	1937717705	Add MFC TTS example on Windows (#378 )	2023-10-21 00:13:07 +08:00
Fangjun Kuang	a69d0a950e	Add Go API for TTS (#377 )	2023-10-20 15:57:52 +08:00
Fangjun Kuang	ea7c45b60c	Add C API for offline TTS. (#373 )	2023-10-19 17:38:23 +08:00
yujinqiu	d01682d968	Add vad clear api for better performance (#366 ) * Add vad clear api for better performance * rename to make naming consistent and remove macro * Fix linker error * Fix Vad.kt	2023-10-16 14:40:47 +08:00
yujinqiu	f6566c8ace	Expose VAD isDetected api to Swift (#356 )	2023-10-12 15:11:58 +08:00
Nick Fisher	b3e9986825	Add CreateOnlineStreamWithHotwords to C API (#323 ) * add default visibility to SHERPA_ONNX_EXPORT * expose CreateOnlineStreamWithHotwords method via C API Co-authored-by: Nick Fisher <nick.fisher@polyvox.app>	2023-09-19 17:32:42 +08:00
Wei Kang	a5d1c90807	Support c-api (#317 )	2023-09-18 16:24:57 +08:00
Fangjun Kuang	692a47dd80	Add Swift example for generating subtitles (#318 )	2023-09-18 15:16:54 +08:00
Fangjun Kuang	e2be532b32	Add timestamps for offline paraformer (#310 )	2023-09-14 19:33:41 +08:00
Fangjun Kuang	debab7c091	Add two-pass speech recognition Android/iOS demo (#304 )	2023-09-12 15:40:16 +08:00
Fangjun Kuang	e31f9e48c2	Fix various language binding APIs for tdnn and whisper models (#278 )	2023-08-16 22:15:10 +08:00
Fangjun Kuang	bc791d4996	Fix C api for Go and MFC to support streaming paraformer (#268 )	2023-08-14 17:02:23 +08:00
Fangjun Kuang	a8bdb4b38a	Support paraformer on iOS (#265 ) * Fix C API to support streaming paraformer * Fix Swift API * Support paraformer in iOS	2023-08-14 14:38:41 +08:00
Fangjun Kuang	79c2ce5dd4	Refactor online recognizer (#250 ) * Refactor online recognizer. Make it easier to support other streaming models. Note that it is a breaking change for the Python API. `sherpa_onnx.OnlineRecognizer()` used before should be replaced by `sherpa_onnx.OnlineRecognizer.from_transducer()`.	2023-08-09 20:27:31 +08:00
Wilson Wongso	64efbd82af	Implement Tokens in Swift and Kotlin (#227 ) Co-authored-by: duc <duc@appiphany.com.au>	2023-08-05 18:37:03 +08:00
Wilson Wongso	5a6b55c5a7	Reduce model initialization time for online speech recognition (#215 ) * Reduce model initialization time for online speech recognition * Fixed Styling --------- Co-authored-by: w11wo <wilsowong961@gmail.com>	2023-07-14 21:20:10 +08:00
Fangjun Kuang	f3206c49dc	Reduce model initialization time for offline speech recognition (#213 )	2023-07-14 18:07:27 +08:00
Fangjun Kuang	0abd7ce881	Add non-streaming speech recognition examples for MFC (#212 )	2023-07-14 17:00:14 +08:00
Jingzhao Ou	0ed501b8f1	Added provider option to sherpa-onnx and decode-file-c-api (#162 )	2023-06-03 04:57:48 +08:00
Fangjun Kuang	959f13eac8	Fix typos in .Net APIs (#156 )	2023-05-14 22:32:01 +08:00
Fangjun Kuang	7969cf44ac	Refactor C# code and support building nuget packages for cross-platforms (#144 )	2023-05-10 14:53:04 +08:00
manyeyes	d49a597431	Fix bug in DecodeMultipleOnlineStreams method in c-api.cc file (#128 ) Co-authored-by: zx <12345678>	2023-04-21 13:48:44 +08:00
Fangjun Kuang	5f31b22c12	Fix modified beam search for iOS and android (#76 ) * Use Int type for sampling rate * Fix swift * Fix iOS	2023-03-03 15:18:31 +08:00
Fangjun Kuang	7f72c13d9a	Code refactoring (#74 ) * Don't reset model state and feature extractor on endpointing * support passing decoding_method from commandline * Add modified_beam_search to Python API * fix C API example * Fix style issues	2023-03-03 12:10:59 +08:00
Fangjun Kuang	c63c4c3389	C api (#60 )	2023-02-24 16:42:46 +08:00

49 Commits