EngineX-Iluvatar/enginex_bi_series-sherpa-onnx

Archived

Author	SHA1	Message	Date
Fangjun Kuang	4e040c596e	Support including TTS conditionally. (#699 )	2024-03-26 17:21:35 +08:00
Fangjun Kuang	bd66f7a7d0	Build Android TTS APKs for coqui-ai/TTS models (#704 )	2024-03-26 14:05:26 +08:00
Fangjun Kuang	d364610605	Use a single thread when loading models (#703 )	2024-03-26 13:35:33 +08:00
Fangjun Kuang	305c373107	Add C# API for spoken language identification (#697 )	2024-03-25 18:45:09 +08:00
Fangjun Kuang	83a10a55a5	Add Swift API for spoken language identification. (#696 )	2024-03-25 16:22:25 +08:00
Fangjun Kuang	ab7cff2513	Add C API for spoken language identification. (#695 )	2024-03-25 15:16:47 +08:00
Fangjun Kuang	0d258dd150	Support spoken language identification with whisper (#694 )	2024-03-24 22:57:00 +08:00
Fangjun Kuang	3cdad9b5d1	Use manylinux in CI test (#692 )	2024-03-24 07:54:32 +08:00
Masoud	e60c897ce7	Update MainActivity.kt (#693 ): fix read-only test text box	2024-03-24 07:29:14 +08:00
Fangjun Kuang	1952772654	Add timestamps and tokens for .Net's online models. (#690 )	2024-03-23 18:51:56 +08:00
Fangjun Kuang	e6da2c5556	Fix build c api examples with alsa (#691 )	2024-03-23 16:16:24 +08:00
Karel Vesely	eaec4c83c2	Configurable low_freq high_freq, dithering (#664 )	2024-03-22 21:41:44 +08:00
Fangjun Kuang	2fc1201924	Add hotwords support to .Net (#689 )	2024-03-22 21:40:42 +08:00
Fangjun Kuang	24f437a6f1	Refactor github actions tests (#688 )	2024-03-22 21:22:42 +08:00
Masoud	1c77457d61	Update MainActivity.kt (#687 ): append a default text to the test field to check the voices faster	2024-03-22 19:04:14 +08:00
Fangjun Kuang	c8770aec20	Add nuget package for Windows x86 (#683 )	2024-03-21 14:57:01 +08:00
Fangjun Kuang	acf0975153	Support whisper language/task in various language bindings. (#679 )	2024-03-20 16:43:35 +08:00
Viggo	842d04d7ae	support whisper language (#678 )	2024-03-20 10:16:22 +08:00
Fangjun Kuang	6571fc9552	Add tts play example for .Net. (#676 ) It plays the generated audio through a speaker while the audio is still being generated.	2024-03-19 17:33:15 +08:00
foreversimon	ce60100f68	Add HotwordsFile and HotwordsScore fields to OnlineRecognizerConfig in C# API (#675 )	2024-03-19 15:04:08 +08:00
Bhaswati Saha	fda614d0d1	Make the beam search value a parameter in offline_recognizer.py (#673 ) Co-authored-by: bhascns <bhaswati@mihup.com>	2024-03-18 18:43:05 +08:00
Fangjun Kuang	9d6eb3e834	small fixes to wasm kws. (#672 )	2024-03-18 15:28:10 +08:00
Lovemefan	009ed2cd30	add WebAssembly for Kws (#648 )	2024-03-11 21:02:31 +08:00
Fangjun Kuang	a628002d8f	Release v1.9.12 (#661 )	2024-03-11 18:52:34 +08:00
Fangjun Kuang	44d0ef9ae3	Print the time about the first message in tts. (#655 )	2024-03-11 11:05:42 +08:00
xinhecuican	f43139e803	c++ api for keyword spotter (#642 )	2024-03-11 10:23:46 +08:00
Fangjun Kuang	1777a5dd88	Use onnxruntime 1.17.1 for iOS. (#654 )	2024-03-10 14:26:36 +08:00
Fangjun Kuang	3232dff2cf	Support user provided data in tts callback. (#653 )	2024-03-09 18:15:03 +08:00
GaryLaurenceauAva	ac43c2d7b6	Expose 'language' 'task' 'tailPaddings' in OfflineWhisperModelConfig (#643 ) Co-authored-by: Gary <gary.laurenceau@gmail.com>	2024-03-08 19:52:30 +08:00
Fangjun Kuang	4b708e055c	Add microphone streaming ASR example for C API (#650 )	2024-03-08 19:31:46 +08:00
Fangjun Kuang	d3287f9494	Add Python ASR examples with alsa (#646 )	2024-03-08 11:34:48 +08:00
Wei Kang	e9e8d755d9	Fix detection at the tail when using hotwords in streaming model (#638 )	2024-03-08 10:04:33 +08:00
Fangjun Kuang	f70fdd156c	Support using T-head-Semi/csi-nn2 for RISC-V (#637 )	2024-03-06 18:21:50 +08:00
Fangjun Kuang	bdf9243940	Allow to not use pre-installed onnxruntime libs. (#636 )	2024-03-06 14:40:23 +08:00
Fangjun Kuang	13260cdf49	Use self-compiled onnxruntime shared lib. (#635 )	2024-03-06 11:03:24 +08:00
Fangjun Kuang	5dc2eaf2b4	Fix building wheels from source. (#632 )	2024-03-04 16:39:51 +08:00
Fangjun Kuang	ed06ced16f	Add WebAssembly for NodeJS. (#628 )	2024-03-03 20:00:36 +08:00
Fangjun Kuang	ac6825ff11	Refactor WebAssembly for nodejs (#626 )	2024-03-02 12:31:36 +08:00
Fangjun Kuang	a65643b594	support onnxruntime v1.17.1 (#624 )	2024-03-02 11:44:59 +08:00
Fangjun Kuang	d56964371c	Support VITS models from icefall. (#625 )	2024-03-01 19:48:38 +08:00
dragon10	93836ff451	Fix the spelling of the variable num_trailing_blanks (#623 ) Signed-off-by: lonngxiang <lonngxiang@gmial.com> Co-authored-by: lonngxiang <lonngxiang@gmial.com>	2024-03-01 17:02:10 +08:00
Fangjun Kuang	e2397cd1a4	Support Android NNAPI. (#622 )	2024-03-01 16:39:48 +08:00
Fangjun Kuang	f9db33c926	Add WebAssembly demo for streaming trilingual Paraformer (Chinese+Cantonese+English) (#618 )	2024-03-01 15:20:56 +08:00
Fangjun Kuang	c093880d7c	Fix building wheels (#620 )	2024-03-01 15:20:06 +08:00
Wei Kang	734bbd91dc	Add Python API for keyword spotting (#576 ): add alsa & microphone support for keyword spotting; add Python wrapper	2024-03-01 09:31:11 +08:00
Fangjun Kuang	8b7928e7d6	Fix computing features for whisper. (#617 )	2024-02-29 16:56:29 +08:00
Karel Vesely	38c072dcb2	Track token scores (#571 ): add export of per-token scores (ys, lm, context) for the best path of the modified-beam-search decoding of transducer; refactor JSON export of OnlineRecognitionResult and extend the pybind11 API of OnlineRecognitionResult; export per-token scores also for greedy-search (online-transducer); export un-scaled lm_probs (modified-beam search, online-transducer); polishing; fill lm_probs/context_scores only if LM/ContextGraph is present (make Result smaller)	2024-02-29 06:28:45 +08:00
Fangjun Kuang	85d59b5840	Use hub.nuaa.cf to replace huggingface URL to download dependencies. (#614 )	2024-02-28 17:48:51 +08:00
Fangjun Kuang	0cb6d1b474	support using xnnpack as execution provider (#612 )	2024-02-28 17:32:48 +08:00
Fangjun Kuang	87a7030c08	Support using alsa to access the microphone with non-streaming ASR models (#517 )	2024-02-26 21:17:26 +08:00

465 Commits