enginex-mr_series-sherpa-onnx

EngineX-Iluvatar/enginex-mr_series-sherpa-onnx

Archived

Author	SHA1	Message	Date
Fangjun Kuang	4f758e6cd3	Publish node-addon-api wrapper for sherpa-onnx as npm packages (#829 )	2024-05-04 13:27:39 +08:00
Fangjun Kuang	c7691650d7	Fix CI tests (#804 )	2024-04-24 13:01:06 +08:00
Fangjun Kuang	6b353bfb42	Add jieba for Chinese TTS models (#797 )	2024-04-21 14:47:13 +08:00
Fangjun Kuang	329fe1aa8b	Support adding punctuations to the speech recogntion result (#761 )	2024-04-13 12:15:57 +08:00
Fangjun Kuang	34d70a259f	Add Python API and Python examples for audio tagging (#753 )	2024-04-11 11:12:48 +08:00
布宝	d21c45d0ea	Add --continue to wget (#750 ) Also, switch to github mirror	2024-04-11 09:07:31 +08:00
Fangjun Kuang	042976ea6e	Add C++ microphone examples for audio tagging (#749 )	2024-04-10 21:00:35 +08:00
Fangjun Kuang	f20291cadc	Support audio tagging using zipformer (#747 )	2024-04-10 14:47:06 +08:00
Fangjun Kuang	db1b3ab1f3	Fix building OpenFst on Windows. (#744 )	2024-04-09 11:17:46 +08:00
Fangjun Kuang	0d90b34e4a	Support Chinese heteronyms on Android for TTS. (#742 )	2024-04-08 21:36:47 +08:00
Fangjun Kuang	6fb8ceda57	Add VAD examples using ALSA for recording (#739 )	2024-04-08 16:41:01 +08:00
Fangjun Kuang	a5f8fbc83f	Support heteronyms in Chinese TTS (#738 )	2024-04-08 11:01:30 +08:00
Fangjun Kuang	dbff2eaadb	Add C API for streaming HLG decoding (#734 )	2024-04-05 10:31:20 +08:00
Fangjun Kuang	db67e00c77	Add HLG decoding for streaming CTC models (#731 )	2024-04-03 21:31:42 +08:00
hantengc	ccb2d435ec	add openfst.cmake file (#707 ) 1. When compiling locally, openfst is missing.so add this file to the sherpa-onnx/cmake folder	2024-03-27 11:31:26 +08:00
Fangjun Kuang	4e040c596e	Support including TTS conditionally. (#699 )	2024-03-26 17:21:35 +08:00
Fangjun Kuang	0d258dd150	Support spoken language identification with whisper (#694 )	2024-03-24 22:57:00 +08:00
Fangjun Kuang	1952772654	Add timestamps and tokens for .Net's online models. (#690 )	2024-03-23 18:51:56 +08:00
Karel Vesely	eaec4c83c2	Configurable low_freq high_freq, dithering (#664 )	2024-03-22 21:41:44 +08:00
Fangjun Kuang	f70fdd156c	Support using T-head-Semi/csi-nn2 for RISC-V (#637 )	2024-03-06 18:21:50 +08:00
Fangjun Kuang	bdf9243940	Allow to not use pre-installed onnxruntime libs. (#636 )	2024-03-06 14:40:23 +08:00
Fangjun Kuang	13260cdf49	Use self-compiled onnxruntime shared lib. (#635 )	2024-03-06 11:03:24 +08:00
Fangjun Kuang	a65643b594	support onnxruntime v1.17.1 (#624 )	2024-03-02 11:44:59 +08:00
Fangjun Kuang	8b7928e7d6	Fix computing features for whisper. (#617 )	2024-02-29 16:56:29 +08:00
Fangjun Kuang	85d59b5840	Use hub.nuaa.cf to replace huggingface URL to download dependencies. (#614 )	2024-02-28 17:48:51 +08:00
Fangjun Kuang	0cb6d1b474	support using xnnpack as execution provider (#612 )	2024-02-28 17:32:48 +08:00
Fangjun Kuang	87a7030c08	Support using alsa to access the microphone with non-streaming ASR models (#517 )	2024-02-26 21:17:26 +08:00
Fangjun Kuang	ee37d9bd92	Support RISC-V (#609 )	2024-02-26 06:57:18 +08:00
Fangjun Kuang	67acd34dcd	Use alsa to read microphone in speaker identification demo. (#605 )	2024-02-23 19:27:51 +08:00
Fangjun Kuang	5f075d0fce	Support MinSizeRel and RelWithDebInfo build on Windows. (#586 )	2024-02-20 10:22:02 +08:00
Fangjun Kuang	c68f39bd3c	Use onnxruntime static lib compiled with gcc8 on ubuntu 20.04 (#587 )	2024-02-20 09:31:37 +08:00
Fangjun Kuang	64007a6193	Support building debug version on Windows (#583 )	2024-02-18 10:39:55 +08:00
Fangjun Kuang	81da0fb7a6	Update onnxruntime from 1.16.3 to 1.17.0 (#581 )	2024-02-17 12:43:42 +08:00
Fangjun Kuang	d771762868	Support WebAssembly for text-to-speech (#577 )	2024-02-08 23:39:12 +08:00
Fangjun Kuang	0b18ccfbb2	C++ API demo for speaker identification with portaudio. (#561 )	2024-01-30 11:21:43 +08:00
Fangjun Kuang	a9e7747736	Fix cmake variables to point to the project root directory. (#545 )	2024-01-24 19:21:23 +08:00
Wei Kang	b6c020901a	decoder for open vocabulary keyword spotting (#505 ) * various fixes to ContextGraph to support open vocabulary keywords decoder * Add keyword spotter runtime * Add binary * First version works * Minor fixes * update text2token * default values * Add jni for kws * add kws android project * Minor fixes * Remove unused interface * Minor fixes * Add workflow * handle extra info in texts * Minor fixes * Add more comments * Fix ci * fix cpp style * Add input box in android demo so that users can specify their keywords * Fix cpp style * Fix comments * Minor fixes * Minor fixes * minor fixes * Minor fixes * Minor fixes * Add CI * Fix code style * cpplint * Fix comments * Fix error	2024-01-20 22:52:41 +08:00
Fangjun Kuang	2024e96639	Add C++ runtime for speaker verification models from NeMo (#527 )	2024-01-13 21:42:09 +08:00
Fangjun Kuang	33c03f78b2	Fix CI (#485 )	2023-12-15 10:25:03 +08:00
Fangjun Kuang	9ff6185b7c	fix building linux x86 wheels (#484 )	2023-12-14 21:37:40 +08:00
Fangjun Kuang	b18812ceff	Play generated audio using alsa for TTS (#482 )	2023-12-13 22:28:03 +08:00
Fangjun Kuang	cae0231f93	Fix releasing go packages (#476 )	2023-12-09 00:07:52 +08:00
Fangjun Kuang	99ff6a834c	Play generated audio as it is generating. (#457 )	2023-12-02 15:35:11 +08:00
Fangjun Kuang	62dc3c3e46	Use piper-phonemize to convert text to token IDs (#453 )	2023-11-30 23:57:43 +08:00
Fangjun Kuang	db41778e99	Support piper-phonemize (#452 )	2023-11-28 19:12:58 +08:00
Fangjun Kuang	8444d54c4e	Update to onnxruntime 1.16.3 (#446 )	2023-11-24 14:39:03 +08:00
Fangjun Kuang	eeda1e190e	Build building for iOS (#430 )	2023-11-16 21:14:25 +08:00
Fangjun Kuang	9884cf71e7	Update onnxruntime to v1.16.2 (#421 )	2023-11-12 11:29:33 +08:00
Fangjun Kuang	68f0e59688	Add a C++ example to show streaming VAD + non-streaming ASR. (#420 )	2023-11-11 22:54:27 +08:00
Fangjun Kuang	86baf43c6b	support reading rule FST for Android TTS (#410 )	2023-11-06 10:38:40 +08:00

1 2 3

119 Commits