enginex_bi_series-sherpa-onnx

EngineX-Iluvatar/enginex_bi_series-sherpa-onnx

Archived

Author	SHA1	Message	Date
Fangjun Kuang	fd9a687ec2	Add Pascal/Go/C#/Dart API for NeMo Canary ASR models (#2367 ) Add support for the new NeMo Canary ASR model across multiple language bindings by introducing a Canary model configuration and setter method on the offline recognizer. - Define Canary model config in Pascal, Go, C#, Dart and update converter functions - Add SetConfig API for offline recognizer (Pascal, Go, C#, Dart) - Extend CI/workflows and example scripts to test non-streaming Canary decoding	2025-07-10 14:53:33 +08:00
Fangjun Kuang	3bf986d08d	Support non-streaming zipformer CTC ASR models (#2340 ) This PR adds support for non-streaming Zipformer CTC ASR models across multiple language bindings, WebAssembly, examples, and CI workflows. - Introduces a new OfflineZipformerCtcModelConfig in C/C++, Python, Swift, Java, Kotlin, Go, Dart, Pascal, and C# APIs - Updates initialization, freeing, and recognition logic to include Zipformer CTC in WASM and Node.js - Adds example scripts and CI steps for downloading, building, and running Zipformer CTC models Model doc is available at https://k2-fsa.github.io/sherpa/onnx/pretrained_models/offline-ctc/icefall/zipformer.html	2025-07-04 15:57:07 +08:00
Fangjun Kuang	282211c01f	Remove portaudio-go in Go API examples. (#2317 ) Replace the deprecated portaudio-go integration with malgo in the Go real-time speech recognition example and correct version string typos in the Node.js examples. - Fixed “verison” typo in Node.js console logs. - Swapped out portaudio-go for malgo in the Go microphone example, introducing initRecognizer, callback-driven streaming, and sample conversion. - Removed portaudio-go from go.mod.	2025-06-26 11:33:50 +08:00
Fangjun Kuang	bda427f4b2	Add API to get version information (#2309 )	2025-06-25 00:22:21 +08:00
愚者自愚	116977b5d4	Add Go implementation of the TTS generation callback (#2213 )	2025-05-14 16:09:31 +08:00
Fangjun Kuang	fcb4c4eb2c	Add Go API for homophone replacer (#2168 )	2025-04-30 23:47:38 +08:00
Fangjun Kuang	ba7d8b63f0	Add Go API for Dolphin CTC models (#2090 )	2025-04-03 00:02:09 +08:00
Jov	ef759b7b8b	fix case (#2037 ) v should be V	2025-03-21 16:46:13 +08:00
Jov	572c8d292c	fix vits dict dir config (#2036 )	2025-03-21 16:30:54 +08:00
Fangjun Kuang	0aacf02dd8	Add C++ runtime for vocos (#2014 )	2025-03-17 17:05:15 +08:00
Fangjun Kuang	d78f408362	Add Go API for speech enhancement GTCRN models (#1991 )	2025-03-11 19:33:05 +08:00
franck-li	0dcaf3a061	go.mod set to use go 1.17, and use unsafe.Slice to optimize the code (#1920 ) Co-authored-by: liyuzhi <liyuzhi@info.easeus.com.cn>	2025-02-25 15:31:15 +08:00
Fangjun Kuang	87a968b55d	Add Go API for FireRedAsr AED Model (#1879 )	2025-02-17 16:04:07 +08:00
Fangjun Kuang	f5bf8c8d4a	Add Go API for audio tagging (#1840 )	2025-02-11 12:07:28 +08:00
Fangjun Kuang	e1a88a799f	Add Go API for Kokoro TTS 1.0 (#1804 )	2025-02-07 15:18:02 +08:00
Fangjun Kuang	8b989a851c	Fix keyword spotting. (#1689 ) Reset the stream right after detecting a keyword	2025-01-20 16:41:10 +08:00
Fangjun Kuang	2086f8c55b	Add Go API for Kokoro TTS models (#1722 )	2025-01-16 17:35:31 +08:00
Fangjun Kuang	46330b25cc	Add Go API for MatchaTTS models (#1685 )	2025-01-06 08:03:03 +08:00
Fangjun Kuang	49154c957b	Add Go API for Keyword spotting (#1662 )	2024-12-31 11:25:32 +08:00
windy	0f4b1f41e2	🔧 build(portaudio-go): Fixed version 1.0.3 (#1614 ) Co-authored-by: windy <deretame123@gmail.com>	2024-12-12 19:39:43 +08:00
Fangjun Kuang	3d3edabb5f	Add Go API for Moonshine models (#1479 )	2024-10-27 09:39:09 +08:00
Fangjun Kuang	052b8645ba	Add Go API examples for adding punctuations to text. (#1478 )	2024-10-27 09:04:05 +08:00
Fangjun Kuang	df681e9807	Go API for speaker diarization (#1403 )	2024-10-09 20:10:44 +08:00
Fangjun Kuang	e7ffcbd677	Add APIs about max speech duration in VAD for various programming languages (#1349 )	2024-09-14 12:30:13 +08:00
Emmanuel Schmidbauer	a8556e31ba	add Tokens []string, Timestamps []float32, Lang string, Emotion string, Event string (#1277 )	2024-08-27 06:35:59 +08:00
Fangjun Kuang	8f4d332aab	Add Go API for SenseVoice (#1154 )	2024-07-20 23:41:53 +08:00
Fangjun Kuang	b5093e27f9	Fix publishing apks to huggingface (#1121 ) Save APKs for each release in a separate directory. Huggingface requires that each directory cannot contain more than 1000 files. Since we have so many tts models and for each model we need to build APKs of 4 different ABIs, it is a workaround for the huggingface's constraint by placing them into separate directories for different releases.	2024-07-13 16:14:00 +08:00
Fangjun Kuang	dd0ff2ca06	Support onnxruntime 1.18.0 (#906 )	2024-07-10 17:05:26 +08:00
Fangjun Kuang	6789c909d2	Inverse text normalization API of streaming ASR for various programming languages (#1022 )	2024-06-18 13:42:17 +08:00
Fangjun Kuang	6e09933d99	Inverse text normalization API for other programming languages (#1019 )	2024-06-17 17:02:39 +08:00
Fangjun Kuang	fd5a0d1e00	Add C++ runtime for Tele-AI/TeleSpeech-ASR (#970 )	2024-06-05 00:26:40 +08:00
Fangjun Kuang	fdcae56a14	Fix Go tests (#897 )	2024-05-21 11:50:13 +08:00
Fangjun Kuang	68b8b88b5a	Add Python API for punctuation models. (#762 )	2024-04-13 13:28:17 +08:00
Fangjun Kuang	f20291cadc	Support audio tagging using zipformer (#747 )	2024-04-10 14:47:06 +08:00
Fangjun Kuang	c9ae7595d5	Fix go API examples with portaudio on Windows. (#746 )	2024-04-10 09:56:35 +08:00
Fangjun Kuang	a5f8fbc83f	Support heteronyms in Chinese TTS (#738 )	2024-04-08 11:01:30 +08:00
Fangjun Kuang	dbff2eaadb	Add C API for streaming HLG decoding (#734 )	2024-04-05 10:31:20 +08:00
Fangjun Kuang	43af1e6951	Release v1.9.15 (#719 )	2024-03-29 19:58:04 +08:00
Fangjun Kuang	6da4a1c12f	Add Go API for speaker identification (#718 )	2024-03-29 19:25:55 +08:00
Fangjun Kuang	a042f44076	Add Golang API for spoken language identification. (#709 )	2024-03-27 19:40:25 +08:00
Fangjun Kuang	69c7880c4d	Add Golang API for VAD (#708 )	2024-03-27 12:09:39 +08:00
Fangjun Kuang	acf0975153	Support whisper language/task in various language bindings. (#679 )	2024-03-20 16:43:35 +08:00
Fangjun Kuang	e475e750ac	Support streaming zipformer CTC (#496 ) * Support streaming zipformer CTC * test online zipformer2 CTC * Update doc of sherpa-onnx.cc * Add Python APIs for streaming zipformer2 ctc * Add Python API examples for streaming zipformer2 ctc * Swift API for streaming zipformer2 CTC * NodeJS API for streaming zipformer2 CTC * Kotlin API for streaming zipformer2 CTC * Golang API for streaming zipformer2 CTC * C# API for streaming zipformer2 CTC * Release v1.9.6	2023-12-22 13:46:33 +08:00
Fangjun Kuang	cae0231f93	Fix releasing go packages (#476 )	2023-12-09 00:07:52 +08:00
Fangjun Kuang	1937717705	Add MFC TTS example on Windows (#378 )	2023-10-21 00:13:07 +08:00
Fangjun Kuang	a69d0a950e	Add Go API for TTS (#377 )	2023-10-20 15:57:52 +08:00
Fangjun Kuang	86b18184c9	Fix Go examples (#300 )	2023-09-07 15:27:41 +08:00
Fangjun Kuang	e31f9e48c2	Fix various language binding APIs for tdnn and whisper models (#278 )	2023-08-16 22:15:10 +08:00
Fangjun Kuang	f709c95c5f	Support multilingual whisper models (#274 )	2023-08-16 00:28:52 +08:00
Fangjun Kuang	bc791d4996	Fix C api for Go and MFC to support streaming paraformer (#268 )	2023-08-14 17:02:23 +08:00

1 2

54 Commits