enginex-mr_series-sherpa-onnx

EngineX-Iluvatar/enginex-mr_series-sherpa-onnx

Archived

Author	SHA1	Message	Date
Fangjun Kuang	b76cd9033a	Support decoding with byte-level BPE (bbpe) models. (#1633 )	2024-12-20 19:21:32 +08:00
Fangjun Kuang	2101227269	Add streaming ASR support for HarmonyOS. (#1565 )	2024-11-26 18:36:56 +08:00
Askars	5f50cbf65a	context_state is not set correctly when previous context is passed after reset (#1393 ) Co-authored-by: vsd-vector <askars.salimbajevs@tilde.lv>	2024-10-03 16:42:09 +08:00
Fangjun Kuang	544857b097	Fix building (#1343 )	2024-09-13 13:33:52 +08:00
lxiao336	65cfa7548a	re-pull-request allow tokens and hotwords be loaded from buffered string driectly (#1339 ) Co-authored-by: xiao <shawl336@163.com>	2024-09-13 09:58:17 +08:00
Askars	fa20ae1552	Preserve previous result as context for next segment (#1335 ) Co-authored-by: vsd-vector <askars.salimbajevs@tilde.lv>	2024-09-11 10:44:13 +08:00
SilverSulfide	888f74bf3c	Re-implement LM rescore for online transducer (#1231 ) Co-authored-by: Martins Kronis <martins.kuznecovs@tilde.lv>	2024-09-06 10:01:25 +08:00
Fangjun Kuang	349d957da2	Add inverse text normalization for online ASR (#1020 )	2024-06-17 18:39:23 +08:00
Fangjun Kuang	e1201225f2	Add Android APK for Korean (#1015 )	2024-06-16 19:17:15 +08:00
Wei Kang	a38881817c	Support customize scores for hotwords (#926 ) * Support customize scores for hotwords * Skip blank lines	2024-05-31 12:34:30 +08:00
Fangjun Kuang	082f230dfb	Fix nemo streaming transducer greedy search (#944 )	2024-05-30 15:31:10 +08:00
Sangeet Sagar	3f472a9993	Add C++ runtime for streaming faster conformer transducer from NeMo. (#889 ) Co-authored-by: sangeet2020 <15uec053@gmail.com>	2024-05-30 13:55:03 +08:00
Fangjun Kuang	50a2eaa41f	Reset encoder states on endpointing for streaming transducer. (#924 )	2024-05-28 17:06:17 +08:00
Wei Kang	b012b78ceb	Encode hotwords in C++ side (#828 ) * Encode hotwords in C++ side	2024-05-20 19:41:36 +08:00
Fangjun Kuang	17cd3a5f01	Add C++ runtime for non-streaming faster conformer transducer from NeMo. (#854 )	2024-05-10 12:15:39 +08:00
Karel Vesely	2e45d327a5	Adding temperature scaling on Joiner logits: (#789 ) * Adding temperature scaling on Joiner logits: - T hard-coded to 2.0 - so far best result NCE 0.122 (still not so high) - the BPE scores were rescaled with 0.2 (but then also incorrect words get high confidence, visually reasonable histograms are for 0.5 scale) - BPE->WORD score merging done by min(.) function (tried also prob-product, and also arithmetic, geometric, harmonic mean) - without temperature scaling (i.e. scale 1.0), the best NCE was 0.032 (here product merging was best) Results seem consistent with: https://arxiv.org/abs/2110.15222 Everything tuned on a very-small set of 100 sentences with 813 words and 10.2% WER, a Czech model. I also experimented with blank posteriors mixed into the BPE confidences, but no NCE improvement found, so not pushing that. Temperature scling added also to the Greedy search confidences. * making `temperature_scale` configurable from outside	2024-04-26 09:44:26 +08:00
Fangjun Kuang	6bf2099781	Fix code style issues (#774 )	2024-04-16 09:46:15 +08:00
Manix	fb4aee83ac	Adding warm up for Zipformer2 (#766 ) Signed-off-by: manickavela1998@gmail.com <manickavela1998@gmail.com>	2024-04-16 09:16:55 +08:00
Karel Vesely	eaec4c83c2	Configurable low_freq high_freq, dithering (#664 )	2024-03-22 21:41:44 +08:00
Karel Vesely	38c072dcb2	Track token scores (#571 ) * add export of per-token scores (ys, lm, context) - for best path of the modified-beam-search decoding of transducer * refactoring JSON export of OnlineRecognitionResult, extending pybind11 API of OnlineRecognitionResult * export per-token scores also for greedy-search (online-transducer) - export un-scaled lm_probs (modified-beam search, online-transducer) - polishing * fill lm_probs/context_scores only if LM/ContextGraph is present (make Result smaller)	2024-02-29 06:28:45 +08:00
Karel Vesely	3f2a17ef47	Fixes issue #535 , fix hexa 1-char tokens in ASR output. (#550 ) - Avoid output like : `[' K', '<0x64>', '<0x79>', 'ť', ' a', '<0x75>', 'to', 'bu', '<0x73>', '<0x75>', ... ]` with regular 500 BPE units. - Don't rewrite 1-char tokens in range [ 0x20 (space) .. 0x7E (tilde) ]	2024-01-26 19:23:20 +08:00
chiiyeh	e7b18a2139	add blank_penalty for online transducer (#548 )	2024-01-26 12:12:13 +08:00
Fangjun Kuang	e215d0c39a	Fix Byte BPE string results for Python. (#512 ) It ignores invalid UTF8 strings.	2024-01-03 16:03:24 +08:00
Fangjun Kuang	98b67ad850	Fix reading hotwords file for android (#354 )	2023-10-11 12:20:50 +08:00
Fangjun Kuang	33a5765169	Print a more user-friendly error message when using --hotwords-file. (#344 )	2023-09-26 11:04:20 +08:00
Fangjun Kuang	552a267c23	Set is_final and start_time for online websocket server. (#342 ) * Set is_final and start_time for online websocket server. * Convert timestamps to a json array	2023-09-25 15:12:07 +08:00
Wei Kang	47184f9db7	Refactor hotwords，support loading hotwords from file (#296 )	2023-09-14 19:33:17 +08:00
Fangjun Kuang	a12ebfab22	treat unk as blank (#299 )	2023-09-07 15:12:29 +08:00
Fangjun Kuang	a0a747a0c0	add endpointing for online websocket server (#294 )	2023-08-31 14:41:04 +08:00
Fangjun Kuang	6038e2aa62	Support streaming paraformer (#263 )	2023-08-14 10:32:14 +08:00
frankyoujian	9dcad7e963	Reinitialize context state after Reset stream when using contexts (#257 )	2023-08-10 14:19:40 +08:00
Fangjun Kuang	79c2ce5dd4	Refactor online recognizer (#250 ) * Refactor online recognizer. Make it easier to support other streaming models. Note that it is a breaking change for the Python API. `sherpa_onnx.OnlineRecognizer()` used before should be replaced by `sherpa_onnx.OnlineRecognizer.from_transducer()`.	2023-08-09 20:27:31 +08:00

32 Commits