Fangjun Kuang
|
316424b382
|
Add C++ and Python API for FireRedASR AED models (#1867)
|
2025-02-16 22:45:24 +08:00 |
|
Fangjun Kuang
|
b6f0f5fc2e
|
Support removing invalid utf-8 sequences. (#1648)
|
2024-12-25 19:32:13 +08:00 |
|
Fangjun Kuang
|
2101227269
|
Add streaming ASR support for HarmonyOS. (#1565)
|
2024-11-26 18:36:56 +08:00 |
|
Fangjun Kuang
|
298b6b6fda
|
Add non-streaming ASR support for HarmonyOS. (#1564)
|
2024-11-26 16:38:35 +08:00 |
|
Fangjun Kuang
|
6ee8c99c5d
|
Fix building (#1508)
|
2024-11-03 19:47:04 +08:00 |
|
Fangjun Kuang
|
9ab89c33bc
|
Support building GPU-capable sherpa-onnx on Linux aarch64. (#1500)
Thanks to @Peakyxh for providing pre-built onnxruntime libraries
with CUDA support for Linux aarch64.
Tested on Jetson nano b01
|
2024-11-01 11:16:28 +08:00 |
|
Fangjun Kuang
|
669f5ef441
|
Add C++ runtime and Python APIs for Moonshine models (#1473)
|
2024-10-26 14:34:07 +08:00 |
|
Fangjun Kuang
|
707cf792c5
|
Add GigaAM NeMo transducer model for Russian ASR (#1467)
|
2024-10-25 15:20:13 +08:00 |
|
Fangjun Kuang
|
b41f6d2c94
|
Support GigaAM CTC models for Russian ASR (#1464)
See also https://github.com/salute-developers/GigaAM
|
2024-10-25 10:55:16 +08:00 |
|
Fangjun Kuang
|
1a471595a5
|
Fix Android build (#1161)
|
2024-07-22 09:27:30 +08:00 |
|
Fangjun Kuang
|
25f0a10468
|
Add C++ runtime for SenseVoice models (#1148)
|
2024-07-18 22:54:18 +08:00 |
|
ivan provalov
|
de04b3b9bf
|
Allow modify model config at decode time for ASR (#1124)
|
2024-07-13 22:30:47 +08:00 |
|
Fangjun Kuang
|
6789c909d2
|
Inverse text normalization API of streaming ASR for various programming languages (#1022)
|
2024-06-18 13:42:17 +08:00 |
|
Fangjun Kuang
|
b0f7ed3ee3
|
Add inverse text normalization for non-streaming ASR (#1017)
|
2024-06-17 14:28:53 +08:00 |
|
Fangjun Kuang
|
fd5a0d1e00
|
Add C++ runtime for Tele-AI/TeleSpeech-ASR (#970)
|
2024-06-05 00:26:40 +08:00 |
|
Fangjun Kuang
|
17cd3a5f01
|
Add C++ runtime for non-streaming faster conformer transducer from NeMo. (#854)
|
2024-05-10 12:15:39 +08:00 |
|
Fangjun Kuang
|
5d8c35e44e
|
Add C++ support for non-streaming NeMo fast conformer hybrid transducer ctc (the ctc branch) (#848)
|
2024-05-09 15:32:22 +08:00 |
|
Fangjun Kuang
|
d364610605
|
Use a single thread when loading models (#703)
|
2024-03-26 13:35:33 +08:00 |
|
Fangjun Kuang
|
b83b3e3cd1
|
Support non-streaming WeNet CTC models. (#426)
|
2023-11-15 14:23:20 +08:00 |
|
Fangjun Kuang
|
407602445d
|
Add CTC HLG decoding using OpenFst (#349)
|
2023-10-08 11:32:39 +08:00 |
|
Fangjun Kuang
|
debab7c091
|
Add two-pass speech recognition Android/iOS demo (#304)
|
2023-09-12 15:40:16 +08:00 |
|
Fangjun Kuang
|
a4bff28e21
|
Support TDNN models from the yesno recipe from icefall (#262)
|
2023-08-12 19:50:22 +08:00 |
|
Fangjun Kuang
|
45b9d4ab37
|
Support whisper models (#238)
|
2023-08-07 12:34:18 +08:00 |
|
Fangjun Kuang
|
f3206c49dc
|
Reduce model initialization time for offline speech recognition (#213)
|
2023-07-14 18:07:27 +08:00 |
|
Fangjun Kuang
|
d82251cba6
|
Support the latest offline zipformer model (#180)
|
2023-06-19 14:09:24 +08:00 |
|
Fangjun Kuang
|
80060c276d
|
Begin to support CTC models (#119)
Please see https://k2-fsa.github.io/sherpa/onnx/pretrained_models/offline-ctc/nemo/index.html for a list of pre-trained CTC models from NeMo.
|
2023-04-07 23:11:34 +08:00 |
|
Jason's Lab
|
5e5620ea23
|
add zipformer model_type (#97)
Co-authored-by: lichao <www.563042811@qq.com>
|
2023-03-29 20:36:39 +08:00 |
|
Fangjun Kuang
|
423d89e9a5
|
Support paraformer. (#95)
|
2023-03-28 17:59:54 +08:00 |
|
Fangjun Kuang
|
dffb0fd43c
|
Refactor offline recognizer. (#94)
* Refactor offline recognizer.
The purpose is to make it easier to support different types of models.
|
2023-03-27 14:59:40 +08:00 |
|