Fangjun Kuang
0de7e1b9f0
Add C++ and Python API for Dolphin CTC models ( #2085 )
2025-04-02 19:09:00 +08:00
Fangjun Kuang
488a6e687c
Add C++ runtime for speech enhancement GTCRN models ( #1977 )
...
See also https://github.com/Xiaobin-Rong/gtcrn
2025-03-10 18:11:16 +08:00
Fangjun Kuang
2f9a2b20a1
Fix publishing macos pre-built artifacts ( #1922 )
2025-02-26 11:52:01 +08:00
Fangjun Kuang
930986b06c
Release v1.10.38 ( #1688 )
2025-01-06 11:31:18 +08:00
Fangjun Kuang
2c2926af7d
Add C++ runtime for Matcha-TTS ( #1627 )
2024-12-31 12:44:14 +08:00
Fangjun Kuang
38d64a6d81
Fix building macOS libs ( #1656 )
2024-12-27 18:15:41 +08:00
Fangjun Kuang
669f5ef441
Add C++ runtime and Python APIs for Moonshine models ( #1473 )
2024-10-26 14:34:07 +08:00
Fangjun Kuang
ceb69ebd94
Add C++ API for non-streaming ASR ( #1456 )
2024-10-23 16:40:12 +08:00
Fangjun Kuang
effd5ef2be
Add C++ API for streaming ASR. ( #1455 )
...
It is a wrapper around the C API.
2024-10-23 12:07:43 +08:00
Fangjun Kuang
59407edcad
C++ API for speaker diarization ( #1396 )
2024-10-09 12:01:20 +08:00
Fangjun Kuang
576a3aa90d
Add non-streaming ONNX models for Russian ASR ( #1358 )
2024-09-18 13:43:49 +08:00
Fangjun Kuang
9ee2943ed4
Add CI tests for online punctuation models ( #1226 )
2024-08-06 18:10:30 +08:00
Fangjun Kuang
35c1b4a7a9
Add ReazonSpeech Japanese pre-trained model ( #1203 )
2024-08-02 10:21:24 +08:00
Fangjun Kuang
a25075101c
Build sherpa-onnx as a single shared library ( #1078 )
...
When `-D BUILD_SHARED_LIBS=ON` is passed to `cmake`, it builds a single shared library.
Specifically,
- For C APIs, it builds `libsherpa-onnx-c-api.so`
- For Python APIs, it builds `_sherpa_onnx.cpython-xx-xx.so`
- For Kotlin and Java APIs, it builds `libsherpa-onnx-jni.so`
There is no `libsherpa-onnx-core.so` any longer.
Note it affects only shared libraries.
2024-07-06 16:41:54 +08:00
Fangjun Kuang
dd69a1b56b
Release v1.9.30 ( #1016 )
2024-06-17 11:57:38 +08:00
Fangjun Kuang
fd5a0d1e00
Add C++ runtime for Tele-AI/TeleSpeech-ASR ( #970 )
2024-06-05 00:26:40 +08:00
Fangjun Kuang
17cd3a5f01
Add C++ runtime for non-streaming faster conformer transducer from NeMo. ( #854 )
2024-05-10 12:15:39 +08:00
Fangjun Kuang
5d8c35e44e
Add C++ support for non-streaming NeMo fast conformer hybrid transducer ctc (the ctc branch) ( #848 )
2024-05-09 15:32:22 +08:00
Fangjun Kuang
13730ecbd8
Add C API for punctuation ( #768 )
2024-04-14 19:02:34 +08:00
Fangjun Kuang
329fe1aa8b
Support adding punctuations to the speech recogntion result ( #761 )
2024-04-13 12:15:57 +08:00
Fangjun Kuang
f204e62b44
Add C API for audio tagging ( #754 )
2024-04-11 14:18:43 +08:00
Fangjun Kuang
f20291cadc
Support audio tagging using zipformer ( #747 )
2024-04-10 14:47:06 +08:00
Fangjun Kuang
2e0bccad36
Add C API for speaker embedding extractor. ( #711 )
2024-03-28 18:05:40 +08:00
Fangjun Kuang
4e040c596e
Support including TTS conditionally. ( #699 )
2024-03-26 17:21:35 +08:00
Fangjun Kuang
ab7cff2513
Add C API for spoken language identification. ( #695 )
2024-03-25 15:16:47 +08:00
Fangjun Kuang
0d258dd150
Support spoken language identification with whisper ( #694 )
2024-03-24 22:57:00 +08:00
Fangjun Kuang
1952772654
Add timestamps and tokens for .Net's online models. ( #690 )
2024-03-23 18:51:56 +08:00
Fangjun Kuang
24f437a6f1
Refactor github actions tests ( #688 )
2024-03-22 21:22:42 +08:00
Wei Kang
2ff1049079
change modelscope link to github for build-kws-apki ( #540 )
2024-01-24 16:40:14 +08:00
Wei Kang
b6c020901a
decoder for open vocabulary keyword spotting ( #505 )
...
* various fixes to ContextGraph to support open vocabulary keywords decoder
* Add keyword spotter runtime
* Add binary
* First version works
* Minor fixes
* update text2token
* default values
* Add jni for kws
* add kws android project
* Minor fixes
* Remove unused interface
* Minor fixes
* Add workflow
* handle extra info in texts
* Minor fixes
* Add more comments
* Fix ci
* fix cpp style
* Add input box in android demo so that users can specify their keywords
* Fix cpp style
* Fix comments
* Minor fixes
* Minor fixes
* minor fixes
* Minor fixes
* Minor fixes
* Add CI
* Fix code style
* cpplint
* Fix comments
* Fix error
2024-01-20 22:52:41 +08:00
Fangjun Kuang
fac4f6bc7c
Support streaming conformer CTC models from wenet ( #427 )
2023-11-16 10:35:23 +08:00
Fangjun Kuang
b80b7e5144
Support linking onnxruntime statically for macOS ( #403 )
2023-10-31 20:24:43 +08:00
Fangjun Kuang
2a932aca86
Add ccache to CI ( #375 )
2023-10-20 11:17:37 +08:00
Fangjun Kuang
ea7c45b60c
Add C API for offline TTS. ( #373 )
2023-10-19 17:38:23 +08:00
Fangjun Kuang
ffeff3b8a3
Fix a typo for Go ( #298 )
2023-09-07 11:29:00 +08:00
Fangjun Kuang
496c5dd7f5
Fix pip install on Windows for Python 3.11 ( #271 )
2023-08-14 22:22:46 +08:00
Fangjun Kuang
6038e2aa62
Support streaming paraformer ( #263 )
2023-08-14 10:32:14 +08:00
Fangjun Kuang
f7c05b1570
Add CI test for Whisper models ( #239 )
2023-08-07 19:24:52 +08:00
Fangjun Kuang
0b5fa24134
Upload pre-compiled binaries and libraries on release ( #206 )
2023-07-10 17:59:08 +08:00
Fangjun Kuang
80060c276d
Begin to support CTC models ( #119 )
...
Please see https://k2-fsa.github.io/sherpa/onnx/pretrained_models/offline-ctc/nemo/index.html for a list of pre-trained CTC models from NeMo.
2023-04-07 23:11:34 +08:00
Fangjun Kuang
f69b4e9396
Add tests for zipformer ( #106 )
2023-03-31 04:47:20 +08:00
Fangjun Kuang
5572246253
Add non-streaming ASR ( #92 )
2023-03-26 08:53:42 +08:00
Fangjun Kuang
c63c4c3389
C api ( #60 )
2023-02-24 16:42:46 +08:00
Fangjun Kuang
ea09d5fbc5
Add Python API ( #31 )
2023-02-19 19:36:03 +08:00
Fangjun Kuang
cb8f85ff83
Add online LSTM transducer model ( #25 )
2023-02-18 21:35:15 +08:00