Commit Graph

60 Commits

Author SHA1 Message Date
Fangjun Kuang
2b2788332e Add C++ support for UVR models (#2269) 2025-06-01 17:22:08 +08:00
Fangjun Kuang
0de7e1b9f0 Add C++ and Python API for Dolphin CTC models (#2085) 2025-04-02 19:09:00 +08:00
Fangjun Kuang
488a6e687c Add C++ runtime for speech enhancement GTCRN models (#1977)
See also https://github.com/Xiaobin-Rong/gtcrn
2025-03-10 18:11:16 +08:00
Fangjun Kuang
eebe19997d Build wheels for rknn linux aarch64 (#1928) 2025-02-26 18:58:57 +08:00
Fangjun Kuang
dc2f7e9f9b Fix publishing linux pre-built artifacts (#1919) 2025-02-25 15:22:50 +08:00
Fangjun Kuang
930986b06c Release v1.10.38 (#1688) 2025-01-06 11:31:18 +08:00
Fangjun Kuang
2c2926af7d Add C++ runtime for Matcha-TTS (#1627) 2024-12-31 12:44:14 +08:00
Fangjun Kuang
669f5ef441 Add C++ runtime and Python APIs for Moonshine models (#1473) 2024-10-26 14:34:07 +08:00
Fangjun Kuang
b41f6d2c94 Support GigaAM CTC models for Russian ASR (#1464)
See also https://github.com/salute-developers/GigaAM
2024-10-25 10:55:16 +08:00
Fangjun Kuang
ceb69ebd94 Add C++ API for non-streaming ASR (#1456) 2024-10-23 16:40:12 +08:00
Fangjun Kuang
effd5ef2be Add C++ API for streaming ASR. (#1455)
It is a wrapper around the C API.
2024-10-23 12:07:43 +08:00
Fangjun Kuang
59407edcad C++ API for speaker diarization (#1396) 2024-10-09 12:01:20 +08:00
Fangjun Kuang
576a3aa90d Add non-streaming ONNX models for Russian ASR (#1358) 2024-09-18 13:43:49 +08:00
Fangjun Kuang
ca729faebf Support reading multi-channel wave files with 8/16/32-bit encoded samples (#1258) 2024-08-15 14:54:43 +08:00
Fangjun Kuang
9ee2943ed4 Add CI tests for online punctuation models (#1226) 2024-08-06 18:10:30 +08:00
Fangjun Kuang
35c1b4a7a9 Add ReazonSpeech Japanese pre-trained model (#1203) 2024-08-02 10:21:24 +08:00
Fangjun Kuang
a25075101c Build sherpa-onnx as a single shared library (#1078)
When `-D BUILD_SHARED_LIBS=ON` is passed to `cmake`, it builds a single shared library.

Specifically, 

- For C APIs, it builds `libsherpa-onnx-c-api.so`
- For Python APIs, it builds `_sherpa_onnx.cpython-xx-xx.so`
- For Kotlin and Java APIs, it builds `libsherpa-onnx-jni.so`

There is no `libsherpa-onnx-core.so` any longer.

Note it affects only shared libraries.
2024-07-06 16:41:54 +08:00
Fangjun Kuang
6b7ced2317 Fix the alsa-lib version to v1.2.12 (#1048) 2024-06-23 20:20:38 +08:00
Fangjun Kuang
dd69a1b56b Release v1.9.30 (#1016) 2024-06-17 11:57:38 +08:00
Fangjun Kuang
fd5a0d1e00 Add C++ runtime for Tele-AI/TeleSpeech-ASR (#970) 2024-06-05 00:26:40 +08:00
Fangjun Kuang
4f21aabd3c Fix CI for JavaScript and Python APIs. (#901) 2024-05-22 13:57:00 +08:00
Fangjun Kuang
b445956675 Fix CI tests. (#898) 2024-05-21 20:37:29 +08:00
Fangjun Kuang
46e4e5b7ac Add C++ support for streaming NeMo CTC models. (#857) 2024-05-10 16:26:43 +08:00
Fangjun Kuang
17cd3a5f01 Add C++ runtime for non-streaming faster conformer transducer from NeMo. (#854) 2024-05-10 12:15:39 +08:00
Fangjun Kuang
5d8c35e44e Add C++ support for non-streaming NeMo fast conformer hybrid transducer ctc (the ctc branch) (#848) 2024-05-09 15:32:22 +08:00
Fangjun Kuang
13730ecbd8 Add C API for punctuation (#768) 2024-04-14 19:02:34 +08:00
Fangjun Kuang
329fe1aa8b Support adding punctuations to the speech recogntion result (#761) 2024-04-13 12:15:57 +08:00
Fangjun Kuang
f204e62b44 Add C API for audio tagging (#754) 2024-04-11 14:18:43 +08:00
Fangjun Kuang
f20291cadc Support audio tagging using zipformer (#747) 2024-04-10 14:47:06 +08:00
Fangjun Kuang
db67e00c77 Add HLG decoding for streaming CTC models (#731) 2024-04-03 21:31:42 +08:00
Fangjun Kuang
2e0bccad36 Add C API for speaker embedding extractor. (#711) 2024-03-28 18:05:40 +08:00
Fangjun Kuang
4e040c596e Support including TTS conditionally. (#699) 2024-03-26 17:21:35 +08:00
Fangjun Kuang
ab7cff2513 Add C API for spoken language identification. (#695) 2024-03-25 15:16:47 +08:00
Fangjun Kuang
0d258dd150 Support spoken language identification with whisper (#694) 2024-03-24 22:57:00 +08:00
Fangjun Kuang
3cdad9b5d1 Use manylinux in CI test (#692) 2024-03-24 07:54:32 +08:00
Fangjun Kuang
1952772654 Add timestamps and tokens for .Net's online models. (#690) 2024-03-23 18:51:56 +08:00
Fangjun Kuang
13260cdf49 Use self-compiled onnxruntime shared lib. (#635) 2024-03-06 11:03:24 +08:00
Fangjun Kuang
25079b5c05 Fix CI tests. (#596) 2024-02-21 15:37:27 +08:00
Fangjun Kuang
12e5225401 Fix CI warnings (#590) 2024-02-20 15:28:47 +08:00
Fangjun Kuang
c68f39bd3c Use onnxruntime static lib compiled with gcc8 on ubuntu 20.04 (#587) 2024-02-20 09:31:37 +08:00
Wei Kang
2ff1049079 change modelscope link to github for build-kws-apki (#540) 2024-01-24 16:40:14 +08:00
Wei Kang
b6c020901a decoder for open vocabulary keyword spotting (#505)
* various fixes to ContextGraph to support open vocabulary keywords decoder

* Add keyword spotter runtime

* Add binary

* First version works

* Minor fixes

* update text2token

* default values

* Add jni for kws

* add kws android project

* Minor fixes

* Remove unused interface

* Minor fixes

* Add workflow

* handle extra info in texts

* Minor fixes

* Add more comments

* Fix ci

* fix cpp style

* Add input box in android demo so that users can specify their keywords

* Fix cpp style

* Fix comments

* Minor fixes

* Minor fixes

* minor fixes

* Minor fixes

* Minor fixes

* Add CI

* Fix code style

* cpplint

* Fix comments

* Fix error
2024-01-20 22:52:41 +08:00
Fangjun Kuang
e475e750ac Support streaming zipformer CTC (#496)
* Support streaming zipformer CTC

* test online zipformer2 CTC

* Update doc of sherpa-onnx.cc

* Add Python APIs for streaming zipformer2 ctc

* Add Python API examples for streaming zipformer2 ctc

* Swift API for streaming zipformer2 CTC

* NodeJS API for streaming zipformer2 CTC

* Kotlin API for streaming zipformer2 CTC

* Golang API for streaming zipformer2 CTC

* C# API for streaming zipformer2 CTC

* Release v1.9.6
2023-12-22 13:46:33 +08:00
Fangjun Kuang
3ae984f148 Remove the 30-second constraint from whisper. (#471) 2023-12-07 17:47:08 +08:00
Fangjun Kuang
ac00edab5b Build MFC examples for Windows x86 (Win32) (#434)
Also, strip binaries on Linux before uploading.
2023-11-18 16:13:09 +08:00
Fangjun Kuang
fac4f6bc7c Support streaming conformer CTC models from wenet (#427) 2023-11-16 10:35:23 +08:00
Fangjun Kuang
b83b3e3cd1 Support non-streaming WeNet CTC models. (#426) 2023-11-15 14:23:20 +08:00
Fangjun Kuang
2a932aca86 Add ccache to CI (#375) 2023-10-20 11:17:37 +08:00
Fangjun Kuang
ea7c45b60c Add C API for offline TTS. (#373) 2023-10-19 17:38:23 +08:00
Fangjun Kuang
532ed142d2 Support linking onnxruntime lib statically on Linux (#326) 2023-09-21 10:15:42 +08:00