Commit Graph

135 Commits

Author SHA1 Message Date
Fangjun Kuang
117cd7bb8c Support whisper large/large-v1/large-v2/large-v3 and distil-large-v2 (#1114) 2024-07-12 23:47:39 +08:00
Fangjun Kuang
1c104ea847 Update onnxruntime from v1.18.0 to v1.18.1 (#1107) 2024-07-11 09:35:28 +08:00
Fangjun Kuang
dd0ff2ca06 Support onnxruntime 1.18.0 (#906) 2024-07-10 17:05:26 +08:00
Fangjun Kuang
a25075101c Build sherpa-onnx as a single shared library (#1078)
When `-D BUILD_SHARED_LIBS=ON` is passed to `cmake`, it builds a single shared library.

Specifically, 

- For C APIs, it builds `libsherpa-onnx-c-api.so`
- For Python APIs, it builds `_sherpa_onnx.cpython-xx-xx.so`
- For Kotlin and Java APIs, it builds `libsherpa-onnx-jni.so`

There is no `libsherpa-onnx-core.so` any longer.

Note it affects only shared libraries.
2024-07-06 16:41:54 +08:00
Fangjun Kuang
598c12c4e5 Fix CI tests (#1061) 2024-06-27 18:05:18 +08:00
hantengc
dfca4500aa Resolve issues with using pre-installed onnxruntime (#1058)
There is an issue with the directory, ${location_onnxruntime_header_dir}
may be NOT-FOUND
2024-06-25 16:21:36 +08:00
Fangjun Kuang
a11c859971 Support clang-tidy (#1034) 2024-06-19 20:51:57 +08:00
Fangjun Kuang
6789c909d2 Inverse text normalization API of streaming ASR for various programming languages (#1022) 2024-06-18 13:42:17 +08:00
Fangjun Kuang
d08cc04567 Add VAD example for Dart API (#996) 2024-06-14 10:37:16 +08:00
Fangjun Kuang
fd5a0d1e00 Add C++ runtime for Tele-AI/TeleSpeech-ASR (#970) 2024-06-05 00:26:40 +08:00
Fangjun Kuang
f1cff83ef9 Add address sanitizer and undefined behavior sanitizer (#951) 2024-05-31 13:17:01 +08:00
Fangjun Kuang
86ccb86006 fix building errors introduced by simple-sentencepiece (#915) 2024-05-24 14:48:04 +08:00
Fangjun Kuang
b09fb7f5cd Support Windows arm64 (#911) 2024-05-24 11:18:14 +08:00
Fangjun Kuang
2db777587e Fix CI tests. (#907) 2024-05-23 14:49:37 +08:00
Fangjun Kuang
4f21aabd3c Fix CI for JavaScript and Python APIs. (#901) 2024-05-22 13:57:00 +08:00
Wei Kang
b012b78ceb Encode hotwords in C++ side (#828)
* Encode hotwords in C++ side
2024-05-20 19:41:36 +08:00
Fangjun Kuang
4f758e6cd3 Publish node-addon-api wrapper for sherpa-onnx as npm packages (#829) 2024-05-04 13:27:39 +08:00
Fangjun Kuang
c7691650d7 Fix CI tests (#804) 2024-04-24 13:01:06 +08:00
Fangjun Kuang
6b353bfb42 Add jieba for Chinese TTS models (#797) 2024-04-21 14:47:13 +08:00
Fangjun Kuang
329fe1aa8b Support adding punctuations to the speech recogntion result (#761) 2024-04-13 12:15:57 +08:00
Fangjun Kuang
34d70a259f Add Python API and Python examples for audio tagging (#753) 2024-04-11 11:12:48 +08:00
布宝
d21c45d0ea Add --continue to wget (#750)
Also, switch to github mirror
2024-04-11 09:07:31 +08:00
Fangjun Kuang
042976ea6e Add C++ microphone examples for audio tagging (#749) 2024-04-10 21:00:35 +08:00
Fangjun Kuang
f20291cadc Support audio tagging using zipformer (#747) 2024-04-10 14:47:06 +08:00
Fangjun Kuang
db1b3ab1f3 Fix building OpenFst on Windows. (#744) 2024-04-09 11:17:46 +08:00
Fangjun Kuang
0d90b34e4a Support Chinese heteronyms on Android for TTS. (#742) 2024-04-08 21:36:47 +08:00
Fangjun Kuang
6fb8ceda57 Add VAD examples using ALSA for recording (#739) 2024-04-08 16:41:01 +08:00
Fangjun Kuang
a5f8fbc83f Support heteronyms in Chinese TTS (#738) 2024-04-08 11:01:30 +08:00
Fangjun Kuang
dbff2eaadb Add C API for streaming HLG decoding (#734) 2024-04-05 10:31:20 +08:00
Fangjun Kuang
db67e00c77 Add HLG decoding for streaming CTC models (#731) 2024-04-03 21:31:42 +08:00
hantengc
ccb2d435ec add openfst.cmake file (#707)
1. When compiling locally, openfst is missing.so add this file to the sherpa-onnx/cmake folder
2024-03-27 11:31:26 +08:00
Fangjun Kuang
4e040c596e Support including TTS conditionally. (#699) 2024-03-26 17:21:35 +08:00
Fangjun Kuang
0d258dd150 Support spoken language identification with whisper (#694) 2024-03-24 22:57:00 +08:00
Fangjun Kuang
1952772654 Add timestamps and tokens for .Net's online models. (#690) 2024-03-23 18:51:56 +08:00
Karel Vesely
eaec4c83c2 Configurable low_freq high_freq, dithering (#664) 2024-03-22 21:41:44 +08:00
Fangjun Kuang
f70fdd156c Support using T-head-Semi/csi-nn2 for RISC-V (#637) 2024-03-06 18:21:50 +08:00
Fangjun Kuang
bdf9243940 Allow to not use pre-installed onnxruntime libs. (#636) 2024-03-06 14:40:23 +08:00
Fangjun Kuang
13260cdf49 Use self-compiled onnxruntime shared lib. (#635) 2024-03-06 11:03:24 +08:00
Fangjun Kuang
a65643b594 support onnxruntime v1.17.1 (#624) 2024-03-02 11:44:59 +08:00
Fangjun Kuang
8b7928e7d6 Fix computing features for whisper. (#617) 2024-02-29 16:56:29 +08:00
Fangjun Kuang
85d59b5840 Use hub.nuaa.cf to replace huggingface URL to download dependencies. (#614) 2024-02-28 17:48:51 +08:00
Fangjun Kuang
0cb6d1b474 support using xnnpack as execution provider (#612) 2024-02-28 17:32:48 +08:00
Fangjun Kuang
87a7030c08 Support using alsa to access the microphone with non-streaming ASR models (#517) 2024-02-26 21:17:26 +08:00
Fangjun Kuang
ee37d9bd92 Support RISC-V (#609) 2024-02-26 06:57:18 +08:00
Fangjun Kuang
67acd34dcd Use alsa to read microphone in speaker identification demo. (#605) 2024-02-23 19:27:51 +08:00
Fangjun Kuang
5f075d0fce Support MinSizeRel and RelWithDebInfo build on Windows. (#586) 2024-02-20 10:22:02 +08:00
Fangjun Kuang
c68f39bd3c Use onnxruntime static lib compiled with gcc8 on ubuntu 20.04 (#587) 2024-02-20 09:31:37 +08:00
Fangjun Kuang
64007a6193 Support building debug version on Windows (#583) 2024-02-18 10:39:55 +08:00
Fangjun Kuang
81da0fb7a6 Update onnxruntime from 1.16.3 to 1.17.0 (#581) 2024-02-17 12:43:42 +08:00
Fangjun Kuang
d771762868 Support WebAssembly for text-to-speech (#577) 2024-02-08 23:39:12 +08:00