Fangjun Kuang
3622104133
Add C# API for Moonshine models. ( #1483 )
...
* Also, return timestamps for non-streaming ASR.
2024-10-27 13:14:25 +08:00
Fangjun Kuang
54468a7370
Add Dart API for Moonshine models. ( #1481 )
2024-10-27 12:04:12 +08:00
Fangjun Kuang
6f261d39f3
Add JavaScript API for Moonshine models ( #1480 )
2024-10-27 11:31:01 +08:00
Fangjun Kuang
669f5ef441
Add C++ runtime and Python APIs for Moonshine models ( #1473 )
2024-10-26 14:34:07 +08:00
Fangjun Kuang
b41f6d2c94
Support GigaAM CTC models for Russian ASR ( #1464 )
...
See also https://github.com/salute-developers/GigaAM
2024-10-25 10:55:16 +08:00
Fangjun Kuang
ceb69ebd94
Add C++ API for non-streaming ASR ( #1456 )
2024-10-23 16:40:12 +08:00
Fangjun Kuang
effd5ef2be
Add C++ API for streaming ASR. ( #1455 )
...
It is a wrapper around the C API.
2024-10-23 12:07:43 +08:00
Fangjun Kuang
1ed803adc1
Dart API for speaker diarization ( #1418 )
2024-10-11 21:17:41 +08:00
Fangjun Kuang
eefc172095
JavaScript API with WebAssembly for speaker diarization ( #1414 )
...
#1408 uses [node-addon-api](https://github.com/nodejs/node-addon-api ) to call C API from JavaScript, whereas this pull request uses WebAssembly to call C API from JavaScript.
2024-10-11 11:40:10 +08:00
Fangjun Kuang
67349b52f2
JavaScript API (node-addon) for speaker diarization ( #1408 )
2024-10-10 15:51:31 +08:00
Fangjun Kuang
a45e5dba99
C# API for speaker diarization ( #1407 )
2024-10-10 14:29:05 +08:00
Fangjun Kuang
1571344509
Swift API for speaker diarization ( #1404 )
2024-10-09 23:25:39 +08:00
Fangjun Kuang
8535b1d3bb
Python API for speaker diarization. ( #1400 )
2024-10-09 14:13:26 +08:00
Fangjun Kuang
59407edcad
C++ API for speaker diarization ( #1396 )
2024-10-09 12:01:20 +08:00
Fangjun Kuang
b965f14cf0
Add Python API for clustering ( #1385 )
2024-09-30 11:33:15 +08:00
Fangjun Kuang
576a3aa90d
Add non-streaming ONNX models for Russian ASR ( #1358 )
2024-09-18 13:43:49 +08:00
Lim Yao Chong
3bffc24d64
Add Python binding for online punctuation models ( #1312 )
2024-09-09 10:26:53 +08:00
Fangjun Kuang
a2a70900d6
ADD VAD+ASR example for dart with CircularBuffer. ( #1293 )
2024-08-27 19:29:34 +08:00
Fangjun Kuang
5ed8e31868
Add VAD and keyword spotting for the Node package with WebAssembly ( #1286 )
2024-08-24 23:05:54 +08:00
Fangjun Kuang
ca729faebf
Support reading multi-channel wave files with 8/16/32-bit encoded samples ( #1258 )
2024-08-15 14:54:43 +08:00
Fangjun Kuang
9ee2943ed4
Add CI tests for online punctuation models ( #1226 )
2024-08-06 18:10:30 +08:00
Fangjun Kuang
35c1b4a7a9
Add ReazonSpeech Japanese pre-trained model ( #1203 )
2024-08-02 10:21:24 +08:00
Fangjun Kuang
53484fcd9b
Fix reading non-standard wav files. ( #1199 )
2024-08-01 17:48:04 +08:00
Fangjun Kuang
ec98110e11
Add speaker identification and verification exmaple for Dart API ( #1194 )
2024-07-31 13:53:52 +08:00
Fangjun Kuang
06fd50f536
Add test about whisper large-v3 for .Net ( #1187 )
2024-07-29 20:49:38 +08:00
Fangjun Kuang
646f99c870
Dart API for adding punctuations to text ( #1182 )
2024-07-29 12:41:52 +08:00
Fangjun Kuang
cd1fedaa49
Add Dart API for audio tagging ( #1181 )
2024-07-29 11:15:14 +08:00
Fangjun Kuang
69b6b47d91
Add vad with non-streaming ASR examples for Dart API ( #1180 )
2024-07-28 23:01:03 +08:00
Fangjun Kuang
d279c8d20e
Add more Python examples for SenseVoice ( #1179 )
2024-07-28 21:54:38 +08:00
Fangjun Kuang
994c3e7c96
Add VAD + Non-streaming ASR example for JavaScript API. ( #1170 )
2024-07-26 12:42:08 +08:00
Fangjun Kuang
ac8223bd8a
Add Dart API for keyword spotter ( #1162 )
2024-07-22 10:53:34 +08:00
Fangjun Kuang
ffdb23a8ec
Add dart API for SenseVoice ( #1159 )
2024-07-21 21:48:12 +08:00
Fangjun Kuang
70d14353bb
Add WebAssembly for SenseVoice ( #1158 )
2024-07-21 15:39:55 +08:00
Fangjun Kuang
c3260ef842
Add JavaScript API for SenseVoice ( #1157 )
2024-07-21 10:14:14 +08:00
Fangjun Kuang
e472180f2c
Add C# API for SenseVoice models ( #1151 )
2024-07-20 17:09:23 +08:00
Fangjun Kuang
25f0a10468
Add C++ runtime for SenseVoice models ( #1148 )
2024-07-18 22:54:18 +08:00
Fangjun Kuang
9e448d03bc
Provide npm package for 32-bit Windows x86 ( #1141 )
2024-07-17 12:33:15 +08:00
Fangjun Kuang
b2c283fa2b
Add Swift API for adding punctuations to text. ( #1132 )
2024-07-15 15:30:40 +08:00
Fangjun Kuang
1c104ea847
Update onnxruntime from v1.18.0 to v1.18.1 ( #1107 )
2024-07-11 09:35:28 +08:00
Fangjun Kuang
08c758520f
Add keyword spotting for C# ( #1105 )
2024-07-10 21:18:46 +08:00
Fangjun Kuang
dd0ff2ca06
Support onnxruntime 1.18.0 ( #906 )
2024-07-10 17:05:26 +08:00
Fangjun Kuang
a25075101c
Build sherpa-onnx as a single shared library ( #1078 )
...
When `-D BUILD_SHARED_LIBS=ON` is passed to `cmake`, it builds a single shared library.
Specifically,
- For C APIs, it builds `libsherpa-onnx-c-api.so`
- For Python APIs, it builds `_sherpa_onnx.cpython-xx-xx.so`
- For Kotlin and Java APIs, it builds `libsherpa-onnx-jni.so`
There is no `libsherpa-onnx-core.so` any longer.
Note it affects only shared libraries.
2024-07-06 16:41:54 +08:00
Fangjun Kuang
ab21131f7f
Swift API for keyword spotting. ( #1027 )
2024-06-18 16:51:30 +08:00
Fangjun Kuang
6789c909d2
Inverse text normalization API of streaming ASR for various programming languages ( #1022 )
2024-06-18 13:42:17 +08:00
Fangjun Kuang
349d957da2
Add inverse text normalization for online ASR ( #1020 )
2024-06-17 18:39:23 +08:00
Fangjun Kuang
6e09933d99
Inverse text normalization API for other programming languages ( #1019 )
2024-06-17 17:02:39 +08:00
Fangjun Kuang
b0f7ed3ee3
Add inverse text normalization for non-streaming ASR ( #1017 )
2024-06-17 14:28:53 +08:00
Fangjun Kuang
e52d32b95b
Add TTS API and examples for Dart ( #1010 )
2024-06-15 14:30:36 +08:00
Fangjun Kuang
e3077670c6
Add streaming ASR examples for Dart API ( #1009 )
2024-06-15 11:48:54 +08:00
Fangjun Kuang
d94506698d
Add non-streaming ASR examples for Dart API ( #1007 )
2024-06-14 18:40:16 +08:00