Fangjun Kuang
|
3d3edabb5f
|
Add Go API for Moonshine models (#1479)
|
2024-10-27 09:39:09 +08:00 |
|
Fangjun Kuang
|
052b8645ba
|
Add Go API examples for adding punctuations to text. (#1478)
|
2024-10-27 09:04:05 +08:00 |
|
Fangjun Kuang
|
2ca2985d04
|
Add C and C++ API for Moonshine models (#1476)
|
2024-10-26 23:24:46 +08:00 |
|
Fangjun Kuang
|
bd4b223920
|
Add Kotlin and Java API for Moonshine models (#1474)
|
2024-10-26 22:30:29 +08:00 |
|
Fangjun Kuang
|
669f5ef441
|
Add C++ runtime and Python APIs for Moonshine models (#1473)
|
2024-10-26 14:34:07 +08:00 |
|
Fangjun Kuang
|
0f2732e4e8
|
Publish pre-built JNI libs for Linux aarch64 (#1472)
|
2024-10-26 09:59:18 +08:00 |
|
Fangjun Kuang
|
b06b460851
|
Begin to support https://github.com/usefulsensors/moonshine (#1470)
|
2024-10-26 09:51:16 +08:00 |
|
Fangjun Kuang
|
707cf792c5
|
Add GigaAM NeMo transducer model for Russian ASR (#1467)
|
2024-10-25 15:20:13 +08:00 |
|
Fangjun Kuang
|
b41f6d2c94
|
Support GigaAM CTC models for Russian ASR (#1464)
See also https://github.com/salute-developers/GigaAM
|
2024-10-25 10:55:16 +08:00 |
|
Fangjun Kuang
|
ceb69ebd94
|
Add C++ API for non-streaming ASR (#1456)
|
2024-10-23 16:40:12 +08:00 |
|
Fangjun Kuang
|
effd5ef2be
|
Add C++ API for streaming ASR. (#1455)
It is a wrapper around the C API.
|
2024-10-23 12:07:43 +08:00 |
|
Fangjun Kuang
|
620597f501
|
Support https://huggingface.co/Revai/reverb-diarization-v1 (#1437)
|
2024-10-17 11:58:14 +08:00 |
|
Fangjun Kuang
|
df4150dc5d
|
Upload speaker embedding models to huggingface (#1428)
See also
https://huggingface.co/spaces/k2-fsa/speaker-diarization
|
2024-10-14 16:20:00 +08:00 |
|
Fangjun Kuang
|
5a22f74b2b
|
Android demo for speaker diarization (#1423)
|
2024-10-13 14:02:57 +08:00 |
|
Fangjun Kuang
|
5e273c5be4
|
Pascal API for speaker diarization (#1420)
|
2024-10-12 12:28:38 +08:00 |
|
Fangjun Kuang
|
1ed803adc1
|
Dart API for speaker diarization (#1418)
|
2024-10-11 21:17:41 +08:00 |
|
Fangjun Kuang
|
1851ff6337
|
Java API for speaker diarization (#1416)
|
2024-10-11 16:51:40 +08:00 |
|
Fangjun Kuang
|
eefc172095
|
JavaScript API with WebAssembly for speaker diarization (#1414)
#1408 uses [node-addon-api](https://github.com/nodejs/node-addon-api) to call C API from JavaScript, whereas this pull request uses WebAssembly to call C API from JavaScript.
|
2024-10-11 11:40:10 +08:00 |
|
Fangjun Kuang
|
1d061df355
|
WebAssembly exmaple for speaker diarization (#1411)
|
2024-10-10 22:14:45 +08:00 |
|
Fangjun Kuang
|
a45e5dba99
|
C# API for speaker diarization (#1407)
|
2024-10-10 14:29:05 +08:00 |
|
Fangjun Kuang
|
df681e9807
|
Go API for speaker diarization (#1403)
|
2024-10-09 20:10:44 +08:00 |
|
Fangjun Kuang
|
8535b1d3bb
|
Python API for speaker diarization. (#1400)
|
2024-10-09 14:13:26 +08:00 |
|
Fangjun Kuang
|
59407edcad
|
C++ API for speaker diarization (#1396)
|
2024-10-09 12:01:20 +08:00 |
|
Fangjun Kuang
|
70165cb42d
|
Speaker diarization example with onnxruntime Python API (#1395)
|
2024-10-06 16:37:29 +08:00 |
|
Fangjun Kuang
|
66feecb2b5
|
support whisper turbo (#1390)
|
2024-10-02 18:13:34 +08:00 |
|
Fangjun Kuang
|
b965f14cf0
|
Add Python API for clustering (#1385)
|
2024-09-30 11:33:15 +08:00 |
|
Fangjun Kuang
|
bc08160820
|
Export Pyannote speaker segmentation models to onnx (#1382)
|
2024-09-29 14:23:56 +08:00 |
|
Fangjun Kuang
|
11f0cb7e1c
|
Support Parakeet models from NeMo (#1381)
|
2024-09-27 17:12:00 +08:00 |
|
lxiao336
|
06b61ccad8
|
Allow more online models to load tokens file from the memory (#1352)
Co-authored-by: xiao <shawl336@6163.com>
|
2024-09-20 16:38:41 +08:00 |
|
Fangjun Kuang
|
647b63ea44
|
Release v1.10.27 (#1359)
|
2024-09-19 10:49:29 +08:00 |
|
Fangjun Kuang
|
576a3aa90d
|
Add non-streaming ONNX models for Russian ASR (#1358)
|
2024-09-18 13:43:49 +08:00 |
|
Fangjun Kuang
|
e7ffcbd677
|
Add APIs about max speech duration in VAD for various programming languages (#1349)
|
2024-09-14 12:30:13 +08:00 |
|
Fangjun Kuang
|
544857b097
|
Fix building (#1343)
|
2024-09-13 13:33:52 +08:00 |
|
lxiao336
|
65cfa7548a
|
re-pull-request allow tokens and hotwords be loaded from buffered string driectly (#1339)
Co-authored-by: xiao <shawl336@163.com>
|
2024-09-13 09:58:17 +08:00 |
|
Fangjun Kuang
|
e66d4c414a
|
Fix releasing dart packages. (#1317)
|
2024-09-04 12:12:13 +08:00 |
|
Fangjun Kuang
|
d60a4d418e
|
Provide prebuilt .jar files for different java versions. (#1307)
|
2024-08-30 14:16:31 +08:00 |
|
Fangjun Kuang
|
6b8877f185
|
Downgrade flutter sdk versions. (#1305)
|
2024-08-30 11:47:27 +08:00 |
|
Fangjun Kuang
|
0ccd3a4c3f
|
remove extra files from linux/macos/windows jni libs (#1301)
|
2024-08-29 10:45:38 +08:00 |
|
Fangjun Kuang
|
9064430c3e
|
Fix releasing wasm app for vad+asr (#1300)
|
2024-08-29 08:47:38 +08:00 |
|
Fangjun Kuang
|
6ec57327ce
|
add vad+sense voice example for C API (#1291)
|
2024-08-27 16:11:24 +08:00 |
|
Fangjun Kuang
|
5ed8e31868
|
Add VAD and keyword spotting for the Node package with WebAssembly (#1286)
|
2024-08-24 23:05:54 +08:00 |
|
Fangjun Kuang
|
537e163dd0
|
WebAssembly example for VAD + Non-streaming ASR (#1284)
|
2024-08-24 13:24:52 +08:00 |
|
Fangjun Kuang
|
1ef8a7a202
|
Add WebAssembly for VAD (#1281)
|
2024-08-23 17:08:37 +08:00 |
|
Fangjun Kuang
|
fb09f8fae3
|
Set batch size to 1 for more streaming ASR models (#1280)
|
2024-08-23 11:06:55 +08:00 |
|
Fangjun Kuang
|
0e0d04a97a
|
Provide models for mobile-only platforms by fixing batch size to 1 (#1276)
|
2024-08-22 19:36:24 +08:00 |
|
Fangjun Kuang
|
5a2aa110b8
|
Text to speech API for Object Pascal. (#1273)
|
2024-08-20 20:52:16 +08:00 |
|
Fangjun Kuang
|
f93f0ca94d
|
Use a separate thread to initialize models for lazarus examples. (#1270)
So that the main thread is not blocked and the user interface is responsive.
|
2024-08-18 14:59:48 +08:00 |
|
Fangjun Kuang
|
63713ecbf0
|
Build generating subtitles APPs for more models (#1265)
|
2024-08-16 20:11:24 +08:00 |
|
Fangjun Kuang
|
fbe35ba736
|
Add Lazarus example for generating subtitles using Silero VAD with non-streaming ASR (#1251)
|
2024-08-15 22:19:45 +08:00 |
|
Fangjun Kuang
|
ca729faebf
|
Support reading multi-channel wave files with 8/16/32-bit encoded samples (#1258)
|
2024-08-15 14:54:43 +08:00 |
|