Fangjun Kuang
|
2ca2985d04
|
Add C and C++ API for Moonshine models (#1476)
|
2024-10-26 23:24:46 +08:00 |
|
Fangjun Kuang
|
ceb69ebd94
|
Add C++ API for non-streaming ASR (#1456)
|
2024-10-23 16:40:12 +08:00 |
|
Fangjun Kuang
|
effd5ef2be
|
Add C++ API for streaming ASR. (#1455)
It is a wrapper around the C API.
|
2024-10-23 12:07:43 +08:00 |
|
Fangjun Kuang
|
1ed803adc1
|
Dart API for speaker diarization (#1418)
|
2024-10-11 21:17:41 +08:00 |
|
Fangjun Kuang
|
1d061df355
|
WebAssembly example for speaker diarization (#1411)
|
2024-10-10 22:14:45 +08:00 |
|
Fangjun Kuang
|
d468527f62
|
C API for speaker diarization (#1402)
|
2024-10-09 17:10:03 +08:00 |
|
lxiao336
|
06b61ccad8
|
Allow more online models to load tokens file from the memory (#1352)
Co-authored-by: xiao <shawl336@6163.com>
|
2024-09-20 16:38:41 +08:00 |
|
Fangjun Kuang
|
e7ffcbd677
|
Add APIs about max speech duration in VAD for various programming languages (#1349)
|
2024-09-14 12:30:13 +08:00 |
|
lxiao336
|
65cfa7548a
|
Re-submit pull request: allow tokens and hotwords to be loaded from a buffered string directly (#1339)
Co-authored-by: xiao <shawl336@163.com>
|
2024-09-13 09:58:17 +08:00 |
|
Fangjun Kuang
|
ca30d83915
|
Avoid SherpaOnnxSpeakerEmbeddingManagerFreeBestMatches freeing null. (#1296)
Fixes #1295
|
2024-08-28 10:42:36 +08:00 |
|
Fangjun Kuang
|
537e163dd0
|
WebAssembly example for VAD + Non-streaming ASR (#1284)
|
2024-08-24 13:24:52 +08:00 |
|
Fangjun Kuang
|
5a2aa110b8
|
Text to speech API for Object Pascal. (#1273)
|
2024-08-20 20:52:16 +08:00 |
|
Robin Zhong
|
62c4d4ab62
|
Add emotion, event of SenseVoice. (#1257)
* Add emotion, event of SenseVoice.
* Fix tokens size check and update java api.
https://github.com/k2-fsa/sherpa-onnx/pull/1257
|
2024-08-14 15:50:13 +08:00 |
|
Fangjun Kuang
|
5791b695ea
|
Pascal API for streaming ASR (#1246)
|
2024-08-12 19:55:51 +08:00 |
|
Fangjun Kuang
|
94e256244d
|
Add blank penalty for various language bindings. (#1234)
|
2024-08-08 10:43:31 +08:00 |
|
Parth Khiera
|
ba4cb6169f
|
feat: addition of blank_penalty config in online_recognizer (#1232)
|
2024-08-08 09:10:17 +08:00 |
|
Fangjun Kuang
|
4e6aeff07e
|
Refactor C API to prefix each API with SherpaOnnx. (#1171)
|
2024-07-26 18:47:02 +08:00 |
|
Fangjun Kuang
|
25f0a10468
|
Add C++ runtime for SenseVoice models (#1148)
|
2024-07-18 22:54:18 +08:00 |
|
Fangjun Kuang
|
960eb7529e
|
Add C++ runtime for MeloTTS (#1138)
|
2024-07-16 15:55:02 +08:00 |
|
ivan provalov
|
de04b3b9bf
|
Allow modifying model config at decode time for ASR (#1124)
|
2024-07-13 22:30:47 +08:00 |
|
thewh1teagle
|
c0eaf86dbd
|
feat: find best embedding matches (#1102)
|
2024-07-11 09:38:06 +08:00 |
|
Fangjun Kuang
|
c2cc9dec58
|
Add Flush to VAD so that the last segment can be detected. (#1099)
|
2024-07-09 16:15:56 +08:00 |
|
Manix
|
55decb7bee
|
Add config for TensorRT and CUDA execution provider (#992)
Signed-off-by: manickavela1998@gmail.com <manickavela1998@gmail.com>
Signed-off-by: manickavela1998@gmail.com <manickavela.arumugam@uniphore.com>
|
2024-07-05 15:18:37 +08:00 |
|
Fangjun Kuang
|
03ebdf3fc6
|
Fix possible segfault in C API. (#1059)
|
2024-06-26 09:57:19 +08:00 |
|
Fangjun Kuang
|
9dd0e03568
|
Allow stopping TTS generation (#1041)
|
2024-06-22 18:18:36 +08:00 |
|
Fangjun Kuang
|
6789c909d2
|
Inverse text normalization API of streaming ASR for various programming languages (#1022)
|
2024-06-18 13:42:17 +08:00 |
|
Fangjun Kuang
|
6e09933d99
|
Inverse text normalization API for other programming languages (#1019)
|
2024-06-17 17:02:39 +08:00 |
|
Fangjun Kuang
|
fd5a0d1e00
|
Add C++ runtime for Tele-AI/TeleSpeech-ASR (#970)
|
2024-06-05 00:26:40 +08:00 |
|
9728Lin
|
9edb78e21b
|
Update c-api.h to hotwords (#962)
|
2024-06-03 16:26:12 +08:00 |
|
Fangjun Kuang
|
f1cff83ef9
|
Add address sanitizer and undefined behavior sanitizer (#951)
|
2024-05-31 13:17:01 +08:00 |
|
Leo Huang
|
d45223034c
|
Added tokens, tokens_arr and json for offline recognizer result (#936)
Co-authored-by: leo <webmaster@360converter.com>
|
2024-05-29 12:53:28 +08:00 |
|
hantengc
|
1371c6b3f0
|
Provide an API for setting keywords, so that keywords can be adjusted dynamically for recognition (#923)
|
2024-05-27 19:07:26 +08:00 |
|
Fangjun Kuang
|
8af2af8466
|
Add tail_paddings to Whisper C API. (#886)
|
2024-05-17 09:20:07 +08:00 |
|
Fangjun Kuang
|
03c956a317
|
Add keyword spotting API for node-addon-api (#877)
|
2024-05-14 20:26:48 +08:00 |
|
Fangjun Kuang
|
6686c7d3e6
|
Add dict_dir arg to C API to support Chinese TTS models using jieba (#809)
|
2024-04-25 12:28:31 +08:00 |
|
Fangjun Kuang
|
c1608b3524
|
Support CED models (#792)
|
2024-04-19 15:20:37 +08:00 |
|
Fangjun Kuang
|
13730ecbd8
|
Add C API for punctuation (#768)
|
2024-04-14 19:02:34 +08:00 |
|
Fangjun Kuang
|
f204e62b44
|
Add C API for audio tagging (#754)
|
2024-04-11 14:18:43 +08:00 |
|
Fangjun Kuang
|
a5f8fbc83f
|
Support heteronyms in Chinese TTS (#738)
|
2024-04-08 11:01:30 +08:00 |
|
Fangjun Kuang
|
c1c0f5bafd
|
return timestamps for WebAssembly (#737)
|
2024-04-05 20:24:27 +08:00 |
|
Fangjun Kuang
|
dbff2eaadb
|
Add C API for streaming HLG decoding (#734)
|
2024-04-05 10:31:20 +08:00 |
|
Fangjun Kuang
|
2e0bccad36
|
Add C API for speaker embedding extractor. (#711)
|
2024-03-28 18:05:40 +08:00 |
|
Leo Huang
|
638f48f47a
|
Added progress for callback of tts generator (#712)
Co-authored-by: leohwang <leohwang@360converter.com>
|
2024-03-28 17:12:20 +08:00 |
|
Fangjun Kuang
|
69c7880c4d
|
Add Golang API for VAD (#708)
|
2024-03-27 12:09:39 +08:00 |
|
Fangjun Kuang
|
4e040c596e
|
Support including TTS conditionally. (#699)
|
2024-03-26 17:21:35 +08:00 |
|
Fangjun Kuang
|
ab7cff2513
|
Add C API for spoken language identification. (#695)
|
2024-03-25 15:16:47 +08:00 |
|
Fangjun Kuang
|
1952772654
|
Add timestamps and tokens for .Net's online models. (#690)
|
2024-03-23 18:51:56 +08:00 |
|
Fangjun Kuang
|
acf0975153
|
Support whisper language/task in various language bindings. (#679)
|
2024-03-20 16:43:35 +08:00 |
|
Viggo
|
842d04d7ae
|
support whisper language (#678)
|
2024-03-20 10:16:22 +08:00 |
|
Lovemefan
|
009ed2cd30
|
add WebAssembly for Kws (#648)
|
2024-03-11 21:02:31 +08:00 |
|