Fangjun Kuang
db632dacf3
Fix CI for windows ( #2279 )
2025-06-04 10:35:48 +08:00
愚者自愚
116977b5d4
Add Go implementation of the TTS generation callback ( #2213 )
2025-05-14 16:09:31 +08:00
Fangjun Kuang
fcb4c4eb2c
Add Go API for homophone replacer ( #2168 )
2025-04-30 23:47:38 +08:00
Fangjun Kuang
ba7d8b63f0
Add Go API for Dolphin CTC models ( #2090 )
2025-04-03 00:02:09 +08:00
Fangjun Kuang
d78f408362
Add Go API for speech enhancement GTCRN models ( #1991 )
2025-03-11 19:33:05 +08:00
franck-li
8aaae91d4a
add SherpaOnnxOfflineRecognizerSetConfig binding for go, and optimize the new/free for C.struct_SherpaOnnxOfflineRecognizerConfig ptr ( #1976 )
...
Co-authored-by: liyuzhi <liyuzhi@info.easeus.com.cn >
2025-03-10 18:04:12 +08:00
Fangjun Kuang
815ebac8f9
Fix building wheels for Python 3.7 ( #1933 )
2025-02-27 13:02:46 +08:00
xcel3011
b042f5e179
fix: AddPunct panic for Go( #1921 )
2025-02-25 18:09:28 +08:00
franck-li
0dcaf3a061
go.mod set to use go 1.17, and use unsafe.Slice to optimize the code ( #1920 )
...
Co-authored-by: liyuzhi <liyuzhi@info.easeus.com.cn >
2025-02-25 15:31:15 +08:00
franck-li
808587accd
change [1<<28] to [1<<10], to fix build issues on GOARCH=386 that [1<<28] too large ( #1916 )
...
Co-authored-by: liyuzhi <liyuzhi@info.easeus.com.cn >
2025-02-25 09:12:55 +08:00
Fangjun Kuang
87a968b55d
Add Go API for FireRedAsr AED Model ( #1879 )
2025-02-17 16:04:07 +08:00
ilibx
2dd84b4582
[update] fixed bug: create golang instance succeed while the c struct create failed ( #1860 )
2025-02-14 14:20:29 +08:00
Fangjun Kuang
f5bf8c8d4a
Add Go API for audio tagging ( #1840 )
2025-02-11 12:07:28 +08:00
Fangjun Kuang
69f489f0cd
Support scaling the duration of a pause in TTS. ( #1820 )
2025-02-08 12:47:26 +08:00
Fangjun Kuang
e1a88a799f
Add Go API for Kokoro TTS 1.0 ( #1804 )
2025-02-07 15:18:02 +08:00
Fangjun Kuang
8b989a851c
Fix keyword spotting. ( #1689 )
...
Reset the stream right after detecting a keyword
2025-01-20 16:41:10 +08:00
Fangjun Kuang
2086f8c55b
Add Go API for Kokoro TTS models ( #1722 )
2025-01-16 17:35:31 +08:00
Fangjun Kuang
46330b25cc
Add Go API for MatchaTTS models ( #1685 )
2025-01-06 08:03:03 +08:00
Fangjun Kuang
49154c957b
Add Go API for Keyword spotting ( #1662 )
2024-12-31 11:25:32 +08:00
Fangjun Kuang
3d3edabb5f
Add Go API for Moonshine models ( #1479 )
2024-10-27 09:39:09 +08:00
Fangjun Kuang
052b8645ba
Add Go API examples for adding punctuations to text. ( #1478 )
2024-10-27 09:04:05 +08:00
Fangjun Kuang
593b96758b
Add Go API for offline punctuation models ( #1434 )
...
It is contributed by a community user
from [our QQ group](https://k2-fsa.github.io/sherpa/social-groups.html#qq ).
2024-10-16 17:16:47 +08:00
Fangjun Kuang
1d061df355
WebAssembly exmaple for speaker diarization ( #1411 )
2024-10-10 22:14:45 +08:00
Fangjun Kuang
df681e9807
Go API for speaker diarization ( #1403 )
2024-10-09 20:10:44 +08:00
Fangjun Kuang
d8809b520e
Fix CI errors introduced by supporting loading keywords from buffers ( #1366 )
2024-09-20 19:04:21 +08:00
lllwan
bf06b268d0
Fix sherpa_onnx.go ( #1353 )
2024-09-17 13:39:56 +08:00
Fangjun Kuang
e7ffcbd677
Add APIs about max speech duration in VAD for various programming languages ( #1349 )
2024-09-14 12:30:13 +08:00
Fangjun Kuang
544857b097
Fix building ( #1343 )
2024-09-13 13:33:52 +08:00
Emmanuel Schmidbauer
a8556e31ba
add Tokens []string, Timestamps []float32, Lang string, Emotion string, Event string ( #1277 )
2024-08-27 06:35:59 +08:00
Parth Khiera
ba4cb6169f
feat: addition of blank_penalty config in online_recognizer ( #1232 )
2024-08-08 09:10:17 +08:00
Fangjun Kuang
4e6aeff07e
Refactor C API to prefix each API with SherpaOnnx. ( #1171 )
2024-07-26 18:47:02 +08:00
Fangjun Kuang
8f4d332aab
Add Go API for SenseVoice ( #1154 )
2024-07-20 23:41:53 +08:00
Fangjun Kuang
e472180f2c
Add C# API for SenseVoice models ( #1151 )
2024-07-20 17:09:23 +08:00
Fangjun Kuang
c35200dccf
Revert to onnxruntime 1.17.1 ( #1131 )
2024-07-15 14:24:08 +08:00
Fangjun Kuang
1c104ea847
Update onnxruntime from v1.18.0 to v1.18.1 ( #1107 )
2024-07-11 09:35:28 +08:00
Fangjun Kuang
dd0ff2ca06
Support onnxruntime 1.18.0 ( #906 )
2024-07-10 17:05:26 +08:00
Fangjun Kuang
c2cc9dec58
Add Flush to VAD so that the last segment can be detected. ( #1099 )
2024-07-09 16:15:56 +08:00
Fangjun Kuang
a25075101c
Build sherpa-onnx as a single shared library ( #1078 )
...
When `-D BUILD_SHARED_LIBS=ON` is passed to `cmake`, it builds a single shared library.
Specifically,
- For C APIs, it builds `libsherpa-onnx-c-api.so`
- For Python APIs, it builds `_sherpa_onnx.cpython-xx-xx.so`
- For Kotlin and Java APIs, it builds `libsherpa-onnx-jni.so`
There is no `libsherpa-onnx-core.so` any longer.
Note it affects only shared libraries.
2024-07-06 16:41:54 +08:00
Fangjun Kuang
6789c909d2
Inverse text normalization API of streaming ASR for various programming languages ( #1022 )
2024-06-18 13:42:17 +08:00
Fangjun Kuang
6e09933d99
Inverse text normalization API for other programming languages ( #1019 )
2024-06-17 17:02:39 +08:00
Fangjun Kuang
fd5a0d1e00
Add C++ runtime for Tele-AI/TeleSpeech-ASR ( #970 )
2024-06-05 00:26:40 +08:00
Fangjun Kuang
f8dbc10146
Fix CI ( #964 )
2024-06-04 17:05:49 +08:00
Fangjun Kuang
86ccb86006
fix building errors introduced by simple-sentencepiece ( #915 )
2024-05-24 14:48:04 +08:00
Fangjun Kuang
fdcae56a14
Fix Go tests ( #897 )
2024-05-21 11:50:13 +08:00
Fangjun Kuang
8af2af8466
Add tail_paddings to Whisper C API. ( #886 )
2024-05-17 09:20:07 +08:00
Fangjun Kuang
6686c7d3e6
Add dict_dir arg to c api to support Chinese TTS models using jieba ( #809 )
2024-04-25 12:28:31 +08:00
Fangjun Kuang
a5f8fbc83f
Support heteronyms in Chinese TTS ( #738 )
2024-04-08 11:01:30 +08:00
Fangjun Kuang
dbff2eaadb
Add C API for streaming HLG decoding ( #734 )
2024-04-05 10:31:20 +08:00
Fangjun Kuang
6da4a1c12f
Add Go API for speaker identification ( #718 )
2024-03-29 19:25:55 +08:00
Fangjun Kuang
a042f44076
Add Golang API for spoken language identification. ( #709 )
2024-03-27 19:40:25 +08:00