愚者自愚
116977b5d4
Add Go implementation of the TTS generation callback ( #2213 )
2025-05-14 16:09:31 +08:00
Fangjun Kuang
fcb4c4eb2c
Add Go API for homophone replacer ( #2168 )
2025-04-30 23:47:38 +08:00
Fangjun Kuang
ba7d8b63f0
Add Go API for Dolphin CTC models ( #2090 )
2025-04-03 00:02:09 +08:00
Jov
ef759b7b8b
fix case ( #2037 )
...
v should be V
2025-03-21 16:46:13 +08:00
Jov
572c8d292c
fix vits dict dir config ( #2036 )
2025-03-21 16:30:54 +08:00
Fangjun Kuang
0aacf02dd8
Add C++ runtime for vocos ( #2014 )
2025-03-17 17:05:15 +08:00
Fangjun Kuang
d78f408362
Add Go API for speech enhancement GTCRN models ( #1991 )
2025-03-11 19:33:05 +08:00
franck-li
0dcaf3a061
go.mod set to use go 1.17, and use unsafe.Slice to optimize the code ( #1920 )
...
Co-authored-by: liyuzhi <liyuzhi@info.easeus.com.cn >
2025-02-25 15:31:15 +08:00
Fangjun Kuang
87a968b55d
Add Go API for FireRedAsr AED Model ( #1879 )
2025-02-17 16:04:07 +08:00
Fangjun Kuang
f5bf8c8d4a
Add Go API for audio tagging ( #1840 )
2025-02-11 12:07:28 +08:00
Fangjun Kuang
e1a88a799f
Add Go API for Kokoro TTS 1.0 ( #1804 )
2025-02-07 15:18:02 +08:00
Fangjun Kuang
8b989a851c
Fix keyword spotting. ( #1689 )
...
Reset the stream right after detecting a keyword
2025-01-20 16:41:10 +08:00
Fangjun Kuang
2086f8c55b
Add Go API for Kokoro TTS models ( #1722 )
2025-01-16 17:35:31 +08:00
Fangjun Kuang
46330b25cc
Add Go API for MatchaTTS models ( #1685 )
2025-01-06 08:03:03 +08:00
Fangjun Kuang
49154c957b
Add Go API for Keyword spotting ( #1662 )
2024-12-31 11:25:32 +08:00
windy
0f4b1f41e2
🔧 build(portaudio-go): Fixed version 1.0.3 ( #1614 )
...
Co-authored-by: windy <deretame123@gmail.com >
2024-12-12 19:39:43 +08:00
Fangjun Kuang
3d3edabb5f
Add Go API for Moonshine models ( #1479 )
2024-10-27 09:39:09 +08:00
Fangjun Kuang
052b8645ba
Add Go API examples for adding punctuations to text. ( #1478 )
2024-10-27 09:04:05 +08:00
Fangjun Kuang
df681e9807
Go API for speaker diarization ( #1403 )
2024-10-09 20:10:44 +08:00
Fangjun Kuang
e7ffcbd677
Add APIs about max speech duration in VAD for various programming languages ( #1349 )
2024-09-14 12:30:13 +08:00
Emmanuel Schmidbauer
a8556e31ba
add Tokens []string, Timestamps []float32, Lang string, Emotion string, Event string ( #1277 )
2024-08-27 06:35:59 +08:00
Fangjun Kuang
8f4d332aab
Add Go API for SenseVoice ( #1154 )
2024-07-20 23:41:53 +08:00
Fangjun Kuang
b5093e27f9
Fix publishing apks to huggingface ( #1121 )
...
Save APKs for each release in a separate directory.
Huggingface requires that each directory cannot contain more than 1000 files.
Since we have so many tts models and for each model we need to build APKs of 4 different ABIs,
it is a workaround for the huggingface's constraint by placing them into separate directories for different releases.
2024-07-13 16:14:00 +08:00
Fangjun Kuang
dd0ff2ca06
Support onnxruntime 1.18.0 ( #906 )
2024-07-10 17:05:26 +08:00
Fangjun Kuang
6789c909d2
Inverse text normalization API of streaming ASR for various programming languages ( #1022 )
2024-06-18 13:42:17 +08:00
Fangjun Kuang
6e09933d99
Inverse text normalization API for other programming languages ( #1019 )
2024-06-17 17:02:39 +08:00
Fangjun Kuang
fd5a0d1e00
Add C++ runtime for Tele-AI/TeleSpeech-ASR ( #970 )
2024-06-05 00:26:40 +08:00
Fangjun Kuang
fdcae56a14
Fix Go tests ( #897 )
2024-05-21 11:50:13 +08:00
Fangjun Kuang
68b8b88b5a
Add Python API for punctuation models. ( #762 )
2024-04-13 13:28:17 +08:00
Fangjun Kuang
f20291cadc
Support audio tagging using zipformer ( #747 )
2024-04-10 14:47:06 +08:00
Fangjun Kuang
c9ae7595d5
Fix go API examples with portaudio on Windows. ( #746 )
2024-04-10 09:56:35 +08:00
Fangjun Kuang
a5f8fbc83f
Support heteronyms in Chinese TTS ( #738 )
2024-04-08 11:01:30 +08:00
Fangjun Kuang
dbff2eaadb
Add C API for streaming HLG decoding ( #734 )
2024-04-05 10:31:20 +08:00
Fangjun Kuang
43af1e6951
Release v1.9.15 ( #719 )
2024-03-29 19:58:04 +08:00
Fangjun Kuang
6da4a1c12f
Add Go API for speaker identification ( #718 )
2024-03-29 19:25:55 +08:00
Fangjun Kuang
a042f44076
Add Golang API for spoken language identification. ( #709 )
2024-03-27 19:40:25 +08:00
Fangjun Kuang
69c7880c4d
Add Golang API for VAD ( #708 )
2024-03-27 12:09:39 +08:00
Fangjun Kuang
acf0975153
Support whisper language/task in various language bindings. ( #679 )
2024-03-20 16:43:35 +08:00
Fangjun Kuang
e475e750ac
Support streaming zipformer CTC ( #496 )
...
* Support streaming zipformer CTC
* test online zipformer2 CTC
* Update doc of sherpa-onnx.cc
* Add Python APIs for streaming zipformer2 ctc
* Add Python API examples for streaming zipformer2 ctc
* Swift API for streaming zipformer2 CTC
* NodeJS API for streaming zipformer2 CTC
* Kotlin API for streaming zipformer2 CTC
* Golang API for streaming zipformer2 CTC
* C# API for streaming zipformer2 CTC
* Release v1.9.6
2023-12-22 13:46:33 +08:00
Fangjun Kuang
cae0231f93
Fix releasing go packages ( #476 )
2023-12-09 00:07:52 +08:00
Fangjun Kuang
1937717705
Add MFC TTS example on Windows ( #378 )
2023-10-21 00:13:07 +08:00
Fangjun Kuang
a69d0a950e
Add Go API for TTS ( #377 )
2023-10-20 15:57:52 +08:00
Fangjun Kuang
86b18184c9
Fix Go examples ( #300 )
2023-09-07 15:27:41 +08:00
Fangjun Kuang
e31f9e48c2
Fix various language binding APIs for tdnn and whisper models ( #278 )
2023-08-16 22:15:10 +08:00
Fangjun Kuang
f709c95c5f
Support multilingual whisper models ( #274 )
2023-08-16 00:28:52 +08:00
Fangjun Kuang
bc791d4996
Fix C api for Go and MFC to support streaming paraformer ( #268 )
2023-08-14 17:02:23 +08:00
Fangjun Kuang
b094868fb8
Add non-streaming websocket server for python ( #259 )
2023-08-11 15:56:24 +08:00
骨灰级
f73a799667
fix go build error ( #256 )
...
fix #255
2023-08-10 14:18:23 +08:00
Fangjun Kuang
92976b04a0
Update sherpa-onnx-go package version ( #233 )
2023-08-04 12:33:06 +08:00
Fangjun Kuang
e63d205b3d
Add go-api-examples ( #219 )
2023-07-21 17:56:09 +08:00