Commit Graph

308 Commits

Author SHA1 Message Date
Fangjun Kuang
0aacf02dd8 Add C++ runtime for vocos (#2014) 2025-03-17 17:05:15 +08:00
Fangjun Kuang
623cdc9eec Export vocos to sherpa-onnx (#2012) 2025-03-17 09:19:50 +08:00
Fangjun Kuang
c972554ad1 Add JavaScript API (wasm) for speech enhancement GTCRN models (#2007) 2025-03-15 17:41:23 +08:00
Fangjun Kuang
6a97f8adcf Add JavaScript (node-addon) API for speech enhancement GTCRN models (#1996) 2025-03-12 15:52:01 +08:00
Fangjun Kuang
fd78a482df Add Dart API for speech enhancement GTCRN models (#1993) 2025-03-12 12:39:08 +08:00
Fangjun Kuang
d78f408362 Add Go API for speech enhancement GTCRN models (#1991) 2025-03-11 19:33:05 +08:00
Fangjun Kuang
d3e27d5e21 Add C# API for speech enhancement GTCRN models (#1990) 2025-03-11 18:58:17 +08:00
franck-li
8aaae91d4a add SherpaOnnxOfflineRecognizerSetConfig binding for go, and optimize the new/free for C.struct_SherpaOnnxOfflineRecognizerConfig ptr (#1976)
Co-authored-by: liyuzhi <liyuzhi@info.easeus.com.cn>
2025-03-10 18:04:12 +08:00
Fangjun Kuang
6e261ed63f Export gtcrn models to sherpa-onnx (#1975) 2025-03-10 11:31:18 +08:00
Fangjun Kuang
dfcbc8d40b Add Kokoro v1.1-zh (#1942) 2025-02-28 15:47:59 +08:00
Fangjun Kuang
815ebac8f9 Fix building wheels for Python 3.7 (#1933) 2025-02-27 13:02:46 +08:00
xcel3011
b042f5e179 fix: AddPunct panic for Go(#1921) 2025-02-25 18:09:28 +08:00
franck-li
0dcaf3a061 go.mod set to use go 1.17, and use unsafe.Slice to optimize the code (#1920)
Co-authored-by: liyuzhi <liyuzhi@info.easeus.com.cn>
2025-02-25 15:31:15 +08:00
franck-li
808587accd change [1<<28] to [1<<10], to fix build issues on GOARCH=386 that [1<<28] too large (#1916)
Co-authored-by: liyuzhi <liyuzhi@info.easeus.com.cn>
2025-02-25 09:12:55 +08:00
Fangjun Kuang
4d79e6a007 Add C++ API for streaming zipformer ASR on RK NPU (#1908) 2025-02-24 19:07:37 +08:00
Sheldon Robinson
9c810ce3fe Fix #1901: UnicodeEncodeError running export_bpe_vocab.py (#1902) 2025-02-21 06:29:28 +08:00
Fangjun Kuang
774cf662a4 Fix generating Chinese lexicon for Kokoro TTS 1.0 (#1888) 2025-02-18 12:51:26 +08:00
Fangjun Kuang
26d5f1fa01 Fix kokoro lexicon. (#1886)
还钱 is mispronounced.
2025-02-18 11:09:16 +08:00
Fangjun Kuang
87a968b55d Add Go API for FireRedAsr AED Model (#1879) 2025-02-17 16:04:07 +08:00
Fangjun Kuang
d95d43112d Add C# API for FireRedAsr Model (#1875) 2025-02-17 13:32:41 +08:00
ilibx
2dd84b4582 [update] fixed bug: create golang instance succeed while the c struct create failed (#1860) 2025-02-14 14:20:29 +08:00
Fangjun Kuang
8b8ef1090b Fix CI (#1841) 2025-02-11 12:27:09 +08:00
Fangjun Kuang
f5bf8c8d4a Add Go API for audio tagging (#1840) 2025-02-11 12:07:28 +08:00
Fangjun Kuang
d5da9430e8 Add PengChengStarling models to sherpa-onnx (#1835) 2025-02-10 18:23:40 +08:00
Fangjun Kuang
9559a10bd3 Add C++ support for MatchaTTS models not from icefall. (#1834) 2025-02-10 15:38:29 +08:00
Fangjun Kuang
7d62ccf1fe Export MatchaTTS fa-en model to sherpa-onnx (#1832) 2025-02-10 14:38:08 +08:00
Fangjun Kuang
69f489f0cd Support scaling the duration of a pause in TTS. (#1820) 2025-02-08 12:47:26 +08:00
Fangjun Kuang
ae32dfaa56 Add C# API for Kokoro TTS 1.0 (#1805) 2025-02-07 15:39:03 +08:00
Fangjun Kuang
e1a88a799f Add Go API for Kokoro TTS 1.0 (#1804) 2025-02-07 15:18:02 +08:00
Fangjun Kuang
a52b819fb5 Add Android demo for Kokoro TTS 1.0 (#1799) 2025-02-07 13:07:30 +08:00
Fangjun Kuang
4372a7a7b0 Add Java and Koltin API for Kokoro TTS 1.0 (#1798) 2025-02-07 09:59:27 +08:00
Fangjun Kuang
c84a833863 Add C++ and Python API for Kokoro 1.0 multilingual TTS model (#1795) 2025-02-06 22:57:13 +08:00
Fangjun Kuang
08cefe8488 Export Kokoro 1.0 to sherpa-onnx (#1788) 2025-02-05 08:24:43 +08:00
Fangjun Kuang
8b989a851c Fix keyword spotting. (#1689)
Reset the stream right after detecting a keyword
2025-01-20 16:41:10 +08:00
Fangjun Kuang
99cef4198b Add Koltin and Java API for Kokoro TTS models (#1728) 2025-01-17 17:36:13 +08:00
Fangjun Kuang
2086f8c55b Add Go API for Kokoro TTS models (#1722) 2025-01-16 17:35:31 +08:00
Fangjun Kuang
cc812e6237 Add C# API for Kokoro TTS models (#1720) 2025-01-16 16:30:10 +08:00
Fangjun Kuang
9efe26a646 Export kokoro to sherpa-onnx (#1713) 2025-01-15 16:49:10 +08:00
Fangjun Kuang
0d20558b5e Fix passing strings from C# to C. (#1701)
See also
https://github.com/k2-fsa/sherpa-onnx/issues/1695#issuecomment-2585725190

We need to place a 0 at the end of the buffer.
2025-01-13 10:17:04 +08:00
徐络溟
ecc653871d Fix: export-onnx.py(expected all tensors to be on the same device) (#1699)
由于SenseVoiceSmall.from_pretrained()
    调用的funasr.auto.auto_model.AutoModel.build_model()默认device是cuda
    (在cuda available的环境中)
    ```py
    device = kwargs.get("device", "cuda")
    if not torch.cuda.is_available() or kwargs.get("ngpu", 1) == 0:
        device = "cpu"
        kwargs["batch_size"] = 1
    kwargs["device"] = device
    ```
    而export-onnx.py里的tensor默认都是cpu, 导致
    RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cpu
    所以直接在加载model的时候指定cpu
2025-01-10 19:26:36 +08:00
Fangjun Kuang
46330b25cc Add Go API for MatchaTTS models (#1685) 2025-01-06 08:03:03 +08:00
Fangjun Kuang
1fe5fe495f Add Android demo for MatchaTTS models. (#1683) 2025-01-06 06:44:09 +08:00
Fangjun Kuang
3eced3e7ee Add C# and JavaScript (wasm) API for MatchaTTS models (#1682) 2025-01-05 15:08:19 +08:00
Fangjun Kuang
0e299f30f5 Add JavaScript API (node-addon-api) for MatchaTTS models. (#1677) 2025-01-03 15:14:28 +08:00
Fangjun Kuang
49154c957b Add Go API for Keyword spotting (#1662) 2024-12-31 11:25:32 +08:00
Fangjun Kuang
08d771337b Add a byte-level BPE Chinese+English non-streaming zipformer model (#1645) 2024-12-24 16:56:49 +08:00
Fangjun Kuang
fe3265aa25 Add new tts models for Latvia and Persian+English (#1644) 2024-12-24 15:16:02 +08:00
Fangjun Kuang
b76cd9033a Support decoding with byte-level BPE (bbpe) models. (#1633) 2024-12-20 19:21:32 +08:00
Fangjun Kuang
1bae4085ca Add speaker diarization API for HarmonyOS. (#1609) 2024-12-10 16:03:03 +08:00
Fangjun Kuang
84821b1f99 Fix building node-addon package (#1598) 2024-12-06 10:11:18 +08:00