Commit Graph

981 Commits

Author SHA1 Message Date
Fangjun Kuang
ce71b6327a Fix building wheels (#1703) 2025-01-13 12:00:45 +08:00
Fangjun Kuang
cbe07ac1b6 Release v1.10.39 (#1702) 2025-01-13 10:28:05 +08:00
Fangjun Kuang
0d20558b5e Fix passing strings from C# to C. (#1701)
See also
https://github.com/k2-fsa/sherpa-onnx/issues/1695#issuecomment-2585725190

We need to place a 0 at the end of the buffer.
2025-01-13 10:17:04 +08:00
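The note in 0d20558b5e about placing a 0 at the end of the buffer can be illustrated with a minimal sketch. It is written in Python with `ctypes` rather than C#, and it is not the code from the commit. The helper name `to_c_string_bytes` and the sample text are hypothetical. It only shows why C code needs the trailing 0 byte to find the end of a string.

```python
import ctypes


def to_c_string_bytes(text: str) -> bytes:
    """Encode text as UTF-8 and append the terminating 0 byte that C expects."""
    return text.encode("utf-8") + b"\x00"


# Hypothetical input; any string works.
raw = to_c_string_bytes("你好 sherpa")

# Copy the bytes into a fixed-size buffer, as a caller would before
# handing a pointer to native code.
buf = (ctypes.c_char * len(raw)).from_buffer_copy(raw)

# string_at() without a size reads up to the first 0 byte, like strlen() does.
decoded = ctypes.string_at(ctypes.addressof(buf)).decode("utf-8")
```

Without the trailing 0, a C function reading the buffer would keep going past the intended end of the string.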
徐络溟
ecc653871d Fix: export-onnx.py(expected all tensors to be on the same device) (#1699)
SenseVoiceSmall.from_pretrained()
    calls funasr.auto.auto_model.AutoModel.build_model(), whose default device is cuda
    (in an environment where CUDA is available):
    ```py
    device = kwargs.get("device", "cuda")
    if not torch.cuda.is_available() or kwargs.get("ngpu", 1) == 0:
        device = "cpu"
        kwargs["batch_size"] = 1
    kwargs["device"] = device
    ```
    The tensors in export-onnx.py are on the CPU by default, which causes
    RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cpu
    The fix is to specify cpu explicitly when loading the model.
2025-01-10 19:26:36 +08:00
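The device-selection logic quoted in ecc653871d can be exercised on its own with a small sketch. The function name `resolve_device` and the `cuda_available` parameter are hypothetical; the parameter stands in for `torch.cuda.is_available()` so the example runs without torch. The exact argument the commit passes when loading the model is not shown in the log, so `device="cpu"` here is an assumption.

```python
def resolve_device(cuda_available: bool, **kwargs) -> dict:
    """Mirror the device-selection logic quoted in the commit message.

    `cuda_available` replaces torch.cuda.is_available() (an assumption made
    so that the sketch has no torch dependency).
    """
    device = kwargs.get("device", "cuda")
    if not cuda_available or kwargs.get("ngpu", 1) == 0:
        device = "cpu"
        kwargs["batch_size"] = 1
    kwargs["device"] = device
    return kwargs


# On a CUDA-capable host the default ends up on the GPU ...
default_on_gpu_host = resolve_device(cuda_available=True)

# ... while asking for the CPU explicitly keeps the model on the CPU,
# matching the CPU tensors created by the export script.
explicit_cpu = resolve_device(cuda_available=True, device="cpu")
```

With the default, the model lands on `cuda` while the export script's tensors stay on the CPU, which is the mismatch reported in the error message.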
Fangjun Kuang
0cb2db385a Add README for android libs. (#1693) 2025-01-07 11:04:10 +08:00
Fangjun Kuang
6d18430dbf Fix building without TTS (#1691) 2025-01-07 09:59:20 +08:00
Fangjun Kuang
930986b06c Release v1.10.38 (#1688) 2025-01-06 11:31:18 +08:00
Fangjun Kuang
d7c95d33a3 Add Dart API for MatchaTTS models (#1687) 2025-01-06 11:03:31 +08:00
Fangjun Kuang
c6fcd32552 Add Pascal API for MatchaTTS models. (#1686) 2025-01-06 10:04:35 +08:00
Fangjun Kuang
46330b25cc Add Go API for MatchaTTS models (#1685) 2025-01-06 08:03:03 +08:00
Fangjun Kuang
6f085babcc Add Swift API for MatchaTTS models. (#1684) 2025-01-06 07:23:45 +08:00
Fangjun Kuang
1fe5fe495f Add Android demo for MatchaTTS models. (#1683) 2025-01-06 06:44:09 +08:00
Fangjun Kuang
3eced3e7ee Add C# and JavaScript (wasm) API for MatchaTTS models (#1682) 2025-01-05 15:08:19 +08:00
Michael Lamothe
1ef9e5ee3a Update workflows to use .NET 8.0 also. (#1681) 2025-01-04 19:54:49 +08:00
Michael Lamothe
8a60985363 Upgraded to .NET 8 and made code style a little more internally consistent. (#1680) 2025-01-04 16:39:06 +08:00
Fangjun Kuang
bf3330c906 Add HarmonyOS examples for MatchaTTS. (#1678) 2025-01-03 17:09:29 +08:00
Fangjun Kuang
0e299f30f5 Add JavaScript API (node-addon-api) for MatchaTTS models. (#1677) 2025-01-03 15:14:28 +08:00
Fangjun Kuang
648903834b Add CXX API for MatchaTTS models (#1676) 2025-01-03 14:16:36 +08:00
Fangjun Kuang
9aa4897a9e Add C API for MatchaTTS models (#1675) 2025-01-03 12:17:26 +08:00
Fangjun Kuang
a4365dad82 Avoid adding tail padding for VAD in generate-subtitles.py (#1674) 2025-01-03 10:37:39 +08:00
Fangjun Kuang
a00d3b4821 Add Java API for Matcha-TTS models. (#1673) 2025-01-02 15:15:30 +08:00
Fangjun Kuang
f457baea42 Support Matcha-TTS models using espeak-ng (#1672) 2025-01-02 13:46:43 +08:00
Fangjun Kuang
3422b9388d Add Kotlin API for Matcha-TTS models. (#1668) 2024-12-31 19:20:52 +08:00
w-rui
0a43e9c879 Add constructor fromPtr() for all flutter class with factory ctor. (#1667)
Co-authored-by: wangrui <wangrui@oecore.com>
2024-12-31 18:07:52 +08:00
Fangjun Kuang
ebe92e523d Remove spaces after punctuations for TTS (#1666) 2024-12-31 16:06:27 +08:00
Fangjun Kuang
d3538531c4 Fix initialize TTS in Python. (#1664) 2024-12-31 15:14:56 +08:00
Fangjun Kuang
b2ad6f63f7 Release v1.10.37 (#1663) 2024-12-31 12:50:38 +08:00
Fangjun Kuang
2c2926af7d Add C++ runtime for Matcha-TTS (#1627) 2024-12-31 12:44:14 +08:00
yujinqiu
5c2cc48f50 Add swift online punctuation (#1661) 2024-12-31 11:26:32 +08:00
Fangjun Kuang
49154c957b Add Go API for Keyword spotting (#1662) 2024-12-31 11:25:32 +08:00
Fangjun Kuang
38d64a6d81 Fix building macOS libs (#1656) 2024-12-27 18:15:41 +08:00
Fangjun Kuang
268d562135 Add TeleSpeech CTC to non_streaming_server.py (#1649) 2024-12-26 11:11:03 +08:00
Fangjun Kuang
b6f0f5fc2e Support removing invalid utf-8 sequences. (#1648) 2024-12-25 19:32:13 +08:00
Fangjun Kuang
08d771337b Add a byte-level BPE Chinese+English non-streaming zipformer model (#1645) 2024-12-24 16:56:49 +08:00
Fangjun Kuang
fe3265aa25 Add new tts models for Latvia and Persian+English (#1644) 2024-12-24 15:16:02 +08:00
Fangjun Kuang
30a17b96fa Release v1.10.36 (#1643) 2024-12-24 11:50:58 +08:00
Fangjun Kuang
d00d1c6298 Fix GitHub actions. (#1642) 2024-12-24 11:34:35 +08:00
Roman Inflianskas
6613828d86 SherpaOnnxVadAsr: Offload runSecondPass to background thread for improved real-time audio processing (#1638)
This change ensures that the main audio processing loop is not blocked by
long-running operations in `runSecondPass`, improving responsiveness and
reducing the risk of missing parts of input speech.
2024-12-24 11:07:32 +08:00
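The idea behind 6613828d86, keeping the audio loop free by handing slow work to a background thread, can be sketched in Python as follows. This is not the SherpaOnnxVadAsr code: `run_second_pass`, the segment names, and the queue-plus-sentinel design are all hypothetical choices made for illustration.

```python
import queue
import threading

results = []
segments = queue.Queue()


def run_second_pass(segment):
    """Placeholder for a slow second-pass decode (hypothetical)."""
    return f"decoded:{segment}"


def worker():
    # Process segments in arrival order until the None sentinel arrives.
    while True:
        segment = segments.get()
        if segment is None:
            break
        results.append(run_second_pass(segment))


thread = threading.Thread(target=worker, daemon=True)
thread.start()

# The "audio loop": put() returns immediately, so reading the next chunk
# is not delayed by the second pass.
for chunk in ["seg-1", "seg-2", "seg-3"]:
    segments.put(chunk)

segments.put(None)  # signal shutdown
thread.join()
```

A single worker with a FIFO queue keeps results in the order the segments were detected, which matters when the output is shown as running text.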
Humorousf
a3d63130ba Update README.md (#1640) 2024-12-23 15:39:00 +08:00
thewh1teagle
4681bdfd8d feat: enable c api for android ci (#1635) 2024-12-20 22:42:31 +08:00
Fangjun Kuang
b76cd9033a Support decoding with byte-level BPE (bbpe) models. (#1633) 2024-12-20 19:21:32 +08:00
thewh1teagle
7192e576a9 feat: add checksum action (#1632) 2024-12-20 15:07:45 +08:00
Fangjun Kuang
86381e12ba Update readme to include jetson orin nx and nano b01 (#1631) 2024-12-19 18:52:42 +08:00
Fangjun Kuang
70ee779410 Support using onnxruntime 1.16.0 with CUDA 11.4 on Jetson Orin NX (Linux arm64 GPU). (#1630)
* Support using onnxruntime 1.16.0 with CUDA 11.4 on Jetson Orin NX.

The pre-built onnxruntime libs are provided by the community
using the following command:

```bash
./build.sh --build_shared_lib --config Release --update \
  --build --parallel --use_cuda \
  --cuda_home /usr/local/cuda \
  --cudnn_home /usr/lib/aarch64-linux-gnu 2>&1 | tee my-log.txt
```

See also https://github.com/microsoft/onnxruntime/discussions/11226

---

Info about the board:

```
Model: NVIDIA Orin NX T801-16GB - Jetpack 5.1.4 [L4T 35.6.0]
```

```
nvidia@nvidia-desktop:~/Downloads$ head -n 1 /etc/nv_tegra_release
# R35 (release), REVISION: 6.0, GCID: 37391689, BOARD: t186ref, EABI: aarch64, DATE: Wed Aug 28 09:12:27 UTC 2024

nvidia@nvidia-desktop:~/Downloads$ uname -r
5.10.216-tegra

nvidia@nvidia-desktop:~/Downloads$ lsb_release -i -r
Distributor ID:	Ubuntu
Release:	20.04

nvidia@nvidia-desktop:~/Downloads$ nvcc -V
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2022 NVIDIA Corporation
Built on Wed_Sep_21_10:43:33_PDT_2022
Cuda compilation tools, release 11.8, V11.8.89
Build cuda_11.8.r11.8/compiler.31833905_0

nvidia@nvidia-desktop:~/Downloads$ dpkg -l libcudnn8
Desired=Unknown/Install/Remove/Purge/Hold
| Status=Not/Inst/Conf-files/Unpacked/halF-conf/Half-inst/trig-aWait/Trig-pend
|/ Err?=(none)/Reinst-required (Status,Err: uppercase=bad)
||/ Name           Version              Architecture Description
+++-==============-====================-============-=================================
ii  libcudnn8      8.6.0.166-1+cuda11.4 arm64        cuDNN runtime libraries

nvidia@nvidia-desktop:~/Downloads$ dpkg -l tensorrt
Desired=Unknown/Install/Remove/Purge/Hold
| Status=Not/Inst/Conf-files/Unpacked/halF-conf/Half-inst/trig-aWait/Trig-pend
|/ Err?=(none)/Reinst-required (Status,Err: uppercase=bad)
||/ Name           Version            Architecture Description
+++-==============-==================-============-=================================
ii  tensorrt       8.5.2.2-1+cuda11.4 arm64        Meta package for TensorRT
```
2024-12-19 18:19:53 +08:00
sawich
5cc60de5c9 Rename maxNumStences to maxNumSentences (#1625) 2024-12-16 22:37:59 +08:00
Fangjun Kuang
ed8d8e41f9 Update readme to include Open-LLM-VTuber (#1622) 2024-12-16 10:47:07 +08:00
Fangjun Kuang
e639c70d78 Support linking onnxruntime statically for Android (#1619) 2024-12-14 09:53:44 +08:00
Fangjun Kuang
efb505f578 Update AAR version in Android Java demo (#1618) 2024-12-12 20:51:57 +08:00
Fangjun Kuang
e54c1f4533 Release v1.10.35 (#1617) 2024-12-12 20:07:47 +08:00
windy
0f4b1f41e2 🔧 build(portaudio-go): Fixed version 1.0.3 (#1614)
Co-authored-by: windy <deretame123@gmail.com>
2024-12-12 19:39:43 +08:00