Commit Graph

954 Commits

Author SHA1 Message Date
Fangjun Kuang
2c2926af7d Add C++ runtime for Matcha-TTS (#1627) 2024-12-31 12:44:14 +08:00
yujinqiu
5c2cc48f50 Add swift online punctuation (#1661) 2024-12-31 11:26:32 +08:00
Fangjun Kuang
49154c957b Add Go API for Keyword spotting (#1662) 2024-12-31 11:25:32 +08:00
Fangjun Kuang
38d64a6d81 Fix building macOS libs (#1656) 2024-12-27 18:15:41 +08:00
Fangjun Kuang
268d562135 Add TeleSpeech CTC to non_streaming_server.py (#1649) 2024-12-26 11:11:03 +08:00
Fangjun Kuang
b6f0f5fc2e Support removing invalid utf-8 sequences. (#1648) 2024-12-25 19:32:13 +08:00
Fangjun Kuang
08d771337b Add a byte-level BPE Chinese+English non-streaming zipformer model (#1645) 2024-12-24 16:56:49 +08:00
Fangjun Kuang
fe3265aa25 Add new tts models for Latvian and Persian+English (#1644) 2024-12-24 15:16:02 +08:00
Fangjun Kuang
30a17b96fa Release v1.10.36 (#1643) 2024-12-24 11:50:58 +08:00
Fangjun Kuang
d00d1c6298 Fix GitHub actions. (#1642) 2024-12-24 11:34:35 +08:00
Roman Inflianskas
6613828d86 SherpaOnnxVadAsr: Offload runSecondPass to background thread for improved real-time audio processing (#1638)
This change ensures that the main audio processing loop is not blocked by
long-running operations in `runSecondPass`, improving responsiveness and
reducing the risk of missing parts of input speech.
2024-12-24 11:07:32 +08:00
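
The idea behind the commit above can be illustrated with a minimal sketch. It is not the actual change (SherpaOnnxVadAsr is an Android app, and this sketch uses Python for brevity). The names `run_second_pass` and `process_audio` are hypothetical stand-ins: the loop only enqueues segments, and a single worker thread performs the slow decoding.

```python
import queue
import threading
import time


def run_second_pass(segment):
    """Hypothetical stand-in for a slow second-pass decoder."""
    time.sleep(0.01)  # simulate a long-running operation
    return f"decoded:{segment}"


def process_audio(segments):
    """Hand segments to a worker thread so the loop never waits on decoding."""
    pending = queue.Queue()
    results = []

    def worker():
        while True:
            segment = pending.get()
            if segment is None:  # sentinel: no more segments
                break
            results.append(run_second_pass(segment))

    thread = threading.Thread(target=worker, daemon=True)
    thread.start()

    for segment in segments:
        pending.put(segment)  # returns immediately; decoding happens elsewhere

    pending.put(None)  # tell the worker to stop
    thread.join()  # wait for the remaining segments to be decoded
    return results
```

Because there is one worker and the queue is FIFO, results keep the order in which segments were enqueued.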
Humorousf
a3d63130ba Update README.md (#1640) 2024-12-23 15:39:00 +08:00
thewh1teagle
4681bdfd8d feat: enable c api for android ci (#1635) 2024-12-20 22:42:31 +08:00
Fangjun Kuang
b76cd9033a Support decoding with byte-level BPE (bbpe) models. (#1633) 2024-12-20 19:21:32 +08:00
thewh1teagle
7192e576a9 feat: add checksum action (#1632) 2024-12-20 15:07:45 +08:00
Fangjun Kuang
86381e12ba Update readme to include jetson orin nx and nano b01 (#1631) 2024-12-19 18:52:42 +08:00
Fangjun Kuang
70ee779410 Support using onnxruntime 1.16.0 with CUDA 11.4 on Jetson Orin NX (Linux arm64 GPU). (#1630)
* Support using onnxruntime 1.16.0 with CUDA 11.4 on Jetson Orin NX.

The pre-built onnxruntime libs are provided by the community
using the following command:

```bash
./build.sh --build_shared_lib --config Release --update \
  --build --parallel --use_cuda \
  --cuda_home /usr/local/cuda \
  --cudnn_home /usr/lib/aarch64-linux-gnu 2>&1 | tee my-log.txt
```

See also https://github.com/microsoft/onnxruntime/discussions/11226

---

Info about the board:

```
Model: NVIDIA Orin NX T801-16GB - Jetpack 5.1.4 [L4T 35.6.0]
```

```
nvidia@nvidia-desktop:~/Downloads$ head -n 1 /etc/nv_tegra_release
# R35 (release), REVISION: 6.0, GCID: 37391689, BOARD: t186ref, EABI: aarch64, DATE: Wed Aug 28 09:12:27 UTC 2024

nvidia@nvidia-desktop:~/Downloads$ uname -r
5.10.216-tegra

nvidia@nvidia-desktop:~/Downloads$ lsb_release -i -r
Distributor ID:	Ubuntu
Release:	20.04

nvidia@nvidia-desktop:~/Downloads$ nvcc -V
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2022 NVIDIA Corporation
Built on Wed_Sep_21_10:43:33_PDT_2022
Cuda compilation tools, release 11.8, V11.8.89
Build cuda_11.8.r11.8/compiler.31833905_0

nvidia@nvidia-desktop:~/Downloads$ dpkg -l libcudnn8
Desired=Unknown/Install/Remove/Purge/Hold
| Status=Not/Inst/Conf-files/Unpacked/halF-conf/Half-inst/trig-aWait/Trig-pend
|/ Err?=(none)/Reinst-required (Status,Err: uppercase=bad)
||/ Name           Version              Architecture Description
+++-==============-====================-============-=================================
ii  libcudnn8      8.6.0.166-1+cuda11.4 arm64        cuDNN runtime libraries

nvidia@nvidia-desktop:~/Downloads$ dpkg -l tensorrt
Desired=Unknown/Install/Remove/Purge/Hold
| Status=Not/Inst/Conf-files/Unpacked/halF-conf/Half-inst/trig-aWait/Trig-pend
|/ Err?=(none)/Reinst-required (Status,Err: uppercase=bad)
||/ Name           Version            Architecture Description
+++-==============-==================-============-=================================
ii  tensorrt       8.5.2.2-1+cuda11.4 arm64        Meta package for TensorRT
```
2024-12-19 18:19:53 +08:00
sawich
5cc60de5c9 Rename maxNumStences to maxNumSentences (#1625) 2024-12-16 22:37:59 +08:00
Fangjun Kuang
ed8d8e41f9 Update readme to include Open-LLM-VTuber (#1622) 2024-12-16 10:47:07 +08:00
Fangjun Kuang
e639c70d78 Support linking onnxruntime statically for Android (#1619) 2024-12-14 09:53:44 +08:00
Fangjun Kuang
efb505f578 Update AAR version in Android Java demo (#1618) 2024-12-12 20:51:57 +08:00
Fangjun Kuang
e54c1f4533 Release v1.10.35 (#1617) 2024-12-12 20:07:47 +08:00
windy
0f4b1f41e2 🔧 build(portaudio-go): Fixed version 1.0.3 (#1614)
Co-authored-by: windy <deretame123@gmail.com>
2024-12-12 19:39:43 +08:00
Fangjun Kuang
be87f866f3 Use aar in Android Java demo. (#1616) 2024-12-12 18:26:54 +08:00
Fangjun Kuang
4dc4f1a708 Provide sherpa-onnx.aar for Android (#1615) 2024-12-12 16:59:00 +08:00
Fangjun Kuang
9d4659fd29 Add missing changes about speaker identification demo for HarmonyOS (#1612) 2024-12-11 12:01:13 +08:00
Fangjun Kuang
e011e849d4 Release v1.10.34 (#1611) 2024-12-10 20:29:33 +08:00
Fangjun Kuang
914cbad6a3 Add speaker diarization demo for HarmonyOS (#1610) 2024-12-10 20:11:44 +08:00
Fangjun Kuang
1bae4085ca Add speaker diarization API for HarmonyOS. (#1609) 2024-12-10 16:03:03 +08:00
Fangjun Kuang
14944d8c81 Add speaker identification demo for HarmonyOS (#1608) 2024-12-10 14:50:13 +08:00
Fangjun Kuang
314545f938 Add speaker identification APIs for HarmonyOS (#1607)
* Add speaker embedding extractor API for HarmonyOS

* Add ArkTS API for speaker identification
2024-12-09 19:23:18 +08:00
Fangjun Kuang
a743a4400f Add on-device real-time ASR demo for HarmonyOS (#1606) 2024-12-09 16:40:15 +08:00
Fangjun Kuang
91a43cccff Update doc links for HarmonyOS (#1601) 2024-12-06 17:38:40 +08:00
Fangjun Kuang
84821b1f99 Fix building node-addon package (#1598) 2024-12-06 10:11:18 +08:00
Fangjun Kuang
9352ccf7d5 Release v1.10.33 (#1591) 2024-12-04 14:51:46 +08:00
Fangjun Kuang
74a8735f7a Add on-device text-to-speech (TTS) demo for HarmonyOS (#1590) 2024-12-04 14:27:12 +08:00
goddamnVincent
47a2dd4cf8 'update20241203' (#1589)
Add `--modeling-unit` and `--bpe-vocab` options to /sherpa-onnx/python-api-examples/streaming_server.py so that they can be specified on the command line.
2024-12-04 09:22:24 +08:00
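
As a rough illustration of the commit above, the two options could be exposed with `argparse` along these lines. Only the flag names come from the commit message. The defaults, help texts, the `build_parser` helper, and the example values are assumptions, not the contents of `streaming_server.py`.

```python
import argparse


def build_parser():
    """Build a parser exposing the two options named in the commit message."""
    parser = argparse.ArgumentParser()
    parser.add_argument(
        "--modeling-unit",
        type=str,
        default="",  # assumed default
        help="Modeling unit of the model (assumed help text).",
    )
    parser.add_argument(
        "--bpe-vocab",
        type=str,
        default="",  # assumed default
        help="Path to the BPE vocabulary file (assumed help text).",
    )
    return parser


# Example values are illustrative only.
args = build_parser().parse_args(["--modeling-unit", "bpe", "--bpe-vocab", "bpe.vocab"])
```

`argparse` maps the hyphenated flags to the attributes `args.modeling_unit` and `args.bpe_vocab`.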
JiayuXu
0d6bf52844 fix: support both old and new websockets request headers format (#1588)
Co-authored-by: xujiayu <xujiayu@kaihong.com>
2024-12-03 17:22:12 +08:00
Fangjun Kuang
dc3287f3a8 Add HarmonyOS support for text-to-speech. (#1584) 2024-12-01 21:43:34 +08:00
Fangjun Kuang
a3d6e1acc7 Fix getting microphone permission for HarmonyOS VAD+ASR example (#1582) 2024-11-30 16:24:22 +08:00
Fangjun Kuang
c9d3b6cd8c Add microphone demo about VAD+ASR for HarmonyOS (#1581) 2024-11-30 15:23:45 +08:00
Fangjun Kuang
299f2392e2 Add CI to build HAPs for HarmonyOS (#1578) 2024-11-29 21:13:01 +08:00
Fangjun Kuang
be159f943e Fix publishing har packages for HarmonyOS (#1576) 2024-11-29 12:20:12 +08:00
Fangjun Kuang
f3f8961462 Add VAD+ASR demo for HarmonyOS (#1573) 2024-11-28 22:59:56 +08:00
Fangjun Kuang
315d8e2a47 Publish sherpa_onnx.har for HarmonyOS (#1572) 2024-11-28 17:30:16 +08:00
Fangjun Kuang
109fb799ca fix building for Android (#1568) 2024-11-27 10:36:16 +08:00
Fangjun Kuang
2101227269 Add streaming ASR support for HarmonyOS. (#1565) 2024-11-26 18:36:56 +08:00
Fangjun Kuang
298b6b6fda Add non-streaming ASR support for HarmonyOS. (#1564) 2024-11-26 16:38:35 +08:00
Fangjun Kuang
a4b79f077c Fix flutter ios (#1563) 2024-11-26 13:45:15 +08:00
Fangjun Kuang
31d6206fde HarmonyOS support for VAD. (#1561) 2024-11-24 16:29:24 +08:00