Commit Graph

1097 Commits

Author SHA1 Message Date
cjsdurj
b87fce9a7f c-api add wave write to buffer. (#1962)
Co-authored-by: jian.chen03 <jian.chen03@transwarp.io>
2025-03-10 17:21:23 +08:00
Fangjun Kuang
6e261ed63f Export gtcrn models to sherpa-onnx (#1975) 2025-03-10 11:31:18 +08:00
Fangjun Kuang
362ddf2c07 Add C++ demo for VAD+non-streaming ASR (#1964) 2025-03-07 11:49:46 +08:00
Fangjun Kuang
1e2328242d Test using sherpa-onnx as a cmake subproject (#1961) 2025-03-06 12:12:56 +08:00
Karel Vesely
7740dbfb96 Ebranchformer (#1951)
* adding ebranchformer encoder

* extend surfaced FeatureExtractorConfig

- so ebranchformer feature extraction can be configured from Python
- the GlobCmvn is not needed, as it is a module in the OnnxEncoder

* clean the code

* Integrating remarks from Fangjun
2025-03-04 19:41:09 +08:00
Fangjun Kuang
209eaaae1d Limit number of tokens per second for whisper. (#1958)
Otherwise, it spends lots of time in the loop if the EOT token
is not predicted.
2025-03-04 15:45:28 +08:00
Fangjun Kuang
49177530ff Update README to include projects that is using sherpa-onnx (#1956) 2025-03-04 14:45:07 +08:00
Fangjun Kuang
c9d6859df7 Add transducer modified_beam_search for RKNN. (#1949) 2025-03-03 13:15:25 +08:00
Fangjun Kuang
d5e7b51af5 Support RKNN for Zipformer CTC models. (#1948) 2025-03-02 21:40:13 +08:00
Fangjun Kuang
dfcbc8d40b Add Kokoro v1.1-zh (#1942) 2025-02-28 15:47:59 +08:00
Fangjun Kuang
f5dfcf8d2f Add Kotlin and Java API for online punctuation models (#1936) 2025-02-27 16:52:36 +08:00
Fangjun Kuang
815ebac8f9 Fix building wheels for Python 3.7 (#1933) 2025-02-27 13:02:46 +08:00
Fangjun Kuang
337d5f7a80 Release v1.10.46 (#1929) 2025-02-26 19:19:33 +08:00
Fangjun Kuang
eebe19997d Build wheels for rknn linux aarch64 (#1928) 2025-02-26 18:58:57 +08:00
Fangjun Kuang
82cb8a5dc3 Minor fixes for rknn (#1925) 2025-02-26 16:26:18 +08:00
Fangjun Kuang
2f9a2b20a1 Fix publishing macos pre-built artifacts (#1922) 2025-02-26 11:52:01 +08:00
xcel3011
b042f5e179 fix: AddPunct panic for Go(#1921) 2025-02-25 18:09:28 +08:00
franck-li
0dcaf3a061 go.mod set to use go 1.17, and use unsafe.Slice to optimize the code (#1920)
Co-authored-by: liyuzhi <liyuzhi@info.easeus.com.cn>
2025-02-25 15:31:15 +08:00
Fangjun Kuang
dc2f7e9f9b Fix publishing linux pre-built artifacts (#1919) 2025-02-25 15:22:50 +08:00
Grey Faulkenberry, MD MPH
70742b69ec Flutter Config toJson/fromJson (#1893) 2025-02-25 14:43:48 +08:00
franck-li
808587accd change [1<<28] to [1<<10], to fix build issues on GOARCH=386 that [1<<28] too large (#1916)
Co-authored-by: liyuzhi <liyuzhi@info.easeus.com.cn>
2025-02-25 09:12:55 +08:00
Fangjun Kuang
4d79e6a007 Add C++ API for streaming zipformer ASR on RK NPU (#1908) 2025-02-24 19:07:37 +08:00
Fangjun Kuang
bafd1103d0 Add onnxruntime 1.18.1 for Linux aarch64 GPU (#1914) 2025-02-24 16:14:14 +08:00
wanghsinche
7774e35749 feat: add mic example for better compatibility (#1909)
Co-authored-by: wanghsinche <wanghsinche>
2025-02-21 21:47:21 +08:00
ivan provalov
94728bfbee Fixing Whisper Model Token Normalization (#1904) 2025-02-21 12:58:01 +08:00
Fangjun Kuang
ed922e61b5 Fix publishing pre-built windows libraries (#1905) 2025-02-21 11:59:27 +08:00
Sheldon Robinson
9c810ce3fe Fix #1901: UnicodeEncodeError running export_bpe_vocab.py (#1902) 2025-02-21 06:29:28 +08:00
ivan provalov
4801094133 JNI Exception Handling (#1452) 2025-02-19 23:02:28 +08:00
張小凡
654d2285eb Reduce vad-whisper-c-api example code. (#1891) 2025-02-18 16:25:43 +08:00
Fangjun Kuang
774cf662a4 Fix generating Chinese lexicon for Kokoro TTS 1.0 (#1888) 2025-02-18 12:51:26 +08:00
luffy
4e83b3473b speaker-identification-with-vad-non-streaming-asr.py Lack of support for sense_voice. (#1884) 2025-02-18 12:34:47 +08:00
Fangjun Kuang
26d5f1fa01 Fix kokoro lexicon. (#1886)
还钱 is mispronounced.
2025-02-18 11:09:16 +08:00
Fangjun Kuang
9711ab2474 Release v1.10.45 (#1881) 2025-02-17 16:20:04 +08:00
Fangjun Kuang
614c51068b Add Pascal API for FireRedAsr AED Model (#1877) (#1880) 2025-02-17 16:06:18 +08:00
Fangjun Kuang
87a968b55d Add Go API for FireRedAsr AED Model (#1879) 2025-02-17 16:04:07 +08:00
Fangjun Kuang
b5d89d7bcb Add Dart API for FireRedAsr AED Model (#1877) 2025-02-17 15:17:08 +08:00
Fangjun Kuang
b03f6e6e8c Add Swift API for FireRedAsr AED Model (#1876) 2025-02-17 15:16:23 +08:00
Fangjun Kuang
d95d43112d Add C# API for FireRedAsr Model (#1875) 2025-02-17 13:32:41 +08:00
Fangjun Kuang
7ad44bc43a Add JavaScript API (WebAssembly) for FireRedAsr model. (#1874) 2025-02-17 12:54:18 +08:00
Fangjun Kuang
050df2a357 Add JavaScript API (node-addon) for FireRedAsr (#1873) 2025-02-17 12:24:52 +08:00
Fangjun Kuang
1d49dd2fb0 Add CXX API for FireRedAsr (#1872) 2025-02-17 11:46:13 +08:00
Fangjun Kuang
193d31333c Add C API for FireRedAsr AED model. (#1871) 2025-02-17 11:22:17 +08:00
Fangjun Kuang
d148860d2c Add Kotlin and Java API for FireRedAsr AED model (#1870) 2025-02-17 10:50:25 +08:00
Fangjun Kuang
316424b382 Add C++ and Python API for FireRedASR AED models (#1867) 2025-02-16 22:45:24 +08:00
Fangjun Kuang
2337169ee2 Export FireRedASR to sherpa-onnx. (#1865) 2025-02-16 16:54:48 +08:00
mah92
60beff1463 fixed typo in RTF calculations (#1861) 2025-02-14 19:29:32 +08:00
ilibx
2dd84b4582 [update] fixed bug: create golang instance succeed while the c struct create failed (#1860) 2025-02-14 14:20:29 +08:00
Fangjun Kuang
3825cf3126 Release v1.10.44 (#1857) 2025-02-13 19:08:13 +08:00
Fangjun Kuang
944400e399 Fix spliting text by languages for kokoro tts. (#1849) 2025-02-13 18:19:34 +08:00
Grey Faulkenberry, MD MPH
115e9c2247 Flutter OnlinePunctuation (#1854) 2025-02-13 15:57:06 +08:00