Commit Graph

1191 Commits

Author SHA1 Message Date
Fangjun Kuang
53518efd2f Add real-time speech recognition example for SenseVoice. (#2197) 2025-05-10 00:50:40 +08:00
Fangjun Kuang
4a833a7547 Fix displaying streaming speech recognition results for Python. (#2196) 2025-05-09 21:48:49 +08:00
Fangjun Kuang
a6834f6556 Show verbose logs in homophone replacer (#2194) 2025-05-09 10:48:30 +08:00
Fangjun Kuang
562a5f7d9b Fix building wheels for macOS (#2192) 2025-05-08 19:15:33 +08:00
Fangjun Kuang
baec2da745 Release v1.11.5 (#2187) 2025-05-08 11:39:16 +08:00
Fangjun Kuang
f9c99032c3 Avoid NaN in feature normalization. (#2186) 2025-05-08 11:22:47 +08:00
Fangjun Kuang
f00066db88 Add C++ runtime for parakeet-tdt-0.6b-v2. (#2181) 2025-05-06 16:59:01 +08:00
Fangjun Kuang
d660d5adea export parakeet-tdt-0.6b-v2 to sherpa-onnx (#2180) 2025-05-06 16:32:59 +08:00
Fangjun Kuang
abc4daa49a Release v1.11.4 (#2169) 2025-05-01 11:36:44 +08:00
Fangjun Kuang
fcb4c4eb2c Add Go API for homophone replacer (#2168) 2025-04-30 23:47:38 +08:00
Fangjun Kuang
85df96d528 Add Dart API for homophone replacer (#2167) 2025-04-30 23:15:28 +08:00
Fangjun Kuang
e537094b07 Add Kotlin and Java API for homophone replacer (#2166)
* Add Kotlin API for homonphone replacer

* Add Java API for homonphone replacer
2025-04-29 22:55:21 +08:00
Fangjun Kuang
50b5329572 Add C# API for homophone replacer (#2165) 2025-04-29 21:36:38 +08:00
Fangjun Kuang
63d01a9534 Add Swift API for homophone replacer. (#2164) 2025-04-29 18:50:41 +08:00
Fangjun Kuang
4a7a974a04 More fix for building without tts (#2162) 2025-04-29 16:31:31 +08:00
Fangjun Kuang
51f8824219 Add homonphone replacer example for Python API. (#2161) 2025-04-29 15:59:34 +08:00
Fangjun Kuang
73dba27145 Fix building without TTS (#2159) 2025-04-29 15:14:50 +08:00
Fangjun Kuang
9d25c90a59 Add JavaScript API (node-addon) for homophone replacer (#2158) 2025-04-28 20:52:42 +08:00
Fangjun Kuang
a0aef1f6cd Add JavaScript API (WASM) for homophone replacer (#2157) 2025-04-28 20:47:49 +08:00
Fangjun Kuang
e51c37eb2f Add C and CXX API for homophone replacer (#2156) 2025-04-27 22:09:13 +08:00
Fangjun Kuang
f64c58342b Support replacing homonphonic phrases (#2153) 2025-04-27 15:31:11 +08:00
Fangjun Kuang
e3280027f9 Support decoding multiple streams in Java API. (#2149) 2025-04-25 11:18:57 +08:00
Fangjun Kuang
48ab90aadc Fix setting OnlineModelConfig in Java API (#2147) 2025-04-24 16:25:36 +08:00
Fangjun Kuang
72742d5472 Fix punctuations for kokoro tts 1.1-zh. (#2146) 2025-04-24 15:08:47 +08:00
Karel Vesely
6a1efd8ac2 online-transducer: reset the encoder toghter with 2 previous output symbols (non-blank) (#2129)
* online-transducer: reset the encoder toghter with 2 previous output symbols (non-blank)

- added `reset_encoder` boolean member into the OnlineRecognizerConfig class
- by default the encoder is not reset

* pybind11, adding empty symbols for disabled modules (tts, diarization)

* reset_encoder, add default value (false) [pybind11]
2025-04-24 08:18:11 +08:00
Fangjun Kuang
921c4370e6 Fix building for open harmonyOS (#2142) 2025-04-21 23:07:16 +08:00
Fangjun Kuang
7cbb1bc433 Upload more onnx ASR models (#2141) 2025-04-21 18:57:41 +08:00
Fangjun Kuang
6cabaa11bf Export kokoro 1.0 int8 models (#2137) 2025-04-20 14:35:02 +08:00
Fangjun Kuang
be0f382a54 Support Giga AM transducer V2 (#2136) 2025-04-20 10:15:20 +08:00
Roman Inflianskas
01110cc22b Add support for GigaAM-CTC-v2 (#2135)
Related to #2098.
2025-04-19 22:13:39 +08:00
Nickolay V. Shmyrev
84ed5d4288 Expose dither in python API (#2127) 2025-04-17 16:47:48 +08:00
Karel Vesely
f3d23aa170 cmake build, configurable from env (#2115)
- make sure the defaults in `cmake/cmake_extension.py` variable
  `extra_cmake_args` can be overriden by `cmake_args` from
  `SHERPA_ONNX_CMAKE_ARGS` env variable
- fix a bug in `sherpa-onnx/csrc/parse-options.cc` which appears
  when using `-DSHERPA_ONNX_ENABLE_CHECK=ON`
- avoid copying binaries when these are disabled
2025-04-16 21:26:54 +08:00
Fangjun Kuang
7a78f2eb7a Fix building for HarmonyOS (#2125) 2025-04-15 18:00:07 +08:00
Fangjun Kuang
e3bce847c0 Support running sherpa-onnx with RK NPU on Android (#2124) 2025-04-15 16:42:28 +08:00
Fangjun Kuang
4ed8ec367e Use manylinux_2_28_x86_64 to build linux gpu for sherpa-onnx (#2123) 2025-04-14 18:53:36 +08:00
Fangjun Kuang
95ba6b4039 Generate subtitles with FireRedAsr models (#2112) 2025-04-10 10:35:24 +08:00
Fangjun Kuang
1c3a383002 Fix a typo in the JNI for Android. (#2108) 2025-04-09 09:02:41 +08:00
HaoWang0101
dcaf9dd208 Comment refinement: Add note about vocoder file for matcha TTS config (#2106) 2025-04-08 12:56:41 +08:00
Askars Salimbajevs
664b461d01 Disable strict hotword matching mode for offline transducer (#1837)
* Disable strict hotword matching mode for offline transducer. Also introduces new variable, so that later this mode can be switched on in the runtime.

* remove strict mode variable

---------

Co-authored-by: Askars Salimbajevs <askars.salimbajevs@tilde.lv>
2025-04-03 22:52:19 +08:00
Fangjun Kuang
31ced58f9a Release v1.11.3 (#2097) 2025-04-03 16:19:01 +08:00
Fangjun Kuang
8137ac9f0b Add Pascal API for Dolphin CTC models (#2096) 2025-04-03 16:00:22 +08:00
Fangjun Kuang
07a5701af6 Add Dart API for Dolphin CTC models (#2095) 2025-04-03 15:59:38 +08:00
Fangjun Kuang
903e825eba Add Javascript (node-addon) API for Dolphin CTC models (#2094) 2025-04-03 15:03:33 +08:00
Fangjun Kuang
639ad1744f Add Javascript (WebAssembly) API for Dolphin CTC models (#2093) 2025-04-03 15:02:06 +08:00
Fangjun Kuang
74f402e490 Add Swift API for Dolphin CTC models (#2091) 2025-04-03 00:03:11 +08:00
Fangjun Kuang
ba7d8b63f0 Add Go API for Dolphin CTC models (#2090) 2025-04-03 00:02:09 +08:00
Fangjun Kuang
2dc0f91904 Add C# API for Dolphin CTC models (#2089) 2025-04-02 23:36:22 +08:00
Askars Salimbajevs
18a6ed5ddc Preserve more context after endpointing in transducer (#2061) 2025-04-02 23:33:47 +08:00
Fangjun Kuang
da4aad1189 Add C and CXX API for Dolphin CTC models (#2088) 2025-04-02 21:54:20 +08:00
Fangjun Kuang
eee5575836 Add Kotlin and Java API for Dolphin CTC models (#2086) 2025-04-02 21:16:14 +08:00