Commit Graph

546 Commits

Author SHA1 Message Date
Fangjun Kuang
fd9a687ec2 Add Pascal/Go/C#/Dart API for NeMo Canary ASR models (#2367)
Add support for the new NeMo Canary ASR model across multiple language bindings by introducing a Canary model configuration and setter method on the offline recognizer.

- Define Canary model config in Pascal, Go, C#, Dart and update converter functions
- Add SetConfig API for offline recognizer (Pascal, Go, C#, Dart)
- Extend CI/workflows and example scripts to test non-streaming Canary decoding
2025-07-10 14:53:33 +08:00
Askars Salimbajevs
f0960342ad Add LODR support to online and offline recognizers (#2026)
This PR integrates LODR (Level-Ordered Deterministic Rescoring) support from Icefall into both online and offline recognizers, enabling LODR for LM shallow fusion and LM rescore.

- Extended OnlineLMConfig and OfflineLMConfig to include lodr_fst, lodr_scale, and lodr_backoff_id.
- Implemented LodrFst and LodrStateCost classes and wired them into RNN LM scoring in both online and offline code paths.
- Updated Python bindings, CLI entry points, examples, and CI test scripts to accept and exercise the new LODR options.
2025-07-09 16:23:46 +08:00
Fangjun Kuang
6122a678f5 Refactor exporting NeMo models (#2362)
Refactors and extends model export support to include new NeMo Parakeet TDT int8 variants for English and Japanese, updating the Kotlin API, export scripts, test runners, and CI workflows.

- Added support for two new int8 model types in OfflineRecognizer.kt.
- Enhanced Python export scripts to perform dynamic quantization and metadata injection.
- Updated shell scripts and GitHub workflows to package, test, and publish int8 model artifacts.
2025-07-09 16:02:12 +08:00
Fangjun Kuang
831aff187d Upload fp16 onnx model files for FireRedASR (#2360) 2025-07-08 13:46:03 +08:00
Fangjun Kuang
103e93d9f6 Add Java and Kotlin API for NeMo Canary models (#2359)
Add support for the NeMo Canary model in both Java and Kotlin APIs, wiring it through
JNI and updating examples and CI.

- Introduce OfflineCanaryModelConfig in Kotlin and Java with builder patterns
- Extend OfflineRecognizer to accept and apply the new canary config via setConfig
- Update JNI binding (GetOfflineConfig) and getOfflineModelConfig mapping (type 32), 
   plus examples and CI workflows
2025-07-08 13:45:26 +08:00
Fangjun Kuang
df4615ca1d Add C/CXX/JavaScript API for NeMo Canary models (#2357)
This PR introduces support for NeMo Canary models across C, C++, and JavaScript APIs 
by adding new Canary configuration structures, updating bindings, extending examples,
and enhancing CI workflows.

- Add OfflineCanaryModelConfig to all language bindings (C, C++, JS, ETS).
- Implement SetConfig methods and NAPI wrappers for updating recognizer config at runtime.
- Update examples and CI scripts to demonstrate and test NeMo Canary model usage.
2025-07-07 23:38:04 +08:00
Fangjun Kuang
0e738c356c Add C++ runtime and Python API for NeMo Canary models (#2352) 2025-07-07 17:03:49 +08:00
Fangjun Kuang
fce481c125 Add meta data to NeMo canary ONNX models (#2351) 2025-07-07 00:12:20 +08:00
Fangjun Kuang
d70b789582 Fix testing dart packages (#2345) 2025-07-04 22:27:24 +08:00
Fangjun Kuang
53a3ad366b Support linux aarch64 for Dart and Flutter (#2342)
Adds support for building and packaging Linux AArch64 (arm64) artifacts alongside x64 for Dart/Flutter plugins.

- Detects host architecture in CMake and adjusts library paths
- Extends test workflows to run on an ARM runner and handle linux-aarch64 paths
- Splits release pipeline into separate x64 and aarch64 build/package jobs
2025-07-04 19:33:48 +08:00
Fangjun Kuang
3bf986d08d Support non-streaming zipformer CTC ASR models (#2340)
This PR adds support for non-streaming Zipformer CTC ASR models across 
multiple language bindings, WebAssembly, examples, and CI workflows.

- Introduces a new OfflineZipformerCtcModelConfig in C/C++, Python, Swift, Java, Kotlin, Go, Dart, Pascal, and C# APIs
- Updates initialization, freeing, and recognition logic to include Zipformer CTC in WASM and Node.js
- Adds example scripts and CI steps for downloading, building, and running Zipformer CTC models

Model doc is available at
https://k2-fsa.github.io/sherpa/onnx/pretrained_models/offline-ctc/icefall/zipformer.html
2025-07-04 15:57:07 +08:00
Fangjun Kuang
ea3e583ac9 Fix static link without tts (#2328) 2025-06-30 14:21:01 +08:00
Fangjun Kuang
bda427f4b2 Add API to get version information (#2309) 2025-06-25 00:22:21 +08:00
Fangjun Kuang
6982b86c66 Support extra languages in multi-lang kokoro tts (#2303) 2025-06-20 11:22:52 +08:00
Fangjun Kuang
59d118c256 Refactor kokoro export (#2302)
- generate samples for https://k2-fsa.github.io/sherpa/onnx/tts/all/
- provide int8 model for kokoro v0.19 kokoro-int8-en-v0_19.tar.bz2
2025-06-18 20:30:10 +08:00
Fangjun Kuang
2913cce77c Add scripts for exporting Piper TTS models to sherpa-onnx (#2299) 2025-06-17 14:23:39 +08:00
Fangjun Kuang
d57e4f84de Add Python API for source separation (#2283) 2025-06-05 20:44:26 +08:00
Fangjun Kuang
6f0fac2064 Add jar for Java 24. (#2280) 2025-06-04 11:08:45 +08:00
Fangjun Kuang
db632dacf3 Fix CI for windows (#2279) 2025-06-04 10:35:48 +08:00
Fangjun Kuang
9539af5f5c Fix 32-bit arm CI (#2276) 2025-06-03 21:02:33 +08:00
Fangjun Kuang
6cb44d44e9 Export nvidia/canary-180m-flash to sherpa-onnx (#2272) 2025-06-02 22:28:15 +08:00
Fangjun Kuang
2b2788332e Add C++ support for UVR models (#2269) 2025-06-01 17:22:08 +08:00
Fangjun Kuang
921f0f40cb Add UVR models for source separation. (#2266) 2025-05-31 13:31:31 +08:00
Fangjun Kuang
d8b5a58898 repair rknn wheels (#2257) 2025-05-28 17:39:55 +08:00
Fangjun Kuang
16a3449945 Build APK with replace.fst (#2254) 2025-05-28 12:19:29 +08:00
Fangjun Kuang
716ba8317b Add C++ runtime for spleeter about source separation (#2242) 2025-05-23 22:30:57 +08:00
Fangjun Kuang
55a44793e6 Export spleeter model to onnx for source separation (#2237) 2025-05-22 15:09:38 +08:00
Fangjun Kuang
901b3f0150 Fix publishing binaries for RKNN (#2234) 2025-05-21 11:59:41 +08:00
Fangjun Kuang
5113094352 Fix building RKNN wheels (#2233) 2025-05-21 11:15:18 +08:00
Fangjun Kuang
d8bb20710d Add script to build APK for simulated-streaming-asr. (#2220) 2025-05-15 15:40:22 +08:00
Fangjun Kuang
99defc5b90 Add nodejs example for parakeet-tdt-0.6b-v2. (#2219) 2025-05-15 11:27:22 +08:00
Fangjun Kuang
fdda292d5a Add alsa-based streaming ASR example for sense voice. (#2207) 2025-05-13 19:08:09 +08:00
Fangjun Kuang
a6834f6556 Show verbose logs in homophone replacer (#2194) 2025-05-09 10:48:30 +08:00
Fangjun Kuang
562a5f7d9b Fix building wheels for macOS (#2192) 2025-05-08 19:15:33 +08:00
Fangjun Kuang
d660d5adea export parakeet-tdt-0.6b-v2 to sherpa-onnx (#2180) 2025-05-06 16:32:59 +08:00
Fangjun Kuang
fcb4c4eb2c Add Go API for homophone replacer (#2168) 2025-04-30 23:47:38 +08:00
Fangjun Kuang
85df96d528 Add Dart API for homophone replacer (#2167) 2025-04-30 23:15:28 +08:00
Fangjun Kuang
e537094b07 Add Kotlin and Java API for homophone replacer (#2166)
* Add Kotlin API for homonphone replacer

* Add Java API for homonphone replacer
2025-04-29 22:55:21 +08:00
Fangjun Kuang
63d01a9534 Add Swift API for homophone replacer. (#2164) 2025-04-29 18:50:41 +08:00
Fangjun Kuang
51f8824219 Add homonphone replacer example for Python API. (#2161) 2025-04-29 15:59:34 +08:00
Fangjun Kuang
9d25c90a59 Add JavaScript API (node-addon) for homophone replacer (#2158) 2025-04-28 20:52:42 +08:00
Fangjun Kuang
a0aef1f6cd Add JavaScript API (WASM) for homophone replacer (#2157) 2025-04-28 20:47:49 +08:00
Fangjun Kuang
e51c37eb2f Add C and CXX API for homophone replacer (#2156) 2025-04-27 22:09:13 +08:00
Fangjun Kuang
f64c58342b Support replacing homonphonic phrases (#2153) 2025-04-27 15:31:11 +08:00
Fangjun Kuang
e3280027f9 Support decoding multiple streams in Java API. (#2149) 2025-04-25 11:18:57 +08:00
Fangjun Kuang
7cbb1bc433 Upload more onnx ASR models (#2141) 2025-04-21 18:57:41 +08:00
Fangjun Kuang
6cabaa11bf Export kokoro 1.0 int8 models (#2137) 2025-04-20 14:35:02 +08:00
Fangjun Kuang
be0f382a54 Support Giga AM transducer V2 (#2136) 2025-04-20 10:15:20 +08:00
Roman Inflianskas
01110cc22b Add support for GigaAM-CTC-v2 (#2135)
Related to #2098.
2025-04-19 22:13:39 +08:00
Fangjun Kuang
e3bce847c0 Support running sherpa-onnx with RK NPU on Android (#2124) 2025-04-15 16:42:28 +08:00