Refactor exporting NeMo models (#2362)
Refactors and extends model export support to include new NeMo Parakeet TDT int8 variants for English and Japanese, updating the Kotlin API, export scripts, test runners, and CI workflows.

- Added support for two new int8 model types in OfflineRecognizer.kt.
- Enhanced Python export scripts to perform dynamic quantization and metadata injection.
- Updated shell scripts and GitHub workflows to package, test, and publish int8 model artifacts.
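
For readers unfamiliar with the export step, the following is a minimal sketch of what "dynamic quantization and metadata injection" can look like with `onnxruntime` and `onnx`. It is not the code from this commit: the function names, the choice of `QuantType.QUInt8`, and the metadata keys in the usage note are assumptions made for illustration.

```python
def stringify_metadata(meta: dict) -> dict:
    """ONNX metadata_props holds string pairs only, so coerce keys and values."""
    return {str(k): str(v) for k, v in meta.items()}


def quantize_and_tag(fp32_path: str, int8_path: str, meta: dict) -> None:
    """Hypothetical helper: write an int8 copy of a model, then attach metadata."""
    # Imported lazily so stringify_metadata works without onnx installed.
    import onnx
    from onnxruntime.quantization import QuantType, quantize_dynamic

    # Dynamic quantization: weights are stored as 8-bit integers, while
    # activation scales are computed at inference time.
    quantize_dynamic(
        model_input=fp32_path,
        model_output=int8_path,
        weight_type=QuantType.QUInt8,  # assumption: the commit may use another type
    )

    # Metadata injection: append key/value pairs to the quantized model
    # so a runtime can read them back without extra config files.
    model = onnx.load(int8_path)
    for key, value in stringify_metadata(meta).items():
        entry = model.metadata_props.add()
        entry.key = key
        entry.value = value
    onnx.save(model, int8_path)
```

A call might look like `quantize_and_tag("encoder.onnx", "encoder.int8.onnx", {"vocab_size": 1024})`, where both file names and the `vocab_size` key are placeholders rather than names taken from the export scripts.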
@@ -23,5 +23,6 @@ This folder contains scripts for exporting models from
- https://catalog.ngc.nvidia.com/orgs/nvidia/teams/nemo/models/stt_multilingual_fastconformer_hybrid_large_pc
- https://catalog.ngc.nvidia.com/orgs/nvidia/teams/nemo/models/parakeet-tdt_ctc-110m
- https://huggingface.co/nvidia/parakeet-tdt_ctc-0.6b-ja

to `sherpa-onnx`.