This repository has been archived on 2025-08-26. You can view files and clone it, but cannot push or open issues or pull requests.
Files

5.7 KiB

Introduction

This folder contains scripts to convert ASR models for mobile platforms supporting only batch size equal to 1.

The advantage of fixing the batch size to 1 is that it provides more opportunities for model optimization and quantization.

To give you a concrete example, for the following model https://k2-fsa.github.io/sherpa/onnx/pretrained_models/online-transducer/zipformer-transducer-models.html#csukuangfj-sherpa-onnx-streaming-zipformer-bilingual-zh-en-2023-02-20-bilingual-chinese-english

encoder-epoch-99-avg-1.onnx encoder-epoch-99-avg-1.int8.onnx
Dynamic batch size 315 MB 174 MB
Batch size fixed to 1 242 MB 100 MB

The following colab notebook provides examples to use the above two models.

WARNING: Tested with onnxruntime==1.16.3 onnx==1.15.0.

pip install onnxruntime==1.16.3 onnx==1.15.0

More examples

sherpa-onnx-streaming-zipformer-korean-2024-06-16

encoder-epoch-99-avg-1.onnx encoder-epoch-99-avg-1.int8.onnx
Dynamic batch size 279 MB 122 MB
Batch size fixed to 1 264 MB 107 MB

sherpa-onnx-streaming-zipformer-en-20M-2023-02-17

encoder-epoch-99-avg-1.onnx encoder-epoch-99-avg-1.int8.onnx
Dynamic batch size 85 MB 41 MB
Batch size fixed to 1 75 MB 32 MB

sherpa-onnx-streaming-zipformer-multi-zh-hans-2023-12-12

encoder-epoch-20-avg-1-chunk-16-left-128.onnx encoder-epoch-20-avg-1-chunk-16-left-128.int8.onnx
Dynamic batch size 249 MB 67 MB
Batch size fixed to 1 247 MB 65 MB

icefall-asr-zipformer-streaming-wenetspeech-20230615

encoder-epoch-12-avg-4-chunk-16-left-128.onnx encoder-epoch-12-avg-4-chunk-16-left-128.int8.onnx
Dynamic batch size 250 MB 68 MB
Batch size fixed to 1 247 MB 65 MB

sherpa-onnx-streaming-zipformer-en-2023-06-26

encoder-epoch-99-avg-1-chunk-16-left-128.onnx encoder-epoch-99-avg-1-chunk-16-left-128.int8.onnx
Dynamic batch size 250 MB 68 MB
Batch size fixed to 1 247 MB 65 MB

sherpa-onnx-streaming-zipformer-en-2023-06-21

encoder-epoch-99-avg-1.onnx encoder-epoch-99-avg-1.int8.onnx
Dynamic batch size 338 MB 180 MB
Batch size fixed to 1 264 MB 107 MB

sherpa-onnx-streaming-zipformer-en-2023-02-21

encoder-epoch-99-avg-1.onnx encoder-epoch-99-avg-1.int8.onnx
Dynamic batch size 279 MB 122 MB
Batch size fixed to 1 264 MB 107 MB

sherpa-onnx-streaming-zipformer-fr-2023-04-14

encoder-epoch-29-avg-9-with-averaged-model.onnx encoder-epoch-29-avg-9-with-averaged-model.int8.onnx
Dynamic batch size 279 MB 121 MB
Batch size fixed to 1 264 MB 107 MB

sherpa-onnx-streaming-zipformer-small-bilingual-zh-en-2023-02-16

encoder-epoch-99-avg-1.onnx encoder-epoch-99-avg-1.int8.onnx
Dynamic batch size 85 MB 41 MB
Batch size fixed to 1 75 MB 32 MB

sherpa-onnx-streaming-zipformer-zh-14M-2023-02-23

encoder-epoch-99-avg-1.onnx encoder-epoch-99-avg-1.int8.onnx
Dynamic batch size 40 MB 21 MB
Batch size fixed to 1 33 MB 15 MB

sherpa-onnx-kws-zipformer-wenetspeech-3.3M-2024-01-01

encoder-epoch-12-avg-2-chunk-16-left-64.onnx encoder-epoch-12-avg-2-chunk-16-left-64.int8.onnx
Dynamic batch size 12 MB 4.6 MB
Batch size fixed to 1 11 MB 3.9 MB

sherpa-onnx-kws-zipformer-gigaspeech-3.3M-2024-01-01

encoder-epoch-12-avg-2-chunk-16-left-64.onnx encoder-epoch-12-avg-2-chunk-16-left-64.int8.onnx
Dynamic batch size 12 MB 4.6 MB
Batch size fixed to 1 11 MB 3.9 MB