Add online LSTM transducer model (#25)
This commit is contained in:
82
README.md
82
README.md
@@ -2,89 +2,7 @@
|
||||
|
||||
Documentation: <https://k2-fsa.github.io/sherpa/onnx/index.html>
|
||||
|
||||
Try it in colab:
|
||||
[](https://colab.research.google.com/drive/1tmQbdlYeTl_klmtaGiUb7a7ZPz-AkBSH?usp=sharing)
|
||||
|
||||
See <https://github.com/k2-fsa/sherpa>
|
||||
|
||||
This repo uses [onnxruntime](https://github.com/microsoft/onnxruntime) and
|
||||
does not depend on libtorch.
|
||||
|
||||
We provide exported models in onnx format and they can be downloaded using
|
||||
the following links:
|
||||
|
||||
- English: <https://huggingface.co/csukuangfj/icefall-asr-librispeech-pruned-transducer-stateless3-2022-05-13>
|
||||
- Chinese: <https://huggingface.co/luomingshuang/icefall_asr_wenetspeech_pruned_transducer_stateless2>
|
||||
|
||||
**NOTE**: We provide only non-streaming models at present.
|
||||
|
||||
|
||||
**HINT**: The script for exporting the English model can be found at
|
||||
<https://github.com/k2-fsa/icefall/blob/master/egs/librispeech/ASR/pruned_transducer_stateless3/export.py>
|
||||
|
||||
**HINT**: The script for exporting the Chinese model can be found at
|
||||
<https://github.com/k2-fsa/icefall/blob/master/egs/wenetspeech/ASR/pruned_transducer_stateless2/export.py>
|
||||
|
||||
## Build for Linux/macOS
|
||||
|
||||
```bash
|
||||
git clone https://github.com/k2-fsa/sherpa-onnx
|
||||
cd sherpa-onnx
|
||||
mkdir build
|
||||
cd build
|
||||
cmake -DCMAKE_BUILD_TYPE=Release ..
|
||||
make -j6
|
||||
cd ..
|
||||
```
|
||||
|
||||
## Build for Windows
|
||||
|
||||
```bash
|
||||
git clone https://github.com/k2-fsa/sherpa-onnx
|
||||
cd sherpa-onnx
|
||||
mkdir build
|
||||
cd build
|
||||
cmake -DCMAKE_BUILD_TYPE=Release ..
|
||||
cmake --build . --config Release
|
||||
cd ..
|
||||
```
|
||||
|
||||
## Download the pretrained model (English)
|
||||
|
||||
```bash
|
||||
GIT_LFS_SKIP_SMUDGE=1 git clone https://huggingface.co/csukuangfj/icefall-asr-librispeech-pruned-transducer-stateless3-2022-05-13
|
||||
cd icefall-asr-librispeech-pruned-transducer-stateless3-2022-05-13
|
||||
git lfs pull --include "exp/onnx/*.onnx"
|
||||
cd ..
|
||||
|
||||
./build/bin/sherpa-onnx --help
|
||||
|
||||
./build/bin/sherpa-onnx \
|
||||
./icefall-asr-librispeech-pruned-transducer-stateless3-2022-05-13/data/lang_bpe_500/tokens.txt \
|
||||
./icefall-asr-librispeech-pruned-transducer-stateless3-2022-05-13/exp/onnx/encoder.onnx \
|
||||
./icefall-asr-librispeech-pruned-transducer-stateless3-2022-05-13/exp/onnx/decoder.onnx \
|
||||
./icefall-asr-librispeech-pruned-transducer-stateless3-2022-05-13/exp/onnx/joiner.onnx \
|
||||
./icefall-asr-librispeech-pruned-transducer-stateless3-2022-05-13/exp/onnx/joiner_encoder_proj.onnx \
|
||||
./icefall-asr-librispeech-pruned-transducer-stateless3-2022-05-13/exp/onnx/joiner_decoder_proj.onnx \
|
||||
./icefall-asr-librispeech-pruned-transducer-stateless3-2022-05-13/test_wavs/1089-134686-0001.wav
|
||||
```
|
||||
|
||||
## Download the pretrained model (Chinese)
|
||||
|
||||
```bash
|
||||
GIT_LFS_SKIP_SMUDGE=1 git clone https://huggingface.co/luomingshuang/icefall_asr_wenetspeech_pruned_transducer_stateless2
|
||||
cd icefall_asr_wenetspeech_pruned_transducer_stateless2
|
||||
git lfs pull --include "exp/*.onnx"
|
||||
cd ..
|
||||
|
||||
./build/bin/sherpa-onnx --help
|
||||
|
||||
./build/bin/sherpa-onnx \
|
||||
./icefall_asr_wenetspeech_pruned_transducer_stateless2/data/lang_char/tokens.txt \
|
||||
./icefall_asr_wenetspeech_pruned_transducer_stateless2/exp/encoder-epoch-10-avg-2.onnx \
|
||||
./icefall_asr_wenetspeech_pruned_transducer_stateless2/exp/decoder-epoch-10-avg-2.onnx \
|
||||
./icefall_asr_wenetspeech_pruned_transducer_stateless2/exp/joiner-epoch-10-avg-2.onnx \
|
||||
./icefall_asr_wenetspeech_pruned_transducer_stateless2/exp/joiner_encoder_proj-epoch-10-avg-2.onnx \
|
||||
./icefall_asr_wenetspeech_pruned_transducer_stateless2/exp/joiner_decoder_proj-epoch-10-avg-2.onnx \
|
||||
./icefall_asr_wenetspeech_pruned_transducer_stateless2/test_wavs/DEV_T0000000000.wav
|
||||
```
|
||||
|
||||
Reference in New Issue
Block a user