EngineX-Kunlunxin/enginex-r200-asr

Latest commit: Lu Xinlong, d92a97bd09, "update base image", 2025-08-22 09:57:42 +08:00

| Path | Last commit | Date |
| --- | --- | --- |
| replaced_files/klx_r200_8f | initial commit | 2025-08-19 16:23:51 +08:00 |
| utils | initial commit | 2025-08-19 16:23:51 +08:00 |
| Dockerfile.funasr-kunlun-r200-8f | update base image | 2025-08-22 09:57:42 +08:00 |
| download_nltk_model.py | initial commit | 2025-08-19 16:23:51 +08:00 |
| nltk_data.tar.gz | initial commit | 2025-08-19 16:23:51 +08:00 |
| README.md | initial commit | 2025-08-19 16:23:51 +08:00 |
| requirements.txt | initial commit | 2025-08-19 16:23:51 +08:00 |
| start_funasr.sh | initial commit | 2025-08-19 16:23:51 +08:00 |
| test_funasr.py | initial commit | 2025-08-19 16:23:51 +08:00 |
| xpytorch-cp38-torch201-ubuntu2004-x64-socket.run | initial commit | 2025-08-19 16:23:51 +08:00 |

README.md

Kunlunxin R200-8F FunASR

Building the image

docker build -f ./Dockerfile.funasr-kunlun-r200-8f -t <your_image> .

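Usage

Quick image test

FunASR must be tested inside a container started from the image built above. The steps are:

1. Place the WAV audio file to test and its ground-truth file (a text file containing the correct transcript of the audio) in the current directory, and have the path to the corresponding ASR model ready.
2. Run the quick test command: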

docker run -it \
    --security-opt=seccomp=unconfined --cap-add=SYS_PTRACE --cap-add=SYS_ADMIN --device /dev/fuse \
    --shm-size=32g --ulimit=memlock=-1 --ulimit=nofile=120000 --ulimit=stack=67108864 \
    --device=/dev/xpu0:/dev/xpu0 --device=/dev/xpuctrl:/dev/xpuctrl \
    -v "$PWD":/tmp/workspace \
    -v <host_model_dir>:<model_dir> \
    -e MODEL_DIR=<model_dir> \
    -e TEST_FILE=<wav_file> \
    -e ANSWER_FILE=<ground_truth> \
    --cpus=4 --memory=16g \
    <your_image>

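Customized manual run

You can enter the container interactively with a docker run command similar to the one above. The main test code is test_funasr.py; edit it to change the model path, the test file path, and the logic that calls FunASR.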
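The contents of test_funasr.py are not shown on this page, so the following is only a minimal sketch of what a customized script might look like. It assumes the script reads the `MODEL_DIR`, `TEST_FILE`, and `ANSWER_FILE` environment variables passed by the docker run command above, and that it calls FunASR through `funasr.AutoModel` and its `generate` method. The names `RunSettings`, `load_settings`, and `transcribe` are hypothetical, and device selection is left at the library default, which may differ from what the repository's own script does.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass
class RunSettings:
    """Paths for one test run (hypothetical container, not from the repo)."""
    model_dir: str
    test_file: str
    answer_file: str


def load_settings(env: Optional[Mapping[str, str]] = None) -> RunSettings:
    """Read the three variables that the docker run command sets with -e."""
    env = os.environ if env is None else env
    names = ("MODEL_DIR", "TEST_FILE", "ANSWER_FILE")
    missing = [name for name in names if not env.get(name)]
    if missing:
        raise KeyError("missing environment variables: " + ", ".join(missing))
    return RunSettings(env["MODEL_DIR"], env["TEST_FILE"], env["ANSWER_FILE"])


def transcribe(settings: RunSettings) -> str:
    """Run recognition on the test file and return the recognized text.

    Assumption: generate() returns a list of dicts with a "text" key.
    """
    # Imported lazily so the settings helper works without FunASR installed.
    from funasr import AutoModel

    model = AutoModel(model=settings.model_dir)
    results = model.generate(input=settings.test_file)
    return results[0]["text"]
```

Inside the container, `transcribe(load_settings())` would return the transcript for the configured audio file, which can then be compared with the text in the ground-truth file.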

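Model adaptation status on Kunlunxin R200-8F

We adapted every major model family in FunASR for the Kunlunxin R200-8F. To test, we ran speech recognition on the same long audio clip on an NVIDIA A100 and on a Kunlunxin R200-8F accelerator card, recording the runtime and the 1-CER score (1 minus the character error rate). Each run used a single card.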
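For readers unfamiliar with the 1-CER score, here is a minimal sketch of how it can be computed: the character-level edit distance between the ground-truth text and the recognized text, divided by the length of the ground truth, subtracted from 1. The function names are hypothetical, and stripping only whitespace before comparison is an assumption; the repository's own script may normalize text differently (for example, by removing punctuation).

```python
def edit_distance(ref: str, hyp: str) -> int:
    """Levenshtein distance between two character sequences."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # character missing from the hypothesis
                curr[j - 1] + 1,     # extra character in the hypothesis
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1]


def one_minus_cer(reference: str, hypothesis: str) -> float:
    """Return 1 - CER, where CER = edit distance / reference length."""
    # Assumption: only whitespace is stripped before comparing.
    ref = "".join(reference.split())
    hyp = "".join(hypothesis.split())
    if not ref:
        raise ValueError("reference text is empty")
    return 1.0 - edit_distance(ref, hyp) / len(ref)
```

Under this definition, a hypothesis with one wrong character against a six-character reference scores 1 - 1/6. The score can be negative when the hypothesis contains many extra characters.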

| Model family | Model URL | A100 runtime (s) | Kunlunxin R200-8F runtime (s) | A100 1-CER | Kunlunxin R200-8F 1-CER | Notes |
| --- | --- | --- | --- | --- | --- | --- |
| sense_voice | https://www.modelscope.cn/models/iic/SenseVoiceSmall | 1.3792 | 2.2876 | 0.980033 | 0.980033 | |
| whisper | https://www.modelscope.cn/models/iic/Whisper-large-v3 | 23.7654 | 62.3298 | 0.910150 | 0.910150 | |
| paraformer | https://modelscope.cn/models/iic/speech_paraformer-large-vad-punc_asr_nat-zh-cn-16k-common-vocab8404-pytorch | 4.1229 | 7.7168 | 0.955075 | 0.955075 | Adapting this model required modifying some FunASR source code |
| conformer | https://www.modelscope.cn/models/iic/speech_conformer_asr_nat-zh-cn-16k-aishell2-vocab5212-pytorch | 97.5196 | 145.0522 | 0.349418 | 0.346090 | |
| uni_asr | https://www.modelscope.cn/models/iic/speech_UniASR-large_asr_2pass-zh-cn-16k-common-vocab8358-tensorflow1-offline | 97.3654 | 151.1122 | 0.717138 | 0.717138 | |
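To make the runtime columns easier to compare, the short sketch below computes the ratio of the R200-8F runtime to the A100 runtime for each model family. The runtimes are copied from the table above; the names `RUNTIMES`, `slowdown`, and `summarize` are illustrative only and do not come from the repository.

```python
# Runtimes in seconds copied from the table: (A100, Kunlunxin R200-8F).
RUNTIMES = {
    "sense_voice": (1.3792, 2.2876),
    "whisper": (23.7654, 62.3298),
    "paraformer": (4.1229, 7.7168),
    "conformer": (97.5196, 145.0522),
    "uni_asr": (97.3654, 151.1122),
}


def slowdown(a100_seconds: float, r200_seconds: float) -> float:
    """Ratio of R200-8F runtime to A100 runtime (above 1 means slower)."""
    return r200_seconds / a100_seconds


def summarize(runtimes=RUNTIMES):
    """Map each model family to its runtime ratio, rounded to two decimals."""
    return {
        name: round(slowdown(a100, r200), 2)
        for name, (a100, r200) in runtimes.items()
    }
```

On these figures, the R200-8F takes roughly 1.5 to 2.6 times as long as the A100, with whisper showing the largest gap and conformer the smallest.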
