main commit add README

605587de · chenxj · d08c1882 · 605587de · 605587de
Commit 605587de authored Sep 08, 2023 by chenxj
Hide whitespace changes
Inline Side-by-side

Showing with 34 additions and 0 deletions

README.md README.md +34 -0

speech_orig.wav speech_orig.wav +0 -0

No files found.
--- a/README.md
+++ b/README.md
 # asr_onnx
+## 模型介绍
+语音识别模型asr,speech-to-text(stt)
+## 模型结构
+![image](https://developer.hpccube.com/codes/modelzoo/asr_onnx/-/blob/main/resources/silero_stt_model.jpg)
+## 推理
+### 环境配置
+在[光源](https://www.sourcefind.cn/#/service-details)可拉取推理的docker镜像，在[光合开发者社区](https://cancon.hpccube.com:65024/4/main/)可下载onnxruntime安装包。asr_onnx推荐的镜像如下：
+```
+docker pull image.sourcefind.cn:5000/dcu/admin/base/pytorch:1.13.1-centos7.6-dtk-23.04-py37-latest
+```
+获取最新的dtk并安装依赖
+```
+cd /opt
+wget http://10.0.50.210:8000/jenkins/rocm/23.04.1/centos7/DTK-23.04.1-rc4-centos7-x86_64.tar.gz
+tar -zxvf DTK-23.04.1-rc4-centos7-x86_64.tar.gz
+source /opt/dtk-23.04.1/env.sh
+```
+下载模型(https://models.silero.ai/models/en/en_v5.onnx)到当前目录
+### 推理
+```
+python3 main.py --model_dir="./en_v5.onnx" --wav_dir="./wavs/" --warmup=1
+```
+## 性能数据
+| size | infer time(ms)| infer time after warmup(ms)| 
+| :------: | :------: | :------: |
+| (4,262144) | 7.6759592751041055 | 0.040782203897833824 | 
+| (4,524288) | 7.950103935785592 | 0.061228943057358265 | 
+| (4,1048576) | 8.080759830772877 | 0.11008367035537958 | 
+| (4,2097152) | 8.469960468821228 | 0.2672890517860651 | 
+## 源码仓库及问题反馈
+https://developer.hpccube.com/codes/modelzoo/asr_onnx
+## 参考
+* [silero-models](https://github.com/snakers4/silero-models)
--- a/speech_orig.wav
+++ b/speech_orig.wav