ModelZoo / ASR_onnxruntime
Commit 738eac8c, authored Dec 08, 2023 by chenxj
update README
Parent: a6b312ec
Showing 4 changed files with 23 additions and 23 deletions
README.md (+20 -22)
model.properties (+3 -1)
resources/asr.png (+0 -0)
resources/asr_result.png (+0 -0)
README.md
# asr
# asr_onnx
asr is a speech recognition (speech-to-text, STT) model.
## Paper
None
## Model structure

## Algorithm principle

## Dataset
None
## Environment setup
The inference Docker image can be pulled from [光源](https://www.sourcefind.cn/#/service-details), and the onnxruntime installation package can be downloaded from the [光合开发者社区](https://cancon.hpccube.com:65024/4/main/). The recommended image for asr_onnx is:
```
docker pull image.sourcefind.cn:5000/dcu/admin/base/pytorch:1.13.1-centos7.6-dtk-23.04-py37-latest
docker run -d -t --privileged --device=/dev/kfd --device=/dev/dri/ --network=host --group-add video --name asr-test image.sourcefind.cn:5000/dcu/admin/base/pytorch:1.13.1-centos7.6-dtk-23.04-py37-latest
docker exec -it asr-test bash
pip3 install onnxruntime.whl
```
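After `pip3 install onnxruntime.whl`, it can be useful to confirm inside the container that the package imports and to see which execution providers it reports. The following is a minimal sketch, not part of the repository; the helper name `onnxruntime_providers` is hypothetical, and the provider names printed depend on the wheel that was installed.

```python
import importlib.util


def onnxruntime_providers():
    """Return the execution providers reported by onnxruntime.

    Returns an empty list when onnxruntime is not installed, so the
    check can run before and after the pip3 install step.
    """
    if importlib.util.find_spec("onnxruntime") is None:
        return []
    import onnxruntime as ort  # imported lazily, only when present
    return list(ort.get_available_providers())


if __name__ == "__main__":
    providers = onnxruntime_providers()
    if providers:
        print("onnxruntime providers:", providers)
    else:
        print("onnxruntime is not installed in this environment")
```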
Get the latest DTK and install the dependencies:
```
cd /opt
wget http://10.0.50.210:8000/jenkins/rocm/23.04.1/centos7/DTK-23.04.1-rc4-centos7-x86_64.tar.gz
tar -zxvf DTK-23.04.1-rc4-centos7-x86_64.tar.gz
source /opt/dtk-23.04.1/env.sh
```
## Inference
Download the model (https://models.silero.ai/models/en/en_v5.onnx) to the current directory, then create a `wavs` folder and add the test WAV files to it.
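The preparation step (model file in the current directory, a `wavs` folder with test files) could be scripted roughly as follows. This is a sketch under assumptions: the function name `prepare_workspace` and its `download` flag are hypothetical and not part of the repository; only the model URL and the `wavs` and `en_v5.onnx` names come from the README.

```python
from pathlib import Path
from urllib.request import urlretrieve

# Model URL as given in the README.
MODEL_URL = "https://models.silero.ai/models/en/en_v5.onnx"


def prepare_workspace(root=".", download=True):
    """Ensure the model file and the wavs folder exist under `root`.

    Returns (model_path, wav_files). `download=False` skips the network
    fetch, e.g. when the model has already been copied in by hand.
    """
    root = Path(root)
    model_path = root / "en_v5.onnx"
    if download and not model_path.exists():
        urlretrieve(MODEL_URL, str(model_path))

    wav_dir = root / "wavs"
    wav_dir.mkdir(parents=True, exist_ok=True)
    wav_files = sorted(wav_dir.glob("*.wav"))
    return model_path, wav_files
```

Calling `prepare_workspace()` from the repository root would leave the layout that the `--model_dir` and `--wav_dir` arguments of `main.py` expect; an empty `wav_files` list means no test audio has been added yet.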
```
python3 main.py --model_dir="./en_v5.onnx" --wav_dir="./wavs/" --warmup=1
```
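The `--warmup` flag suggests that some runs are executed before timing starts. How `main.py` implements this is not shown here, so the following is only a generic sketch of warmup-then-measure timing; `time_inference` and `run_once` are hypothetical names.

```python
import time


def time_inference(run_once, warmup=1, runs=10):
    """Average wall-clock time per call in milliseconds.

    `run_once` is any zero-argument callable that performs one inference.
    The first `warmup` calls are executed but not timed, so one-off costs
    such as lazy initialisation do not skew the average.
    """
    for _ in range(warmup):
        run_once()
    start = time.perf_counter()
    for _ in range(runs):
        run_once()
    elapsed = time.perf_counter() - start
    return elapsed * 1000.0 / runs


# Possible use with onnxruntime (assumption: `batch` is a NumPy array whose
# shape and dtype match the model's input):
#   import onnxruntime as ort
#   session = ort.InferenceSession("en_v5.onnx")
#   name = session.get_inputs()[0].name
#   ms = time_inference(lambda: session.run(None, {name: batch}), warmup=1)
```

Running the same callable with `warmup=0` and `warmup=1` gives a rough comparison of cold and warm inference times.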
## Result

## Accuracy
| size | infer time (ms) | infer time after warmup (ms) |
| :------: | :------: | :------: |
| (4,262144) | 7.6759592751041055 | 0.040782203897833824 |
| (4,524288) | 7.950103935785592 | 0.061228943057358265 |
| (4,1048576) | 8.080759830772877 | 0.11008367035537958 |
| (4,2097152) | 8.469960468821228 | 0.2672890517860651 |
### Accuracy
Not yet available
## Application scenarios
### Algorithm category
Speech recognition
### Popular application industries
Transportation, finance, healthcare, education, smart home
## Source repository and issue feedback
https://developer.hpccube.com/codes/modelzoo/asr_onnx
## References
* [silero-models](https://github.com/snakers4/silero-models)
model.properties
# Unique model identifier
modelCode=410
# Model name
modelName=asr_onnx
# Model description
modelDescription=asr_onnx is a speech recognition model that performs speech-to-text conversion
# Application scenarios
appScenario=inference,ASR
appScenario=inference,transportation,finance,healthcare,education,smart home
# Framework type
frameType=onnxruntime
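A `key=value` file like `model.properties`, with `#` comment lines, can be read with a few lines of Python. This is a minimal sketch: `parse_properties` is a hypothetical helper, not something the repository provides, and it ignores features of the full Java properties format such as escapes and line continuations.

```python
def parse_properties(text):
    """Parse simple key=value lines, skipping blank lines and # comments.

    If a key appears more than once, the last value wins.
    """
    props = {}
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#"):
            continue
        key, sep, value = line.partition("=")
        if sep:  # ignore lines without an "=" separator
            props[key.strip()] = value.strip()
    return props


sample = """
# Unique model identifier
modelCode=410
# Model name
modelName=asr_onnx
# Framework type
frameType=onnxruntime
"""

props = parse_properties(sample)
print(props["modelName"], props["frameType"])
```

All values are returned as strings, so `modelCode` would need an explicit `int(...)` conversion if it is used numerically.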
resources/asr.png
0 → 100644
7.23 KB
resources/asr_result.png
0 → 100644
24.3 KB