gaoqiong / RapidASR · Commits · 46fc6fee

Commit 46fc6fee (unverified), authored Mar 01, 2023 by SWHL; committed by GitHub, Mar 01, 2023

Update files

Parent: 2c39c32e

Showing 19 changed files with 85 additions and 79 deletions (+85 -79)
- README.md (+7 -65)
- cpp_onnx/readme.md (+5 -14)
- python/README.md (+73 -0)
- python/demo.py (+0 -0)
- python/rapid_paraformer/__init__.py (+0 -0)
- python/rapid_paraformer/kaldifeat/LICENSE (+0 -0)
- python/rapid_paraformer/kaldifeat/README.md (+0 -0)
- python/rapid_paraformer/kaldifeat/__init__.py (+0 -0)
- python/rapid_paraformer/kaldifeat/feature.py (+0 -0)
- python/rapid_paraformer/kaldifeat/ivector.py (+0 -0)
- python/rapid_paraformer/rapid_paraformer.py (+0 -0)
- python/rapid_paraformer/utils.py (+0 -0)
- python/requirements.txt (+0 -0)
- python/resources/config.yaml (+0 -0)
- python/resources/models/am.mvn (+0 -0)
- python/resources/models/token_list.pkl (+0 -0)
- python/test_wavs/0478_00017.wav (+0 -0)
- python/test_wavs/asr_example_zh.wav (+0 -0)
- python/tests/test_infer.py (+0 -0)
README.md @ 46fc6fee

    -## Rapid paraformer
    +## Rapid ASR
     <p align="left">
    -    <a href=""><img src="https://img.shields.io/badge/Python->=3.7,<=3.10-aff.svg"></a>
    -    <a href=""><img src="https://img.shields.io/badge/OS-Linux%2C%20Win%2C%20Mac-pink.svg"></a>
    +    <a href=""><img src="https://img.shields.io/badge/OS-Linux%2C%20Win%2C%20Mac-pink.svg"></a>
    +    <a href=""><img src="https://img.shields.io/badge/Python->=3.7,<=3.10-aff.svg"></a>
    +    <a href=""><img src="https://img.shields.io/badge/C++-aff.svg"></a>
     </p>
     - The model comes from Alibaba DAMO Academy: [Paraformer语音识别-中文-通用-16k-离线-large-pytorch](https://www.modelscope.cn/models/damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch/summary)
     - 🎉 The core code of this project has been merged into [FunASR](https://github.com/alibaba-damo-academy/FunASR)
     - This repository only converts the model and uses ONNXRuntime as its sole inference engine
    +#### Documentation
    +- [Python version](./python/README.md)
    +- [C++/C version](./cpp_onnx/readme.md)
     #### TODO
     - [ ] Integrate the vad + asr + pun models into a deployable solution
    -#### Supported languages
    -- Python
    -- C++/C
    -#### Usage
    -1. Set up the environment
    -```bash
    -pip install -r requirements.txt
    -```
    -2. Download the model
    -   - Because the model is large (823.8M), uploading it to the repository would make it hard to download, so use one of the following:
    -     - (Recommended) Convert it yourself: the notebook environment on ModelScope supports one-click conversion. For details, see [Quick Start](https://www.modelscope.cn/models/damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch/summary)
    -       - Open a notebook, enter `!python -m funasr.export.export_model 'damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch' "./export" true` in a cell, and run it.
    -     - Baidu Cloud download link: [asr_paraformerv2.onnx](https://pan.baidu.com/s/1-nEf2eUpkzlcRqiYEwub2A?pwd=dcr3) (model MD5: `9ca331381a470bc4458cc6c0b0b165de`)
    -   - After downloading, place the model in the `resources/models` directory. The final directory structure is:
    -```text
    -.
    -├── demo.py
    -├── rapid_paraformer
    -│   ├── __init__.py
    -│   ├── kaldifeat
    -│   ├── __pycache__
    -│   ├── rapid_paraformer.py
    -│   └── utils.py
    -├── README.md
    -├── requirements.txt
    -├── resources
    -│   ├── config.yaml
    -│   └── models
    -│       ├── am.mvn
    -│       ├── asr_paraformerv2.onnx  # place it here
    -│       └── token_list.pkl
    -├── test_onnx.py
    -├── tests
    -│   ├── __pycache__
    -│   └── test_infer.py
    -└── test_wavs
    -    ├── 0478_00017.wav
    -    └── asr_example_zh.wav
    -```
    -3. Run the demo
    -```python
    -from rapid_paraformer import RapidParaformer
    -
    -config_path = 'resources/config.yaml'
    -paraformer = RapidParaformer(config_path)
    -
    -# Input: accepts Union[str, np.ndarray, List[str]]
    -# Output: List[asr_res]
    -wav_path = [
    -    'test_wavs/0478_00017.wav',
    -]
    -
    -result = paraformer(wav_path)
    -print(result)
    -```
    -4. Check the result
    -```text
    -['呃说不配合就不配合的好以上的话呢我们摘取八九十三条因为这三条的话呢比较典型啊一些数字比较明确尤其是时间那么我们要投资者就是了解这一点啊不要轻信这个市场可以快速回来啊这些配市公司啊后期又利好了可以快速快速攻能包括像前一段时间啊有些媒体在二三月份的时候']
    -```
     #### Changelog
     - 2023-02-25
     ...
cpp_onnx/readme.md @ 46fc6fee

    @@ -45,9 +45,6 @@
     See the bottom of this page: Building Guidance
     ### Running the program
     tester /path/to/models/dir /path/to/wave/file
    @@ -56,11 +53,8 @@ tester /path/to/models/dir /path/to/wave/file
     /data/models must contain the following two files: model.onnx and vocab.txt
     ```
     ## Supported platforms
     - Windows
     - Linux/Unix
    @@ -68,7 +62,6 @@ tester /path/to/models/dir /path/to/wave/file
     - fftw3
     - onnxruntime
     ## Exporting the model in ONNX format
     Install modelscope and FunASR (dependencies: torch, torchaudio); for the installation process, see the [detailed documentation](https://github.com/alibaba-damo-academy/FunASR/wiki)
     ```shell
    @@ -92,7 +85,7 @@ cd build
     # download an appropriate onnxruntime from https://github.com/microsoft/onnxruntime/releases/tag/v1.14.0
     # here we get a copy of onnxruntime for linux 64
     wget https://github.com/microsoft/onnxruntime/releases/download/v1.14.0/onnxruntime-linux-x64-1.14.0.tgz
    -#ls
    +# ls
     # onnxruntime-linux-x64-1.14.0 onnxruntime-linux-x64-1.14.0.tgz
     #install fftw3-dev
    @@ -104,7 +97,6 @@ apt install libfftw3-dev
     # then in the subfolder tester of current directory, you will see a program, tester
     ````
     ### The structure of a qualified onnxruntime package.
    @@ -113,4 +105,3 @@ onnxruntime_xxx
     ├───include
     └───lib
     ```
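
The readme excerpt above states that the models directory passed to `tester` must contain `model.onnx` and `vocab.txt`. A minimal Python sketch of a pre-flight check under that one assumption; the helper name `missing_model_files` is hypothetical and not part of the repository:

```python
from pathlib import Path
from typing import List

# File names taken from the readme; the helper itself is illustrative only.
REQUIRED_FILES = ("model.onnx", "vocab.txt")


def missing_model_files(models_dir: str) -> List[str]:
    """Return the required file names that are absent from models_dir."""
    root = Path(models_dir)
    return [name for name in REQUIRED_FILES if not (root / name).is_file()]
```

Calling `missing_model_files("/data/models")` before launching `tester` returns an empty list when both files are present, which makes a missing vocabulary file easier to spot than a failure inside the C++ program.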
python/README.md (new file, 0 → 100644) @ 46fc6fee

## Rapid ASR
<p align="left">
    <a href=""><img src="https://img.shields.io/badge/Python->=3.7,<=3.10-aff.svg"></a>
    <a href=""><img src="https://img.shields.io/badge/OS-Linux%2C%20Win%2C%20Mac-pink.svg"></a>
</p>

- The model comes from Alibaba DAMO Academy: [Paraformer语音识别-中文-通用-16k-离线-large-pytorch](https://www.modelscope.cn/models/damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch/summary)
- 🎉 The core code of this project has been merged into [FunASR](https://github.com/alibaba-damo-academy/FunASR)
- This repository only converts the model and uses ONNXRuntime as its sole inference engine

#### TODO
- [ ] Integrate the vad + asr + pun models into a deployable solution

#### Usage
1. Set up the environment
```bash
pip install -r requirements.txt
```
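
The badge at the top of this README advertises Python >=3.7, <=3.10. A small sketch for checking the interpreter before installing the requirements; the helper name `is_supported` is an assumption for illustration, and the bounds are read off the badge rather than from any package metadata:

```python
import sys

# Bounds as shown on the README badge (assumed to be inclusive).
MIN_VERSION = (3, 7)
MAX_VERSION = (3, 10)


def is_supported(version=sys.version_info) -> bool:
    """True if the (major, minor) pair lies within the advertised range."""
    major_minor = tuple(version[:2])
    return MIN_VERSION <= major_minor <= MAX_VERSION


print("supported" if is_supported() else "outside the advertised range")
```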
2. Download the model
   - Because the model is large (823.8M), uploading it to the repository would make it hard to download, so use one of the following:
     - (Recommended) Convert it yourself: the notebook environment on ModelScope supports one-click conversion. For details, see [Quick Start](https://www.modelscope.cn/models/damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch/summary)
       - Open a notebook, enter `!python -m funasr.export.export_model 'damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch' "./export" true` in a cell, and run it.
     - Baidu Cloud download link: [asr_paraformerv2.onnx](https://pan.baidu.com/s/1-nEf2eUpkzlcRqiYEwub2A?pwd=dcr3) (model MD5: `9ca331381a470bc4458cc6c0b0b165de`)
   - After downloading, place the model in the `resources/models` directory. The final directory structure is:
```text
.
├── demo.py
├── rapid_paraformer
│   ├── __init__.py
│   ├── kaldifeat
│   ├── __pycache__
│   ├── rapid_paraformer.py
│   └── utils.py
├── README.md
├── requirements.txt
├── resources
│   ├── config.yaml
│   └── models
│       ├── am.mvn
│       ├── asr_paraformerv2.onnx  # place it here
│       └── token_list.pkl
├── test_onnx.py
├── tests
│   ├── __pycache__
│   └── test_infer.py
└── test_wavs
    ├── 0478_00017.wav
    └── asr_example_zh.wav
```
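
If the model is fetched from the Baidu Cloud link, its checksum can be compared against the MD5 quoted in step 2. A minimal sketch using only the standard library; the helper name `md5_of` is hypothetical, and the path assumes the directory layout shown in this README:

```python
import hashlib
from pathlib import Path

EXPECTED_MD5 = "9ca331381a470bc4458cc6c0b0b165de"  # value quoted in this README
MODEL_PATH = Path("resources/models/asr_paraformerv2.onnx")  # layout from this README


def md5_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Hash the file in chunks so a large model is not read into memory at once."""
    digest = hashlib.md5()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


if MODEL_PATH.is_file():
    status = "OK" if md5_of(MODEL_PATH) == EXPECTED_MD5 else "MISMATCH"
    print(f"{MODEL_PATH}: {status}")
else:
    print(f"{MODEL_PATH} not found")
```

The quoted MD5 refers to the file behind the download link; a model exported through the notebook route may not match it.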
3. Run the demo
```python
from rapid_paraformer import RapidParaformer

config_path = 'resources/config.yaml'
paraformer = RapidParaformer(config_path)

# Input: accepts Union[str, np.ndarray, List[str]]
# Output: List[asr_res]
wav_path = [
    'test_wavs/0478_00017.wav',
]

result = paraformer(wav_path)
print(result)
```
4. Check the result
```text
['呃说不配合就不配合的好以上的话呢我们摘取八九十三条因为这三条的话呢比较典型啊一些数字比较明确尤其是时间那么我们要投资者就是了解这一点啊不要轻信这个市场可以快速回来啊这些配市公司啊后期又利好了可以快速快速攻能包括像前一段时间啊有些媒体在二三月份的时候']
```
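
The demo comment lists a list of paths among the accepted inputs, so several recordings can be passed in one call. A sketch of collecting the `.wav` files in a directory; `collect_wavs` is a hypothetical helper, and the commented-out call reuses the `paraformer` object from step 3:

```python
from pathlib import Path
from typing import List


def collect_wavs(directory: str) -> List[str]:
    """Return the .wav files directly inside directory, sorted by path."""
    return sorted(str(p) for p in Path(directory).glob("*.wav"))


wav_paths = collect_wavs("test_wavs")
print(wav_paths)
# With the model in place, the list can be passed as in step 3:
# result = paraformer(wav_paths)
```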

Files moved @ 46fc6fee:

- demo.py → python/demo.py
- rapid_paraformer/__init__.py → python/rapid_paraformer/__init__.py
- rapid_paraformer/kaldifeat/LICENSE → python/rapid_paraformer/kaldifeat/LICENSE
- rapid_paraformer/kaldifeat/README.md → python/rapid_paraformer/kaldifeat/README.md
- rapid_paraformer/kaldifeat/__init__.py → python/rapid_paraformer/kaldifeat/__init__.py
- rapid_paraformer/kaldifeat/feature.py → python/rapid_paraformer/kaldifeat/feature.py
- rapid_paraformer/kaldifeat/ivector.py → python/rapid_paraformer/kaldifeat/ivector.py
- rapid_paraformer/rapid_paraformer.py → python/rapid_paraformer/rapid_paraformer.py
- rapid_paraformer/utils.py → python/rapid_paraformer/utils.py
- requirements.txt → python/requirements.txt
- resources/config.yaml → python/resources/config.yaml
- resources/models/am.mvn → python/resources/models/am.mvn
- resources/models/token_list.pkl → python/resources/models/token_list.pkl
- test_wavs/0478_00017.wav → python/test_wavs/0478_00017.wav
- test_wavs/asr_example_zh.wav → python/test_wavs/asr_example_zh.wav
- tests/test_infer.py → python/tests/test_infer.py