README.md 1.84 KB
Newer Older
zhangqha's avatar
zhangqha committed
1
# AASIST
zhangqha's avatar
zhangqha committed
2
## 模型介绍
3
开源的音频反欺骗的模型
zhangqha's avatar
zhangqha committed
4

zhangqha's avatar
zhangqha committed
5
6
7
## 论文
参考ASIST: Audio Anti-Spoofing using Integrated Spectro-Temporal Graph Attention Networks

zhangqha's avatar
zhangqha committed
8
## 模型结构
zhangxiao-stack's avatar
zhangxiao-stack committed
9
aasist是一种开源的音频反欺诈的模型,主要的模型结构如下所示:
zhangqha's avatar
zhangqha committed
10

zhangqha's avatar
zhangqha committed
11
![Aassist_Backbone](Aassist_Backbone.PNG)
zhangqha's avatar
zhangqha committed
12

zhangxiao-stack's avatar
zhangxiao-stack committed
13
14
15
## 环境配置
### Docker(方法一)
提供[光源](https://www.sourcefind.cn/#/service-list)拉取的训练的docker镜像:
zhangqha's avatar
zhangqha committed
16
* 推理镜像:
zhangqha's avatar
update  
zhangqha committed
17
```
18
docker pull image.sourcefind.cn:5000/dcu/admin/base/custom:aasist-main
zhangqha's avatar
update  
zhangqha committed
19
```
zhangxiao-stack's avatar
zhangxiao-stack committed
20
* 激活镜像环境:
zhangqha's avatar
update  
zhangqha committed
21
```
zhangqha's avatar
zhangqha committed
22
23
source /root/env_disc.sh
cd /root/aasist;sh run.sh
zhangqha's avatar
update  
zhangqha committed
24
```
25
* python依赖安装:
zhangqha's avatar
update  
zhangqha committed
26
```
zhangqha's avatar
zhangqha committed
27
pip3 install -r requirement.txt
zhangqha's avatar
update  
zhangqha committed
28
```
zhangxiao-stack's avatar
zhangxiao-stack committed
29
30
31
32
33
34
35
36
37
38
39
40
41
42

## 数据集

脚本下载方式:
```
python ./download_dataset.py
```
手动下载方式:
```
ASVspoof2019 dataset: https://datashare.ed.ac.uk/handle/10283/3336
```
下载LA.zip文件,unzip解压

## 推理
zhangqha's avatar
zhangqha committed
43
44

To evaluate AASIST [1]:
zhangqha's avatar
update  
zhangqha committed
45
```
zhangqha's avatar
zhangqha committed
46
export TORCH_MHLO_OP_WHITE_LIST="aten::max;aten::batch_norm;aten::abs,aten::selu;prim::NumToTensor;aten::zeros_like;aten::size;aten::narrow;aten::cat;aten::selu_"
zhangqha's avatar
zhangqha committed
47

zhangqha's avatar
zhangqha committed
48
49
python3 main.py --eval --config ./config/AASIST.conf
python3 main_opt.py --eval --config ./config/AASIST.conf
zhangqha's avatar
update  
zhangqha committed
50
```
zhangqha's avatar
zhangqha committed
51
52

To evaluate AASIST-L [1]:
zhangqha's avatar
update  
zhangqha committed
53
```
zhangqha's avatar
zhangqha committed
54
export TORCH_MHLO_OP_WHITE_LIST="aten::max;aten::batch_norm;aten::abs,aten::selu;prim::NumToTensor;aten::zeros_like;aten::size;aten::narrow;aten::cat;aten::selu_"
zhangqha's avatar
zhangqha committed
55

zhangqha's avatar
zhangqha committed
56
57
python3 main.py --eval --config ./config/AASIST-L.conf
python3 main_opt.py --eval --config ./config/AASIST-L.conf
zhangqha's avatar
update  
zhangqha committed
58
```
zhangxiao-stack's avatar
zhangxiao-stack committed
59

zhangqha's avatar
zhangqha committed
60
测试命令:
zhangqha's avatar
update  
zhangqha committed
61
```
zhangqha's avatar
zhangqha committed
62
bash run.sh
zhangqha's avatar
update  
zhangqha committed
63
```
zhangxiao-stack's avatar
zhangxiao-stack committed
64
65
66
67
68
69
70
71
72
73
74
75
## 精度
使用Blade DISC优化后的精度与未使用Blade DISC优化后的精度保持一致

## 应用场景
### 算法类别
语音识别

### 热点行业
金融,交通,教育

### 源码仓库及问题反馈
https://developer.hpccube.com/codes/modelzoo/bladedisc_aasist
76

zhangxiao-stack's avatar
zhangxiao-stack committed
77
78
### 参考
https://github.com/clovaai/aasist.git
zhangqha's avatar
zhangqha committed
79