README.md 1.82 KB
Newer Older
zhangqha's avatar
zhangqha committed
1
# AASIST(Audio Anti-Spoofing Using Integrated Spectro-Temporal Graph Attention Networks) 
zhangqha's avatar
zhangqha committed
2
## 模型介绍
3
开源的音频反欺骗的模型
zhangqha's avatar
zhangqha committed
4
5

## 模型结构
zhangxiao-stack's avatar
zhangxiao-stack committed
6
aasist是一种开源的音频反欺诈的模型,主要的模型结构如下所示:
zhangqha's avatar
zhangqha committed
7

zhangqha's avatar
zhangqha committed
8
![Aassist_Backbone](Aassist_Backbone.PNG)
zhangqha's avatar
zhangqha committed
9

zhangxiao-stack's avatar
zhangxiao-stack committed
10
11
12
## 环境配置
### Docker(方法一)
提供[光源](https://www.sourcefind.cn/#/service-list)拉取的训练的docker镜像:
zhangqha's avatar
zhangqha committed
13
* 推理镜像:
zhangqha's avatar
update  
zhangqha committed
14
```
15
docker pull image.sourcefind.cn:5000/dcu/admin/base/custom:aasist-main
zhangqha's avatar
update  
zhangqha committed
16
```
zhangxiao-stack's avatar
zhangxiao-stack committed
17
* 激活镜像环境:
zhangqha's avatar
update  
zhangqha committed
18
```
zhangqha's avatar
zhangqha committed
19
20
source /root/env_disc.sh
cd /root/aasist;sh run.sh
zhangqha's avatar
update  
zhangqha committed
21
```
22
* python依赖安装:
zhangqha's avatar
update  
zhangqha committed
23
```
zhangqha's avatar
zhangqha committed
24
pip3 install -r requirement.txt
zhangqha's avatar
update  
zhangqha committed
25
```
zhangxiao-stack's avatar
zhangxiao-stack committed
26
27
28
29
30
31
32
33
34
35
36
37
38
39

## 数据集

脚本下载方式:
```
python ./download_dataset.py
```
手动下载方式:
```
ASVspoof2019 dataset: https://datashare.ed.ac.uk/handle/10283/3336
```
下载LA.zip文件,unzip解压

## 推理
zhangqha's avatar
zhangqha committed
40
41

To evaluate AASIST [1]:
zhangqha's avatar
update  
zhangqha committed
42
```
zhangqha's avatar
zhangqha committed
43
export TORCH_MHLO_OP_WHITE_LIST="aten::max;aten::batch_norm;aten::abs,aten::selu;prim::NumToTensor;aten::zeros_like;aten::size;aten::narrow;aten::cat;aten::selu_"
zhangqha's avatar
zhangqha committed
44

zhangqha's avatar
zhangqha committed
45
46
python3 main.py --eval --config ./config/AASIST.conf
python3 main_opt.py --eval --config ./config/AASIST.conf
zhangqha's avatar
update  
zhangqha committed
47
```
zhangqha's avatar
zhangqha committed
48
49

To evaluate AASIST-L [1]:
zhangqha's avatar
update  
zhangqha committed
50
```
zhangqha's avatar
zhangqha committed
51
export TORCH_MHLO_OP_WHITE_LIST="aten::max;aten::batch_norm;aten::abs,aten::selu;prim::NumToTensor;aten::zeros_like;aten::size;aten::narrow;aten::cat;aten::selu_"
zhangqha's avatar
zhangqha committed
52

zhangqha's avatar
zhangqha committed
53
54
python3 main.py --eval --config ./config/AASIST-L.conf
python3 main_opt.py --eval --config ./config/AASIST-L.conf
zhangqha's avatar
update  
zhangqha committed
55
```
zhangxiao-stack's avatar
zhangxiao-stack committed
56

zhangqha's avatar
zhangqha committed
57
测试命令:
zhangqha's avatar
update  
zhangqha committed
58
```
zhangqha's avatar
zhangqha committed
59
bash run.sh
zhangqha's avatar
update  
zhangqha committed
60
```
zhangxiao-stack's avatar
zhangxiao-stack committed
61
62
63
64
65
66
67
68
69
70
71
72
## 精度
使用Blade DISC优化后的精度与未使用Blade DISC优化后的精度保持一致

## 应用场景
### 算法类别
语音识别

### 热点行业
金融,交通,教育

### 源码仓库及问题反馈
https://developer.hpccube.com/codes/modelzoo/bladedisc_aasist
73

zhangxiao-stack's avatar
zhangxiao-stack committed
74
75
### 参考
https://github.com/clovaai/aasist.git
zhangqha's avatar
zhangqha committed
76