"src/vscode:/vscode.git/clone" did not exist on "9aac93ff21114c715767224483883b05c7bc95b8"
README.md 2.7 KB
Newer Older
dcuai's avatar
dcuai committed
1
# aasist
zhangqha's avatar
zhangqha committed
2
3
## 论文
参考ASIST: Audio Anti-Spoofing using Integrated Spectro-Temporal Graph Attention Networks
zhangqha@sugon.com's avatar
zhangqha@sugon.com committed
4
https://arxiv.org/pdf/2110.01200.pdf
zhangqha's avatar
zhangqha committed
5

zhangqha's avatar
zhangqha committed
6
## 模型结构
zhangxiao-stack's avatar
zhangxiao-stack committed
7
aasist是一种开源的音频反欺诈的模型,主要的模型结构如下所示:
zhangqha's avatar
zhangqha committed
8

zhangqha's avatar
zhangqha committed
9
![Aassist_Backbone](Aassist_Backbone.PNG)
zhangqha's avatar
zhangqha committed
10

zhangqha@sugon.com's avatar
zhangqha@sugon.com committed
11
12
13
14
## 算法原理

![Aassist_Backbone](Aassist_Backbone.PNG)

zhangxiao-stack's avatar
zhangxiao-stack committed
15
16
17
## 环境配置
### Docker(方法一)
提供[光源](https://www.sourcefind.cn/#/service-list)拉取的训练的docker镜像:
zhangqha's avatar
zhangqha committed
18
* 推理镜像:
zhangqha's avatar
update  
zhangqha committed
19
```
20
docker pull image.sourcefind.cn:5000/dcu/admin/base/custom:aasist-main
zhangqha@sugon.com's avatar
zhangqha@sugon.com committed
21
22
docker run -it -v /path/your_code_data/:/path/your_code_data/ --shm-size=32G --privileged=true --device=/dev/kfd --device=/dev/dri/ --group-add video --name docker_name imageID bash

zhangqha's avatar
update  
zhangqha committed
23
```
zhangxiao-stack's avatar
zhangxiao-stack committed
24
* 激活镜像环境:
zhangqha's avatar
update  
zhangqha committed
25
```
zhangqha's avatar
zhangqha committed
26
27
source /root/env_disc.sh
cd /root/aasist;sh run.sh
zhangqha's avatar
update  
zhangqha committed
28
```
29
* python依赖安装:
zhangqha's avatar
update  
zhangqha committed
30
```
zhangqha@sugon.com's avatar
zhangqha@sugon.com committed
31
pip3 install -r requirements.txt
zhangqha's avatar
update  
zhangqha committed
32
```
zhangxiao-stack's avatar
zhangxiao-stack committed
33
34
35
36
37
38
39
40
41
42
43

## 数据集

脚本下载方式:
```
python ./download_dataset.py
```
手动下载方式:
```
ASVspoof2019 dataset: https://datashare.ed.ac.uk/handle/10283/3336
下载LA.zip文件,unzip解压
zhangqha@sugon.com's avatar
zhangqha@sugon.com committed
44
45
46
47
48
49
50
51
52
53
LA
    ├── ASVspoof2019_LA_asv_protocols
    ├── ASVspoof2019_LA_asv_scores
        ├──ASVspoof2019.LA.asv.dev.gi.trl.scores.txt
        ├──ASVspoof2019.LA.asv.eval.gi.trl.scores.txt
        ...
    ├── ASVspoof2019_LA_cm_protocols
    ├── ASVspoof2019_LA_dev
    ├── ASVspoof2019_LA_eval
    ├── ASVspoof2019_LA_train
zhangqha@sugon.com's avatar
zhangqha@sugon.com committed
54
```
chenzk's avatar
chenzk committed
55

chenzk's avatar
chenzk committed
56
数据集快速下载中心:[SCNet AIDatasets](https://www.scnet.cn/ui/aihub/datasets) ,项目中数据集可从快速下载通道下载:[ASVspoof2019](https://www.scnet.cn/ui/aihub/datasets/chenhao/ASVspoof2019)
chenzk's avatar
chenzk committed
57

zhangxiao-stack's avatar
zhangxiao-stack committed
58
## 推理
zhangqha's avatar
zhangqha committed
59
60

To evaluate AASIST [1]:
zhangqha's avatar
update  
zhangqha committed
61
```
zhangqha's avatar
zhangqha committed
62
export TORCH_MHLO_OP_WHITE_LIST="aten::max;aten::batch_norm;aten::abs,aten::selu;prim::NumToTensor;aten::zeros_like;aten::size;aten::narrow;aten::cat;aten::selu_"
zhangqha's avatar
zhangqha committed
63

zhangqha's avatar
zhangqha committed
64
65
python3 main.py --eval --config ./config/AASIST.conf
python3 main_opt.py --eval --config ./config/AASIST.conf
zhangqha's avatar
update  
zhangqha committed
66
```
zhangqha's avatar
zhangqha committed
67
68

To evaluate AASIST-L [1]:
zhangqha's avatar
update  
zhangqha committed
69
```
zhangqha's avatar
zhangqha committed
70
export TORCH_MHLO_OP_WHITE_LIST="aten::max;aten::batch_norm;aten::abs,aten::selu;prim::NumToTensor;aten::zeros_like;aten::size;aten::narrow;aten::cat;aten::selu_"
zhangqha's avatar
zhangqha committed
71

zhangqha's avatar
zhangqha committed
72
73
python3 main.py --eval --config ./config/AASIST-L.conf
python3 main_opt.py --eval --config ./config/AASIST-L.conf
zhangqha's avatar
update  
zhangqha committed
74
```
zhangxiao-stack's avatar
zhangxiao-stack committed
75

zhangqha's avatar
zhangqha committed
76
测试命令:
zhangqha's avatar
update  
zhangqha committed
77
```
zhangqha's avatar
zhangqha committed
78
bash run.sh
zhangqha@sugon.com's avatar
zhangqha@sugon.com committed
79
```
zhangqha@sugon.com's avatar
zhangqha@sugon.com committed
80
## result
zhangqha's avatar
zhangqha committed
81
![aasist_result](aasist_result.png)
zhangqha@sugon.com's avatar
zhangqha@sugon.com committed
82

dcuai's avatar
dcuai committed
83
### 精度
zhangxiao-stack's avatar
zhangxiao-stack committed
84
85
86
87
88
89
使用Blade DISC优化后的精度与未使用Blade DISC优化后的精度保持一致

## 应用场景
### 算法类别
语音识别

dcuai's avatar
dcuai committed
90
### 热点应用行业
zhangxiao-stack's avatar
zhangxiao-stack committed
91
92
金融,交通,教育

zhangqha@sugon.com's avatar
zhangqha@sugon.com committed
93
## 源码仓库及问题反馈
chenzk's avatar
chenzk committed
94
https://developer.sourcefind.cn/codes/modelzoo/bladedisc_aasist
95

zhangqha@sugon.com's avatar
zhangqha@sugon.com committed
96
## 参考资料
zhangxiao-stack's avatar
zhangxiao-stack committed
97
https://github.com/clovaai/aasist.git
zhangqha's avatar
zhangqha committed
98