# Tacotron 2 (without wavenet)

## Paper

https://arxiv.org/abs/1712.05884

## Environment Setup

### Docker

Pull the image, then start and enter the container:

```
docker pull image.sourcefind.cn:5000/dcu/admin/base/pytorch:2.1.0-ubuntu20.04-dtk24.04.1-py3.10
docker run -it --shm-size 80g --network=host --name=tacotron2 --privileged --device /dev/m--device=/dev/kfd --device=/dev/dri --group-add video --cap-add=SYS_PTRACE --security-opt seccomp=unconfined -v /opt/hyhal/:/opt/hyhal/:ro -v /public/DL_DATA/AI/publicdata/ASR/LJSpeech-1.1:/LJSpeech-1.1:ro image.sourcefind.cn:5000/dcu/admin/base/pytorch:2.1.0-ubuntu20.04-dtk24.04.1-py3.10 /bin/bash
```

### Clone the repository

```
git clone http://developer.hpccube.com/codes/modelzoo/tacotron2.git
```

```
cd tacotron2
```

### Initialize submodules

```
git submodule init; git submodule update
```

### Update the .wav paths

```
sed -i -- 's,DUMMY,/LJSpeech-1.1/wavs,g' filelists/*.txt
```

### Install Python dependencies

```
pip install -r requirements.txt -i https://pypi.tuna.tsinghua.edu.cn/simple
```

## Single-card training

```
bash run_single.sh
```

## Multi-card training

Run the script:

```
bash run_multi.sh
```

## Inference

Replace `checkpoint_path` and `waveglow_path` in inference.py with your own paths, then run inference.py:

```
python inference.py
```
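
Before launching inference, it can help to confirm that both paths point to real files. The following is a minimal sketch, not part of the repository: the helper `missing_paths` and the two example file locations are hypothetical and should be replaced with the values you set in inference.py.

```python
from pathlib import Path


def missing_paths(paths):
    """Return the names in `paths` whose value is not an existing file."""
    return [name for name, p in paths.items() if not Path(p).is_file()]


if __name__ == "__main__":
    # Hypothetical locations (assumptions for illustration only);
    # use the same values you configured in inference.py.
    candidates = {
        "checkpoint_path": "outdir/checkpoint_10000",
        "waveglow_path": "waveglow_256channels.pt",
    }
    for name in missing_paths(candidates):
        print(f"{name} does not exist: {candidates[name]}")
```

If the script prints nothing, both files were found and `python inference.py` should at least get past loading the checkpoints.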