Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
ModelZoo
tacotron2_pytorch
Commits
496360c6
"vscode:/vscode.git/clone" did not exist on "1d2551d716b3fd9f40e528f50902f7cb17dc4617"
Commit
496360c6
authored
Jan 14, 2025
by
ACzhangchao
Browse files
Update README.md
parent
71a2d4a6
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
2 additions
and
2 deletions
+2
-2
README.md
README.md
+2
-2
No files found.
README.md
View file @
496360c6
...
@@ -10,13 +10,13 @@ https://arxiv.org/abs/1712.05884
...
@@ -10,13 +10,13 @@ https://arxiv.org/abs/1712.05884
Tacotron2与第一代相比剔除了CBHG模块,改为LSTM和卷积层,在保证语音合成质量的前提下简化了模型结构,提高训练和推理效率,在Vocoder部分使用可训练的WaveNet替换掉第一代中的Griffin-Lim算法,能够以高质量和高保真度生成音频波形
Tacotron2与第一代相比剔除了CBHG模块,改为LSTM和卷积层,在保证语音合成质量的前提下简化了模型结构,提高训练和推理效率,在Vocoder部分使用可训练的WaveNet替换掉第一代中的Griffin-Lim算法,能够以高质量和高保真度生成音频波形


## 算法原理
## 算法原理
Tacotron 2 模型通过使用编码器-解码器架构结合注意力机制,将文本序列转换为梅尔频谱图,然后利用WaveNet声码器将这些频谱图转化为自然语音波形,其核心在于端到端的训练方式和高质量语音合成能力。
Tacotron 2 模型通过使用编码器-解码器架构结合注意力机制,将文本序列转换为梅尔频谱图,然后利用WaveNet声码器将这些频谱图转化为自然语音波形,其核心在于端到端的训练方式和高质量语音合成能力。


## 环境配置
## 环境配置
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment