are trained using 45k hours of narrated English audiobooks. It generates high-quality speech
with features that can be controlled using a simple text prompt (e.g. gender, background noise, speaking rate, pitch and reverberation).</p>
<p>By default, Parler-TTS generates 🎲 random voice. To ensure 🎯 <b> speaker consistency </b> across generations, these checkpoints were also trained on 34 speakers, characterized by name (e.g. Jon, Lea, Gary, Jenna, Mike, Laura).</p>
<li>Punctuation can be used to control the prosody of the generations, e.g. use commas to add small breaks in speech</li>
<li>标点符号可用于控制代际韵律,例如可以使用逗号在语音中添加小停顿</li>
<li>The remaining speech features (gender, speaking rate, pitch and reverberation) can be controlled directly through the prompt</li>
<li>其余语音特征(性别、语速、音调和混响)可以通过文本直接控制</li>
</ul>
</ul>
</p>
</p>
<p>Parler-TTS can be much faster. We give some tips on how to generate much more quickly in this <a href="https://github.com/huggingface/parler-tts/blob/main/INFERENCE.md"> inference guide</a>. Think SDPA, torch.compile, batching and streaming!</p>
<p>Parler-TTS可以更快。 在<a href="https://github.com/huggingface/parler-tts/blob/main/INFERENCE.md"> 推理指南</a>给出了一些快速生成的方法,包括SDPA, torch.compile, batching and streaming!</p>
<p>If you want to find out more about how this model was trained and even fine-tune it yourself, check-out the
<p> 如果您想了解有关如何训练该模型的更多信息,甚至对其进行微调,请查看 GitHub 上
<a href="https://github.com/huggingface/parler-tts"> Parler-TTS</a> repository on GitHub.</p>
<p>The Parler-TTS codebase and its associated checkpoints are licensed under <a href='https://github.com/huggingface/parler-tts?tab=Apache-2.0-1-ov-file#readme'> Apache 2.0</a>.</p>