"tests/fixtures/custom_pipeline/pipeline.py" did not exist on "9c3820d05a674098e47a9122fb2b431dd259a808"
Unverified Commit ac9c881d authored by Yoach Lacombe's avatar Yoach Lacombe Committed by GitHub
Browse files

Update TRAINING.md

parent 7243b0f5
...@@ -7,7 +7,7 @@ This sub-folder contains all the information to train or finetune you own Parler ...@@ -7,7 +7,7 @@ This sub-folder contains all the information to train or finetune you own Parler
- [E. Scaling up to 10.5K hours](#d-scaling-up---discussions-and-tips) - [E. Scaling up to 10.5K hours](#d-scaling-up---discussions-and-tips)
# A. Architecture ## A. Architecture
At the moment, Parler-TTS architecture is a carbon copy of [Musicgen architecture](https://huggingface.co/docs/transformers/v4.39.3/en/model_doc/musicgen#model-structure) and can be decomposed into three distinct stages: At the moment, Parler-TTS architecture is a carbon copy of [Musicgen architecture](https://huggingface.co/docs/transformers/v4.39.3/en/model_doc/musicgen#model-structure) and can be decomposed into three distinct stages:
>1. Text encoder: maps the text inputs to a sequence of hidden-state representations. The pre-trained MusicGen models use a frozen text encoder from either T5 or Flan-T5 >1. Text encoder: maps the text inputs to a sequence of hidden-state representations. The pre-trained MusicGen models use a frozen text encoder from either T5 or Flan-T5
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment