1. 15 Aug, 2024 2 commits
    • Update training guide colab (#108) · 8e465f1b
      Yoach Lacombe authored
      * Update README.md
      
      * Update README.md
      
      * Update README.md
      
      * update configs and readme
      
      * fix training and eval single gpus and long audios errors
      
      * fix error transcriptions none
      
      * fix transcription null wer
      
      * Update README.md
      
      * Update README.md
      
      ---------
      
      Co-authored-by: Yoach Lacombe <yoach@huggingface.co>
    • Update training guide (#102) · 8f5ef3a2
      Yoach Lacombe authored
      * Update README.md
      
      * Update README.md
      
      * Update README.md
      
      * update configs and readme
      
      * fix training and eval single gpus and long audios errors
      
      * fix error transcriptions none
      
      * fix transcription null wer
      
      ---------
      
      Co-authored-by: Yoach Lacombe <yoach@huggingface.co>
  2. 08 Aug, 2024 1 commit
  3. 31 Jul, 2024 1 commit
    • Architecture improvements (#65) · 11b209e1
      Yoach Lacombe authored
      * add RoPE
      
      * don't include padding in rope
      
      * possibly use cross-attn for prompt
      
      * fix rope
      
      * fix cross-attn
      
      * fix self-attn
      
      * fix dummy model
      
      * clean-up rope
      
      * first gqa implementation
      
      * fix wer eval
      
      * feat: add flash attention and SDPA
      
      * chore: add README for flash attention
      
      * chore: add benchmark script
      
      * chore: add benchmark attention approach
      
      * multi node and fix wer and fix compile
      
      * Update modeling_parler_tts.py
      
      * fix FA2, SDPA and add cross-attn MHA and attention type forcing
      
      * better cross_attention key values number of heads default + add training arguments for attn implementation
      
      * fix audio padding when torch compile or pad_to_max_length=True
      
      * correct multi node
      
      * make rope faster
      
      * fix encoder sdpa
      
      * fix training with cross attention + with FA2
      
      * use fp32 as default model dtype + fix generation when using FA2 with autocast
      
      * remove redundant passes in generate + clean and fix attentions
      
      * fix edge case in WER evaluation when longform generation
      
      * better multi-node mapping and saving / add eval dataloader num workers
      
      * remove old benchmarks
      
      * faster audio encoding + checkpointing + fix generation step
      
      * better eval + add right padding + fix eval loss compute
      
      * correct README
      
      * correct config docstrings
      
      * remove comment
      
      * make style
      
      ---------
      Co-authored-by: sanchit-gandhi <sanchit@huggingface.co>
      Co-authored-by: sang-nguyen-ts <sang.nguyen@trustingsocial.com>
      Co-authored-by: Yoach Lacombe <yoach@huggingface.co>
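Several commits in the architecture-improvements entry above touch rotary position embeddings (adding RoPE, excluding padding from it, making it faster). As background only, the core RoPE operation rotates each adjacent pair of query/key features by an angle that depends on the token's position. The sketch below is illustrative and not taken from the parler_tts code; the name `rope_rotate` and the conventional `base=10000.0` are assumptions.

```python
import math

def rope_rotate(x, pos, base=10000.0):
    """Apply rotary position embedding to one feature vector.

    x: list of floats with even length; consecutive pairs (x[i], x[i+1])
       are rotated together by a frequency that shrinks with i.
    pos: integer position of the token (padding slots would typically be
       skipped so they do not advance `pos`).
    """
    d = len(x)
    out = []
    for i in range(0, d, 2):
        theta = pos / (base ** (i / d))  # per-pair rotation angle
        c, s = math.cos(theta), math.sin(theta)
        x1, x2 = x[i], x[i + 1]
        # 2-D rotation of the pair; preserves the pair's norm
        out.extend([x1 * c - x2 * s, x1 * s + x2 * c])
    return out

# At position 0 every angle is zero, so the rotation is the identity.
print(rope_rotate([1.0, 0.0, 1.0, 0.0], pos=0))  # → [1.0, 0.0, 1.0, 0.0]
```

Because each pair is rotated (not scaled), feature norms are preserved, and the relative angle between two positions depends only on their distance, which is what makes RoPE attention position-relative. The "don't include padding in rope" commit is consistent with computing `pos` over real tokens only.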
  4. 30 Apr, 2024 1 commit
  5. 12 Apr, 2024 1 commit
  6. 10 Apr, 2024 5 commits
  7. 09 Apr, 2024 6 commits
  8. 08 Apr, 2024 5 commits
  9. 05 Apr, 2024 1 commit
  10. 28 Feb, 2024 1 commit
  11. 14 Feb, 2024 2 commits
  12. 13 Feb, 2024 2 commits