Commits · 02b176c4ce14340d26d42825523f406959c6c202 · chenpangpang / transformers

03 Aug, 2022 1 commit

Fix torch version comparisons (#18460) · 02b176c4

LSinev authored Aug 03, 2022

Comparisons like
version.parse(torch.__version__) > version.parse("1.6")
are True for torch==1.6.0+cu101 or torch==1.6.0+cpu

version.parse(version.parse(torch.__version__).base_version) are preferred (and available in pytorch_utils.py

02b176c4

19 May, 2022 1 commit
- Fix bug in Wav2Vec2 pretrain example (#17326) · 48c22691
  ddobokki authored May 20, 2022
  
  48c22691
12 May, 2022 1 commit

Black preview (#17217) · afe5d42d

Sylvain Gugger authored May 12, 2022

* Black preview

* Fixup too!

* Fix check copies

* Use the same version as the CI

* Bump black

afe5d42d

09 Feb, 2022 1 commit
- Upgrade black to version ~=22.0 (#15565) · 7732d0fe
  Lysandre Debut authored Feb 09, 2022
```
* Upgrade black to version ~=22.0

* Check copies

* Fix code
```
  7732d0fe
11 Nov, 2021 1 commit
- fix --gradient_checkpointing (#13964) · 77262ef7
  Stas Bekman authored Nov 11, 2021
  
  77262ef7
22 Oct, 2021 1 commit
- Add missing --validation_split_percentage data args (#14119) · 05a2afc2
  Antonio Carlos Falcão Petri authored Oct 22, 2021
  
  05a2afc2
30 Jul, 2021 1 commit

fix typo in gradient_checkpointing arg (#12855) · 5c673efa

21jun authored Jul 30, 2021

help for `ModelArguments.gradient_checkpointing` should be
"If True, use gradient checkpointing to save memory
at the expense of slower backward pass."
not "Whether to freeze the feature extractor layers of the model."
(which is duplicated from `freeze_feature_extractor` arg)

5c673efa

15 Jul, 2021 1 commit

[Wav2Vec2] Correctly pad mask indices for PreTraining (#12748) · 2e9fb13f

Patrick von Platen authored Jul 15, 2021



* fix_torch_device_generate_test

* remove @

* start adding tests

* correct wav2vec2 pretraining

* up

* up
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

2e9fb13f

25 Jun, 2021 1 commit
- remove extra white space from log format (#12360) · 4a872cae
  Stas Bekman authored Jun 25, 2021
  
  4a872cae
14 Jun, 2021 1 commit
- [style] consistent nn. and nn.functional: part 4 `examples` (#12156) · 88e84186
  Stas Bekman authored Jun 14, 2021
```
* consistent nn. and nn.functional: p4 examples

* restore
```
  88e84186
09 Jun, 2021 1 commit

Wav2Vec2 Pretraining (#11306) · d472bd7b

Anton Lozhkov authored Jun 09, 2021



* Working quantizer forward

* Working quantizer forward

* Clean up unused model parts, test reproducibility

* Working quantizer forward

* Clean up unused model parts, test reproducibility

* Remove custom outputs from the shared ones

* correct conversion

* correct bug

* add first pretrain script

* save intermediate

* static shapes

* save intermediate

* finish first pretrain script version

* more refactor

* remove wanddb

* refactor more

* improve test

* correct perplexity compute bug

* finish model implementation

* add to docs

* finish docs

* finish pretraining script

* finish pretraining script

* remove wandb

* finish PR for merge

* finish config

* finish

* make deepspeed work

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* apply suggestions

* fix flaky test
Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

d472bd7b