Commits · 4b796978656e461177a83d58ec3c2b06152c63db · chenpangpang / transformers

25 Aug, 2023 1 commit

[`Refactor`] Move third-party related utility files into `integrations/` folder

(#25599) · 4b796978

Younes Belkada authored Aug 25, 2023



* move deepspeed to `lib_integrations.deepspeed`

* more refactor

* oops

* fix slow tests

* Fix docs

* fix docs

* address feedback

* address feedback

* final modifs for PEFT

* fixup

* ok now

* trigger CI

* trigger CI again

* Update docs/source/en/main_classes/deepspeed.md
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* import from `integrations`

* address feedback

* revert removal of `deepspeed` module

* revert removal of `deepspeed` module

* fix conflicts

* ooops

* oops

* add deprecation warning

* place it on the top

* put `FutureWarning`

* fix conflicts with not_doctested.txt

* add back `bitsandbytes` module with a depr warning

* fix

* fix

* fixup

* oops

* fix doctests

---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
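The "revert removal", "add deprecation warning" and "put `FutureWarning`" steps above describe a common pattern: keep the old import location alive and warn when it is used. A minimal sketch of that pattern follows. The helper `warn_moved` and the message wording are hypothetical, and the module names are only inferred from the bullet list; none of this is taken from the actual diff.

```python
import warnings


def warn_moved(old_location: str, new_location: str) -> None:
    """Emit a FutureWarning telling users that an import location has moved.

    Hypothetical helper for illustration only; the names passed in below are
    assumptions based on the commit message, not the real implementation.
    """
    warnings.warn(
        f"{old_location} is deprecated; import from {new_location} instead.",
        FutureWarning,
        stacklevel=2,  # point the warning at the caller, not at this helper
    )


# Record warnings so the behaviour can be inspected instead of only printed.
with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    warn_moved("transformers.deepspeed", "transformers.integrations")

print(caught[0].category.__name__, "-", caught[0].message)
```

`FutureWarning` is shown to end users by default, whereas `DeprecationWarning` is hidden in most contexts, which is a plausible reason for the "put `FutureWarning`" step.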

4b796978

25 Apr, 2023 1 commit
- Avoid invalid escape sequences, use raw strings (#22936) · 54272503
  Lingepumpe authored Apr 25, 2023
```
* Avoid invalid escape sequences, use raw strings

* Integrate PR feedback
```
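As a rough illustration of what this change is about (not code from the PR): a regular expression written in a plain string literal relies on Python passing unknown escapes such as a backslash followed by `d` through unchanged, which newer interpreters warn about. A raw string spells the same pattern without that reliance. The pattern and sample input below are made up for the example.

```python
import re

# Raw string: backslashes reach the regex engine untouched.
pattern_raw = r"\d+\.\d+"

# Equivalent plain literal with every backslash doubled by hand.
pattern_escaped = "\\d+\\.\\d+"

# Both spellings produce the same string object contents.
same_pattern = pattern_raw == pattern_escaped

# Match the leading "major.minor" part of a version-like string.
match = re.match(pattern_raw, "1.6.0+cu101")
print(same_pattern, match.group(0))
```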
  54272503
22 Feb, 2023 1 commit
- Apply ruff flake8-comprehensions (#21694) · 5e8c8eb5
  Aaron Gokaslan authored Feb 22, 2023
  
  5e8c8eb5
06 Feb, 2023 1 commit

Update quality tooling for formatting (#21480) · 6f79d264

Sylvain Gugger authored Feb 06, 2023

* Result of black 23.1

* Update target to Python 3.7

* Switch flake8 to ruff

* Configure isort

* Configure isort

* Apply isort with line limit

* Put the right black version

* adapt black in check copies

* Fix copies

6f79d264

27 Sep, 2022 1 commit

add wav2vec2_alignment (#16782) · ea540a59

Arijit Mukherjee authored Sep 27, 2022



* add wav2vec2_alignment

* Update alignment.py

* Update examples/research_projects/wav2vec2/alignment.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update examples/research_projects/wav2vec2/alignment.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update examples/research_projects/wav2vec2/alignment.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update examples/research_projects/wav2vec2/alignment.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update README.md

* fix style

* fix imports

* fix multithread

* fix bash script

* [@anton-l] Style fixes and docstrings

* [@anton-l] Style fixes and docstrings

* Update alignment.py

fix blank id in backtrack
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: anton-l <aglozhkov@gmail.com>

ea540a59

03 Aug, 2022 1 commit

Fix torch version comparisons (#18460) · 02b176c4

LSinev authored Aug 03, 2022

Comparisons like
version.parse(torch.__version__) > version.parse("1.6")
are True for torch==1.6.0+cu101 or torch==1.6.0+cpu, because a local version suffix such as +cu101 sorts after the plain 1.6.0 release.

Comparing against version.parse(version.parse(torch.__version__).base_version) is preferred (and is available in pytorch_utils.py).
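The pitfall described in this commit message can be reproduced without installing torch by parsing the version strings directly. This is a minimal sketch that assumes the third-party `packaging` library is available; the helper name `is_newer_than` is hypothetical and is not the helper defined in pytorch_utils.py.

```python
from packaging import version


def is_newer_than(installed: str, minimum: str) -> bool:
    """Compare only the base version, ignoring local suffixes like +cu101.

    Hypothetical helper for illustration.
    """
    base = version.parse(version.parse(installed).base_version)
    return base > version.parse(minimum)


# Naive comparison: the local suffix makes 1.6.0+cu101 sort above 1.6.
naive = version.parse("1.6.0+cu101") > version.parse("1.6")

# Base-version comparison: 1.6.0 is not newer than 1.6.
strict = is_newer_than("1.6.0+cu101", "1.6")

print(naive, strict)
```

With the base version, a build such as `1.7.1+cpu` still compares as newer than `1.6`, so only the spurious matches on local builds of the same release go away.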

02b176c4

29 Jul, 2022 1 commit

Replace `as_target` context managers by direct calls (#18325) · 986526a0

Sylvain Gugger authored Jul 29, 2022



* Preliminary work on tokenizers

* Quality + fix tests

* Treat processors

* Fix pad

* Remove all uses of  in tests, docs and examples

* Replace all as_target_tokenizer

* Fix tests

* Fix quality

* Update examples/flax/image-captioning/run_image_captioning_flax.py
Co-authored-by: amyeroberts <amy@huggingface.co>

* Style
Co-authored-by: amyeroberts <amy@huggingface.co>

986526a0

19 May, 2022 1 commit
- Fix bug in Wav2Vec2 pretrain example (#17326) · 48c22691
  ddobokki authored May 20, 2022
  
  48c22691
12 May, 2022 1 commit

Black preview (#17217) · afe5d42d

Sylvain Gugger authored May 12, 2022

* Black preview

* Fixup too!

* Fix check copies

* Use the same version as the CI

* Bump black

afe5d42d

30 Mar, 2022 1 commit
- [examples] max samples can't be bigger than the len of dataset (#16501) · a73281e3
  Stas Bekman authored Mar 30, 2022
```
* [examples] max samples can't be bigger than the len of dataset

* do tf and flax
```
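The idea behind this fix can be sketched as clamping the requested sample count to the dataset length before selecting rows. `ToyDataset` and `take_max_samples` below are hypothetical stand-ins written for this illustration; they are not the classes or functions used in the example scripts.

```python
class ToyDataset:
    """Hypothetical stand-in for a dataset exposing `__len__` and `select`."""

    def __init__(self, rows):
        self.rows = list(rows)

    def __len__(self):
        return len(self.rows)

    def select(self, indices):
        # List indexing raises IndexError for an out-of-range index,
        # which is the failure mode the clamp below avoids.
        return ToyDataset(self.rows[i] for i in indices)


def take_max_samples(dataset, max_samples):
    """Return at most `max_samples` rows, never more than the dataset holds."""
    count = min(len(dataset), max_samples)
    return dataset.select(range(count))


small = ToyDataset(["a", "b", "c"])
subset = take_max_samples(small, 10)  # asks for more rows than exist
print(len(subset))
```

Without the `min(...)` clamp, `small.select(range(10))` would raise an `IndexError` in this toy setup.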
  a73281e3
23 Mar, 2022 1 commit

Updates the default branch from master to main (#16326) · eca77f47

Lysandre Debut authored Mar 23, 2022



* Updates the default branch from master to main

* Links from `master` to `main`

* Typo

* Update examples/flax/README.md
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

eca77f47

12 Mar, 2022 1 commit

[Deepspeed] add support for bf16 mode (#14569) · 580dd87c

Stas Bekman authored Mar 11, 2022



* [WIP] add support for bf16 mode

* prep for bf16

* prep for bf16

* fix; zero2/bf16 is ok

* check bf16 is available

* test fixes

* enable zero3_bf16

* config files

* docs

* split stage_dtype; merge back to non-dtype-specific config file

* fix doc

* cleanup

* cleanup

* bfloat16 => bf16 to match the PR changes

* s/zero_gather_fp16_weights_on_model_save/zero_gather_16bit_weights_on_model_save/; s/save_fp16_model/save_16bit_model/

* test fixes/skipping

* move

* fix

* Update docs/source/main_classes/deepspeed.mdx
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* backticks

* cleanup

* cleanup

* cleanup

* new version

* add note about grad accum in bf16
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

580dd87c

09 Feb, 2022 1 commit
- Upgrade black to version ~=22.0 (#15565) · 7732d0fe
  Lysandre Debut authored Feb 09, 2022
```
* Upgrade black to version ~=22.0

* Check copies

* Fix code
```
  7732d0fe
10 Jan, 2022 1 commit

[Wav2Vec2 Speech Event] Add speech event v2 (#15083) · d72343d2

Patrick von Platen authored Jan 10, 2022

* up

* up

* up

* up

* up

* up

* improve

* up

* up

* Update src/transformers/trainer.py

* up

* up

* up

d72343d2

11 Nov, 2021 1 commit
- fix --gradient_checkpointing (#13964) · 77262ef7
  Stas Bekman authored Nov 11, 2021
  
  77262ef7
22 Oct, 2021 1 commit
- Add missing --validation_split_percentage data args (#14119) · 05a2afc2
  Antonio Carlos Falcão Petri authored Oct 22, 2021
  
  05a2afc2
14 Oct, 2021 1 commit
- up (#14008) · 7fb2a8b3
  Patrick von Platen authored Oct 14, 2021
  
  7fb2a8b3
08 Aug, 2021 1 commit
- Update README.md · 24cbf6bc
  Patrick von Platen authored Aug 08, 2021
  
  24cbf6bc
30 Jul, 2021 1 commit

fix typo in gradient_checkpointing arg (#12855) · 5c673efa

21jun authored Jul 30, 2021

help for `ModelArguments.gradient_checkpointing` should be
"If True, use gradient checkpointing to save memory
at the expense of slower backward pass."
not "Whether to freeze the feature extractor layers of the model."
(which is duplicated from `freeze_feature_extractor` arg)
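For context, a sketch of how such help strings are typically attached to argument dataclasses, with the two fields carrying the distinct help texts quoted above. The field defaults and the overall class shape are assumptions made for illustration; only the two help strings come from the commit message.

```python
from dataclasses import dataclass, field, fields


@dataclass
class ModelArguments:
    # Defaults here are assumed for the sketch, not taken from the script.
    freeze_feature_extractor: bool = field(
        default=True,
        metadata={"help": "Whether to freeze the feature extractor layers of the model."},
    )
    gradient_checkpointing: bool = field(
        default=False,
        metadata={
            "help": "If True, use gradient checkpointing to save memory "
            "at the expense of slower backward pass."
        },
    )


# Collect help texts by field name and check that none is copy-pasted.
help_by_name = {f.name: f.metadata["help"] for f in fields(ModelArguments)}
has_duplicates = len(set(help_by_name.values())) != len(help_by_name)
print(has_duplicates)
```

A check like `has_duplicates` is one cheap way to catch the kind of copy-paste slip this commit fixes.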

5c673efa

23 Jul, 2021 1 commit
- [tests] fix logging_steps requirements (#12860) · 98364ea7
  Stas Bekman authored Jul 23, 2021
  
  98364ea7
15 Jul, 2021 1 commit

[Wav2Vec2] Correctly pad mask indices for PreTraining (#12748) · 2e9fb13f

Patrick von Platen authored Jul 15, 2021



* fix_torch_device_generate_test

* remove @

* start adding tests

* correct wav2vec2 pretraining

* up

* up
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

2e9fb13f

25 Jun, 2021 1 commit
- remove extra white space from log format (#12360) · 4a872cae
  Stas Bekman authored Jun 25, 2021
  
  4a872cae
14 Jun, 2021 1 commit
- [style] consistent nn. and nn.functional: part 4 `examples` (#12156) · 88e84186
  Stas Bekman authored Jun 14, 2021
```
* consistent nn. and nn.functional: p4 examples

* restore
```
  88e84186
09 Jun, 2021 2 commits

Wav2Vec2 Pretraining (#11306) · d472bd7b

Anton Lozhkov authored Jun 09, 2021



* Working quantizer forward

* Working quantizer forward

* Clean up unused model parts, test reproducibility

* Working quantizer forward

* Clean up unused model parts, test reproducibility

* Remove custom outputs from the shared ones

* correct conversion

* correct bug

* add first pretrain script

* save intermediate

* static shapes

* save intermediate

* finish first pretrain script version

* more refactor

* remove wandb

* refactor more

* improve test

* correct perplexity compute bug

* finish model implementation

* add to docs

* finish docs

* finish pretraining script

* finish pretraining script

* remove wandb

* finish PR for merge

* finish config

* finish

* make deepspeed work

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* apply suggestions

* fix flaky test
Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

d472bd7b

sync LayerDrop for Wav2Vec2Encoder + tests (#12076) · d14e0af2
Stas Bekman authored Jun 09, 2021

d14e0af2

08 Jun, 2021 1 commit

[Deepspeed Wav2vec2] integration (#11638) · 11d86d3d

Stas Bekman authored Jun 08, 2021

* wip

* wip - but working with https://github.com/microsoft/DeepSpeed/pull/1044

* cleanup

* workaround

* working 5/8 modes

* solve fp32 distributed zero3

* style

* sync

* sync

* rework

* deprecation

* cleanup

* https://github.com/microsoft/DeepSpeed/pull/1044 pr was merged

* clean up

* add a guide

* more prose

* more prose

* fix

* more prose

* sub_group_size was too big

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* refactor

* bug fix

* make the true check explicit

* new deepspeed release
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

11d86d3d

12 May, 2021 1 commit
- remove defaults to None if optional (#11703) · 77f4c46b
  Philip May authored May 12, 2021
  
  77f4c46b
14 Apr, 2021 1 commit
- Save the Wav2Vec2 processor before training starts (#10910) · 653076ca
  Nithin Holla authored Apr 14, 2021
```
Co-authored-by: nithin19 <nithin@amberscript.com>
```
  653076ca
30 Mar, 2021 1 commit
- fix md file to avoid evaluation crash (#10962) · e031162a
  Yih-Dar authored Mar 30, 2021
  
  e031162a
22 Mar, 2021 2 commits
- Update FINE_TUNE_XLSR_WAV2VEC2.md (#10849) · 29904a96
  Qiushi Pan authored Mar 22, 2021
```
Fix typo.
```
  29904a96
- push (#10846) · 0f226f78
  Patrick von Platen authored Mar 22, 2021
  
  0f226f78
21 Mar, 2021 4 commits
- Update FINE_TUNE_XLSR_WAV2VEC2.md · 82b8d8c7
  Suraj Patil authored Mar 21, 2021
  
  82b8d8c7
- Update FINE_TUNE_XLSR_WAV2VEC2.md · af6125ff
  Patrick von Platen authored Mar 21, 2021
  
  af6125ff
- small improvements for wav2vec2 info script (#10829) · 5aaf6e14
  Patrick von Platen authored Mar 21, 2021
  
  5aaf6e14
- add doc for Local machine (#10828) · 68b55885
  Suraj Patil authored Mar 21, 2021
  
  68b55885
19 Mar, 2021 3 commits
- wav2vec doc tweaks (#10808) · 1438c487
  Julien Chaumond authored Mar 19, 2021
```
* wording/typos tweaks

* Make model upload instructions simpler
```
  1438c487
- Update FINE_TUNE_XLSR_WAV2VEC2.md · b9570a81
  Patrick von Platen authored Mar 19, 2021
  
  b9570a81
- [XLSR-Wav2Vec2 Info doc] Add a couple of lines (#10806) · e8968bd0
  Patrick von Platen authored Mar 19, 2021
```
* finish

* fix

* fix

* fix

* fix
```
  e8968bd0
18 Mar, 2021 2 commits
- Update FINE_TUNE_XLSR_WAV2VEC2.md · 2ae67822
  Patrick von Platen authored Mar 19, 2021
  
  2ae67822
- Update FINE_TUNE_XLSR_WAV2VEC2.md · 68a32159
  Patrick von Platen authored Mar 19, 2021
  
  68a32159