Commits · e6f221c8d4829c9a3bca699c18a32043ab21f7a0 · chenpangpang / transformers

09 Sep, 2022 1 commit
- [JAX] Replace all jax.tree_* calls with jax.tree_util.tree_* (#18361) · e6f221c8
  Sanchit Gandhi authored Sep 09, 2022
```
* [JAX] Replace all jax.tree_* calls with jax.tree_util.tree_*

* fix double tree_util
```
  e6f221c8
07 Sep, 2022 1 commit

Accelerator end training (#18910) · 4f299b24

Nicholas Broad authored Sep 07, 2022

* add accelerator.end_training()

Some trackers need this to end their runs.

* fixup and quality

* add space

* add space again ?!?

4f299b24

06 Sep, 2022 1 commit
- updating gather function with gather_for_metrics in run_wav2vec2_pretraining (#18877) · 3b19c031
  arun99481 authored Sep 06, 2022
```
Co-authored-by: Arun Rajaram <arunrajaram@Aruns-MacBook-Pro.local>
```
  3b19c031
01 Sep, 2022 1 commit
- Tie weights after preparing the model in run_clm (#18855) · c61f116b
  Sylvain Gugger authored Sep 01, 2022
  
  c61f116b
25 Aug, 2022 1 commit
- streamlining 'checkpointing_steps' parsing (#18755) · e9442440
  Rahul A R authored Aug 25, 2022
  
  e9442440
24 Aug, 2022 3 commits

examples/run_summarization_no_trainer: fixed incorrect param to hasattr (#18720) · c55d6e4e
Rahul A R authored Aug 24, 2022
```
* fixed incorrect param to hasattr

* simplified condition checks

* code cleanup
```
c55d6e4e

Bump nbconvert from 6.3.0 to 6.5.1 in /examples/research_projects/lxmert (#18742) · e49c71fc

dependabot[bot] authored Aug 24, 2022

Bumps [nbconvert](https://github.com/jupyter/nbconvert) from 6.3.0 to 6.5.1.
- [Release notes](https://github.com/jupyter/nbconvert/releases)
- [Commits](https://github.com/jupyter/nbconvert/compare/6.3.0...6.5.1)

---
updated-dependencies:
- dependency-name: nbconvert
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

e49c71fc

Bump nbconvert in /examples/research_projects/visual_bert (#18741) · 5b249496

dependabot[bot] authored Aug 24, 2022

Bumps [nbconvert](https://github.com/jupyter/nbconvert) from 6.3.0 to 6.5.1.
- [Release notes](https://github.com/jupyter/nbconvert/releases)
- [Commits](https://github.com/jupyter/nbconvert/compare/6.3.0...6.5.1)

---
updated-dependencies:
- dependency-name: nbconvert
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

5b249496

22 Aug, 2022 1 commit
- remove check for main process for trackers initialization (#18706) · d90a36d1
  Atharva Ingle authored Aug 22, 2022
  
  d90a36d1
18 Aug, 2022 3 commits

`model.tie_weights()` should be applied after `accelerator.prepare()` (#18676) · e54a1b49

Atharva Ingle authored Aug 18, 2022

* `model.tie_weights()` should be applied after `accelerator.prepare`

Weight tying should be done after the model has been moved to the XLA device, as mentioned in the PyTorch/XLA troubleshooting guide [here](https://github.com/pytorch/xla/blob/master/TROUBLESHOOTING.md#xla-tensor-quirks).

* format code

e54a1b49

Add an examples folder for code downstream tasks (#18679) · bbbb453e

Loubna Ben Allal authored Aug 18, 2022

* add examples subfolder

* mention examples in codeparrot readme

* use Trainer optimizer and scheduler type and add output_dir as argument

* add example of text-to-python and python-to-text models

* mention the downstream examples in the readme

* fix typo

bbbb453e

Add evaluate to examples requirements (#18666) · 358fc186
Zachary Mueller authored Aug 18, 2022

358fc186

17 Aug, 2022 1 commit

Examples: add Bloom support for token classification (#18632) · 358478e7

Stefan Schweter authored Aug 17, 2022

* examples: add Bloom support for token classification (FLAX, PyTorch and TensorFlow)

* examples: remove support for Bloom in token classification (FLAX and TensorFlow currently have no support for it)

358478e7

16 Aug, 2022 1 commit

Update run_translation_no_trainer.py (#18637) · 25e651a2

zhoutang776 authored Aug 16, 2022

* Update run_translation_no_trainer.py

Found an error in selecting `no_decay` parameters, plus some small modifications for when the user continues training from a checkpoint.

* fix `no_decay` and `resume_step` issues

1. change the `no_decay` list
2. if users continue training their model from a provided checkpoint, `resume_step` is not initialized properly when `args.gradient_accumulation_steps != 1`

25e651a2
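
The `resume_step` problem from the second point can be sketched in plain Python. This assumes a checkpoint folder named `step_<N>` where N counts optimizer updates rather than batches; the function name `resume_position` and its parameters are hypothetical and not taken from the script.

```python
from typing import Tuple


def resume_position(
    checkpoint_name: str,
    batches_per_epoch: int,
    gradient_accumulation_steps: int,
) -> Tuple[int, int]:
    """Return (starting_epoch, batches_to_skip) for 'step_<N>' or 'epoch_<N>'."""
    kind, _, number = checkpoint_name.partition("_")
    if kind == "epoch":
        # An epoch checkpoint resumes at the start of the next epoch.
        return int(number) + 1, 0
    # Assumption: 'step_<N>' counts optimizer updates, and each update
    # consumed `gradient_accumulation_steps` batches from the dataloader.
    batches_seen = int(number) * gradient_accumulation_steps
    starting_epoch, batches_to_skip = divmod(batches_seen, batches_per_epoch)
    return starting_epoch, batches_to_skip


# 30 optimizer updates with 4 accumulation steps means 120 batches were consumed.
print(resume_position("step_30", batches_per_epoch=100, gradient_accumulation_steps=4))
```

Without the multiplication by `gradient_accumulation_steps`, the same checkpoint would resume 30 batches into the first epoch instead of 20 batches into the second.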

14 Aug, 2022 1 commit

Flax Remat for LongT5 (#17994) · d6eeb871

Karim Foda authored Aug 14, 2022



* [Flax] Add remat (gradient checkpointing)

* fix variable naming in test

* flip: checkpoint using a method

* fix naming

* fix class naming

* apply PVP's suggestions from code review

* add gradient_checkpointing to examples

* Add gradient_checkpointing to run_mlm_flax

* Add remat to longt5

* Add gradient checkpointing test longt5

* Fix args errors

* Fix remaining tests

* Make fixup & quality fixes

* replace kwargs

* remove unnecessary kwargs

* Make fixup changes

* revert long_t5_flax changes

* Remove return_dict and copy to LongT5

* Remove test_gradient_checkpointing
Co-authored-by: sanchit-gandhi <sanchit@huggingface.co>

d6eeb871

11 Aug, 2022 2 commits

Bump nbconvert in /examples/research_projects/visual_bert (#18566) · 05d3a43c

dependabot[bot] authored Aug 11, 2022

Bumps [nbconvert](https://github.com/jupyter/nbconvert) from 6.0.1 to 6.3.0.
- [Release notes](https://github.com/jupyter/nbconvert/releases)
- [Commits](https://github.com/jupyter/nbconvert/compare/6.0.1...6.3.0)

---
updated-dependencies:
- dependency-name: nbconvert
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

05d3a43c

Bump nbconvert from 6.0.1 to 6.3.0 in /examples/research_projects/lxmert (#18565) · 713ab6fd

dependabot[bot] authored Aug 11, 2022

Bumps [nbconvert](https://github.com/jupyter/nbconvert) from 6.0.1 to 6.3.0.
- [Release notes](https://github.com/jupyter/nbconvert/releases)
- [Commits](https://github.com/jupyter/nbconvert/compare/6.0.1...6.3.0)

---
updated-dependencies:
- dependency-name: nbconvert
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

713ab6fd

10 Aug, 2022 1 commit

TF Examples Rewrite (#18451) · 6eb51450

Matt authored Aug 10, 2022



* Finished QA example

* Dodge a merge conflict

* Update text classification and LM examples

* Update NER example

* New Keras metrics WIP, fix NER example

* Update NER example

* Update MC, summarization and translation examples

* Add XLA warnings when shapes are variable

* Make sure batch_size is consistently scaled by num_replicas

* Add PushToHubCallback to all models

* Add docs links for KerasMetricCallback

* Add docs links for prepare_tf_dataset and jit_compile

* Correct inferred model names

* Don't assume the dataset has 'lang'

* Don't assume the dataset has 'lang'

* Write metrics in text classification

* Add 'framework' to TrainingArguments and TFTrainingArguments

* Export metrics in all examples and add tests

* Fix training args for Flax

* Update command line args for translation test

* make fixup

* Fix accidentally running other tests in fp16

* Remove do_train/do_eval from run_clm.py

* Remove do_train/do_eval from run_mlm.py

* Add tensorflow tests to circleci

* Fix circleci

* Update examples/tensorflow/language-modeling/run_mlm.py
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Update examples/tensorflow/test_tensorflow_examples.py
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Update examples/tensorflow/translation/run_translation.py
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Update examples/tensorflow/token-classification/run_ner.py
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Fix save path for tests

* Fix some model card kwargs

* Explain the magical -1000

* Actually enable tests this time

* Skip text classification PR until we fix shape inference

* make fixup
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

6eb51450

08 Aug, 2022 3 commits

Update no_trainer.py scripts to include accelerate gradient accumulation wrapper (#18473) · a765b68a

Rasmus Arpe Fogh Jensen authored Aug 08, 2022

* Added accelerate gradient accumulation wrapper to run_image_classification_no_trainer.py example script

* make fixup changes

* PR comments

* changed input to Accelerator based on PR comment, ran make fixup

* Added comment explaining the sync_gradients statement

* Fixed lr scheduler max steps

* Changed run_clm_no_trainer.py script to use accelerate gradient accum wrapper

* Fixed all scripts except wav2vec2 pretraining to use accelerate gradient accum wrapper

* Added accelerate gradient accum wrapper for wav2vec2_pretraining_no_trainer.py script

* make fixup and lr_scheduler step inserted back into run_qa_beam_search_no_trainer.py

* removed changes to run_wav2vec2_pretraining_no_trainer.py script and fixed using wrong constant in qa_beam_search_no_trainer.py script

a765b68a
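
A plain-Python sketch of the bookkeeping behind gradient accumulation, with no Accelerate or PyTorch dependency. It assumes the optimizer is stepped every `gradient_accumulation_steps` batches and once more on the final batch of an epoch; the name `count_optimizer_updates` is hypothetical. It shows why the scheduler's maximum step count should be derived from optimizer updates rather than from batches.

```python
import math


def count_optimizer_updates(num_batches: int, gradient_accumulation_steps: int) -> int:
    """Count how many batches in one epoch would trigger an optimizer step."""
    updates = 0
    for step in range(num_batches):
        is_last_batch = step + 1 == num_batches
        # Assumption: gradients are synchronized (and the optimizer stepped)
        # at every accumulation boundary and at the end of the dataloader.
        sync_gradients = (step + 1) % gradient_accumulation_steps == 0 or is_last_batch
        if sync_gradients:
            updates += 1
    return updates


batches_per_epoch = 10
num_epochs = 3
updates_per_epoch = count_optimizer_updates(batches_per_epoch, gradient_accumulation_steps=4)

# The loop agrees with the closed form ceil(batches / accumulation steps).
assert updates_per_epoch == math.ceil(batches_per_epoch / 4)

# Total number of scheduler steps for the whole run.
max_train_steps = num_epochs * updates_per_epoch
print(updates_per_epoch, max_train_steps)
```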

Fix compatibility with 1.12 (#17925) · 70b0d4e1

Sylvain Gugger authored Aug 08, 2022



* Fix compatibility with 1.12

* Remove pin from examples requirements

* Update torch scatter version

* Fix compatibility with 1.12

* Remove pin from examples requirements

* Update torch scatter version

* fix torch.onnx.symbolic_opset12 import

* Reject bad version
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

70b0d4e1

Add seed setting to image classification example (#18519) · 88a0ce57
regisss authored Aug 08, 2022

88a0ce57

06 Aug, 2022 2 commits

`transformers-cli login` => `huggingface-cli login` (#18490) · 9129fd03

Julien Chaumond authored Aug 06, 2022

* zero chance anyone's using that constant no?

* `transformers-cli login` => `huggingface-cli login`

* `transformers-cli repo create` => `huggingface-cli repo create`

* `make style`

9129fd03

Just re-reading the whole doc every couple of months 😬 (#18489) · 8d1f9039

Julien Chaumond authored Aug 06, 2022

* Delete valohai.yaml

* NLP => ML

* typo

* website supports https

* datasets

* 60k + modalities

* unrelated link fixing for accelerate

* Ok those links were actually broken

* Fix link

* Make `AutoTokenizer` auto-link

* wording tweak

* add at least one non-nlp task

8d1f9039

04 Aug, 2022 2 commits
- Update no trainer examples for QA and Semantic Segmentation (#18474) · 0bf1e1ac
  Kian Sierra McGettigan authored Aug 04, 2022
```
* swag_no_trainer updated with gather_for_metrics

* Removed unused variable samples_seen

* updated examples with gather_for_metrics
```
  0bf1e1ac
- Update no trainer scripts for multiple-choice (#18468) · 330247ed
  Kian Sierra McGettigan authored Aug 04, 2022
```
* swag_no_trainer updated with gather_for_metrics

* Removed unused variable samples_seen
```
  330247ed
03 Aug, 2022 2 commits

Fix torch version comparisons (#18460) · 02b176c4

LSinev authored Aug 03, 2022

Comparisons like
version.parse(torch.__version__) > version.parse("1.6")
are True for torch==1.6.0+cu101 or torch==1.6.0+cpu

Comparisons against version.parse(version.parse(torch.__version__).base_version) are preferred (and available in pytorch_utils.py)

02b176c4
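
A minimal sketch of the difference, using hard-coded version strings in place of `torch.__version__` so it runs without PyTorch installed. The helper name `is_newer_than` is hypothetical and is not the helper in pytorch_utils.py; only `version.parse` and `base_version` come from the commit message.

```python
from packaging import version


def is_newer_than(installed: str, minimum: str) -> bool:
    """Compare release numbers only, ignoring local build tags such as '+cu101'."""
    installed_base = version.parse(version.parse(installed).base_version)
    return installed_base > version.parse(minimum)


# Stand-in for torch.__version__ on a CUDA build of PyTorch 1.6.
cuda_build = "1.6.0+cu101"

# Naive comparison: the local tag makes 1.6.0+cu101 sort after 1.6.
naive = version.parse(cuda_build) > version.parse("1.6")

# Base-version comparison: 1.6.0 is not strictly newer than 1.6.
fixed = is_newer_than(cuda_build, "1.6")

print(naive, fixed)
```

With the naive comparison, a `1.6.0+cu101` build counts as newer than `1.6`, so a check meant to require a later release passes by accident; comparing base versions avoids that.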

Update no trainer scripts for language modeling and image classification examples (#18443) · 3db4378b

Ritik Nandwal authored Aug 03, 2022

* Update no_trainer script for image-classification

* Update no_trainer scripts for language-modeling examples

* Remove unused variable

* Removing truncation from losses array for language modeling examples

3db4378b

02 Aug, 2022 1 commit
- fix run_clip README (#18332) · 5546fb61
  Yih-Dar authored Aug 02, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  5546fb61
01 Aug, 2022 6 commits

Add Flax BART pretraining script (#18297) · 3909d7f1

Duong A. Nguyen authored Aug 01, 2022



* add bart pretraining flax script

* fixup

* add bart pretraining flax script

* add BART to README

* add BART to README

* add BART to README

* add BART to README

* add BART to README

* add bos eos document

* Update README.md

* Update README.md

* Update examples/flax/language-modeling/run_bart_dlm_flax.py
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* final

* final

* final

* remove use_auth_token in from_config
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

3909d7f1

Fix ROUGE add example check and update README (#18398) · 941d2331
Sylvain Gugger authored Aug 01, 2022
```
* Fix ROUGE add example check and update README

* Stay consistent in values
```
941d2331
Correct the spelling of bleu metric (#18375) · 679d68a1
Ogundepo Odunayo authored Aug 01, 2022

679d68a1
Migrate metric to Evaluate in Pytorch examples (#18369) · 1f843991
atturaioe authored Aug 01, 2022
```
* Migrate metric to Evaluate in pytorch examples

* Remove unused imports
```
1f843991

Bump mistune from 0.8.4 to 2.0.3 in /examples/research_projects/lxmert (#18370) · 25ec12ea

dependabot[bot] authored Aug 01, 2022

Bumps [mistune](https://github.com/lepture/mistune) from 0.8.4 to 2.0.3.
- [Release notes](https://github.com/lepture/mistune/releases)
- [Changelog](https://github.com/lepture/mistune/blob/master/docs/changes.rst)
- [Commits](https://github.com/lepture/mistune/compare/v0.8.4...v2.0.3)

---
updated-dependencies:
- dependency-name: mistune
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

25ec12ea

Bump mistune in /examples/research_projects/visual_bert (#18371) · a7360385

dependabot[bot] authored Aug 01, 2022

Bumps [mistune](https://github.com/lepture/mistune) from 0.8.4 to 2.0.3.
- [Release notes](https://github.com/lepture/mistune/releases)
- [Changelog](https://github.com/lepture/mistune/blob/master/docs/changes.rst)
- [Commits](https://github.com/lepture/mistune/compare/v0.8.4...v2.0.3)

---
updated-dependencies:
- dependency-name: mistune
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

a7360385

29 Jul, 2022 1 commit

Replace `as_target` context managers by direct calls (#18325) · 986526a0

Sylvain Gugger authored Jul 29, 2022



* Preliminary work on tokenizers

* Quality + fix tests

* Treat processors

* Fix pad

* Remove all uses of  in tests, docs and examples

* Replace all as_target_tokenizer

* Fix tests

* Fix quality

* Update examples/flax/image-captioning/run_image_captioning_flax.py
Co-authored-by: amyeroberts <amy@huggingface.co>

* Style
Co-authored-by: amyeroberts <amy@huggingface.co>

986526a0

28 Jul, 2022 3 commits

Migrate metrics used in flax examples to Evaluate (#18348) · da503ea0

Vijay S Kalmath authored Jul 28, 2022

Currently, the Flax examples use the `load_metric` function from the
Datasets library; this commit migrates those calls to the `load` function
from the Evaluate library.

da503ea0

Migrate metric to Evaluate library for tensorflow examples (#18327) · a2586795

Vijay S Kalmath authored Jul 28, 2022

* Migrate metric to Evaluate library in tf examples

Currently, the TensorFlow examples use the `load_metric` function from the
Datasets library; this commit migrates those calls to the `load` function
from the Evaluate library.

Fix for #18306

* Migrate metric to Evaluate library in tf examples

Currently, the TensorFlow examples use the `load_metric` function from the
Datasets library; this commit migrates those calls to the `load` function
from the Evaluate library.

Fix for #18306

* Migrate `metric` to Evaluate for all tf examples

Currently, the TensorFlow examples use the `load_metric` function from the
Datasets library; this commit migrates those calls to the `load` function
from the Evaluate library.

a2586795

Fix codeparrot deduplication - ignore whitespaces (#18023) · 286a18fa
Loubna Ben Allal authored Jul 28, 2022
```
* ignore whitespaces for hash

* reformat code

* Update README.md
```
286a18fa
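
A sketch of whitespace-insensitive exact deduplication, assuming each file is hashed after stripping all whitespace from its content. The names `content_hash` and `deduplicate` are hypothetical, and the choice of MD5 is an assumption rather than something stated in the commit message.

```python
import hashlib
import re
from typing import List

WHITESPACE = re.compile(r"\s+")


def content_hash(text: str) -> str:
    """Hash the text with all whitespace removed, so formatting differences are ignored."""
    stripped = WHITESPACE.sub("", text)
    return hashlib.md5(stripped.encode("utf-8")).hexdigest()


def deduplicate(files: List[str]) -> List[str]:
    """Keep the first file for each distinct whitespace-insensitive hash."""
    seen = set()
    unique = []
    for text in files:
        digest = content_hash(text)
        if digest not in seen:
            seen.add(digest)
            unique.append(text)
    return unique


files = ["x = 1\n", "x=1", "y = 2"]
print(deduplicate(files))
```

Here the first two files differ only in spacing and a trailing newline, so they share a hash and only the first is kept.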

27 Jul, 2022 2 commits

Dev version · c89a592e
Lysandre authored Jul 27, 2022

c89a592e

[Flax] Fix incomplete batches in example scripts (#17863) · 7490a97c

Sanchit Gandhi authored Jul 27, 2022

* [Flax] Fix incomplete batches in example scripts

* fix dataloader batching

* convert jnp batch idxs to np array

* add missing `pad_shard_unpad` to final prediction generate step

* only `pad_shard_unpad` at inference time

* merge conflicts

* remove incomplete batch step from eval

* fix run_qa.py

* add `pad_shard_unpad` to run_flax_ner.py

* add `pad_shard_unpad` to run_flax_glue.py

* add `pad_shard_unpad` to run_image_classification.py

* make style

* fix mlm flax eval batches

* remove redundant imports

7490a97c
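
The idea behind padding an incomplete batch, sharding it across devices, and trimming the padding afterwards can be sketched with plain Python lists. This is not the Flax `pad_shard_unpad` utility; the name `pad_shard_unpad_sketch`, the `pad_value` parameter, and the use of lists instead of arrays are all assumptions made for illustration.

```python
from typing import Callable, List


def pad_shard_unpad_sketch(
    fn: Callable[[List[int]], List[int]],
    num_devices: int,
    pad_value: int = 0,
) -> Callable[[List[int]], List[int]]:
    """Wrap `fn` so it accepts batches whose size is not a multiple of `num_devices`."""

    def wrapped(batch: List[int]) -> List[int]:
        original_size = len(batch)
        # Pad so the batch divides evenly across devices.
        padding = (-original_size) % num_devices
        padded = batch + [pad_value] * padding
        # Split into one equally sized shard per device.
        per_device = len(padded) // num_devices
        shards = [
            padded[i * per_device:(i + 1) * per_device] for i in range(num_devices)
        ]
        # Run the per-shard function and concatenate the results.
        outputs: List[int] = []
        for shard in shards:
            outputs.extend(fn(shard))
        # Drop the outputs that correspond to padding.
        return outputs[:original_size]

    return wrapped


double_all = pad_shard_unpad_sketch(lambda shard: [2 * x for x in shard], num_devices=4)
print(double_all([1, 2, 3, 4, 5, 6]))
```

Because the padding is removed after the computation, the final incomplete batch of an evaluation set contributes every real example to the metrics instead of being dropped.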