Commits · 7490a97cac20cef6858f32e5f39a61f31ad64552 · chenpangpang / transformers

27 Jul, 2022 4 commits

[Flax] Fix incomplete batches in example scripts (#17863) · 7490a97c

Sanchit Gandhi authored Jul 27, 2022

* [Flax] Fix incomplete batches in example scripts

* fix dataloader batching

* convert jnp batch idxs to np array

* add missing `pad_shard_unpad` to final prediction generate step

* only `pad_shard_unpad` at inference time

* merge conflicts

* remove incomplete batch step from eval

* fix run_qa.py

* add `pad_shard_unpad` to run_flax_ner.py

* add `pad_shard_unpad` to run_flax_glue.py

* add `pad_shard_unpad` to run_image_classification.py

* make style

* fix mlm flax eval batches

* remove redundant imports

7490a97c

Remove all uses of six (#18318) · cf32b2ee
Sylvain Gugger authored Jul 27, 2022
```
* Remove all uses of six

* fix quality
```
cf32b2ee
Generalize decay_mask_fn to apply mask to all LayerNorm params (#18273) · 170fcaa6
Duong A. Nguyen authored Jul 27, 2022
```
* generalize decay_mask_fn to find all layernorm params

* fixup

* generalising decay_mask_fn
```
170fcaa6

Update CodeParrot readme to include training in Megatron (#17798) · 1d71ad89

Loubna Ben Allal authored Jul 27, 2022



* add info about megatron training

* upload models and datasets from CodeParrot organization

* upload models and datasets from CodeParrot organization

* Update examples/research_projects/codeparrot/README.md
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* Update examples/research_projects/codeparrot/README.md
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* Update examples/research_projects/codeparrot/README.md
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* Update examples/research_projects/codeparrot/README.md
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* Update examples/research_projects/codeparrot/README.md
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* fix typo and add comment about codeparrot vs megatron
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

1d71ad89

21 Jul, 2022 1 commit
- Fix `no_trainer` CI (#18242) · 99eb9b52
  Zachary Mueller authored Jul 21, 2022
```
* Fix all tests
```
  99eb9b52
19 Jul, 2022 1 commit
- Remove use_auth_token from the from_config method (#18192) · 4bea6584
  Duong A. Nguyen authored Jul 19, 2022
```
* remove use_auth_token from from_config

* restore use_auth_token from_pretrained run_t5_mlm_flax
```
  4bea6584
18 Jul, 2022 2 commits
- Fix incorrect type hint for lang (#18161) · a4f97e6c
  John Giorgi authored Jul 18, 2022
  
  a4f97e6c
- Fix check for falsey inputs in run_summarization (#18155) · c46d39f3
  John Giorgi authored Jul 18, 2022
  
  c46d39f3
13 Jul, 2022 1 commit
- Add summarization name mapping for MultiNews (#18117) · fde22c75
  John Giorgi authored Jul 13, 2022
```
* Add summarization name mapping for MultiNews

* Add summarization name mapping for MultiNews
```
  fde22c75
11 Jul, 2022 2 commits

Fix RESOURCE_EXHAUSTED error when dealing with large datasets in Flax example scripts (#18069) · 1e8140ca

Duong A. Nguyen authored Jul 11, 2022

* Fix RESOURCE_EXHAUSTED error for large datasets on Flax example scripts

* using np.permutation for creating batch_idx

* train_samples_idx -> training_samples_idx

* fix type hints

1e8140ca

Fix some typos. (#17560) · 95113d13

Yulv-git authored Jul 11, 2022



* Fix some typos.
Signed-off-by: Yulv-git <yulvchi@qq.com>

* Fix typo.
Signed-off-by: Yulv-git <yulvchi@qq.com>

* make fixup.

95113d13

06 Jul, 2022 1 commit
- Fix T5 incorrect weight decay in Trainer and official summarization example (#18002) · bf37e5c7
  ADAning authored Jul 06, 2022
```
* Add ALL_LAYERNORM_LAYERS for LayerNorm

* fix bug of appending layer norm
```
  bf37e5c7
29 Jun, 2022 1 commit
- Fix all is_torch_tpu_available issues (#17936) · 7c4c6f60
  Zachary Mueller authored Jun 29, 2022
```
* Fix all is_torch_tpu_available 
```
  7c4c6f60
28 Jun, 2022 1 commit
- Pin PyTorch in requirements as well · 5f1e67a5
  Sylvain Gugger authored Jun 28, 2022
  
  5f1e67a5
23 Jun, 2022 2 commits
- Properly calculate the total train iterations and recalculate num epochs in... · 75259b44
  Zachary Mueller authored Jun 23, 2022
```
Properly calculate the total train iterations and recalculate num epochs in no_trainer scripts (#17856)
```
  75259b44
- Change no trainer image_classification test (#17635) · acb709d5
  Zachary Mueller authored Jun 23, 2022
```
* Adjust test arguments and use a new example test
```
  acb709d5
22 Jun, 2022 3 commits

Bump numpy from 1.21.0 to 1.22.0 in /examples/research_projects/lxmert (#17817) · c366ce10

dependabot[bot] authored Jun 22, 2022

Bumps [numpy](https://github.com/numpy/numpy) from 1.21.0 to 1.22.0.
- [Release notes](https://github.com/numpy/numpy/releases)
- [Changelog](https://github.com/numpy/numpy/blob/main/doc/HOWTO_RELEASE.rst)
- [Commits](https://github.com/numpy/numpy/compare/v1.21.0...v1.22.0

)

---
updated-dependencies:
- dependency-name: numpy
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

c366ce10

Bump numpy in /examples/research_projects/visual_bert (#17816) · af0d21e7

dependabot[bot] authored Jun 22, 2022

Bumps [numpy](https://github.com/numpy/numpy) from 1.21.0 to 1.22.0.
- [Release notes](https://github.com/numpy/numpy/releases)
- [Changelog](https://github.com/numpy/numpy/blob/main/doc/HOWTO_RELEASE.rst)
- [Commits](https://github.com/numpy/numpy/compare/v1.21.0...v1.22.0

)

---
updated-dependencies:
- dependency-name: numpy
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

af0d21e7

Add logits_processor parameter, used by `generate`, to `Seq2SeqTrainer`... · 13570381

Eran Hirsch authored Jun 22, 2022

Add logits_processor parameter, used by `generate`, to `Seq2SeqTrainer` methods `evaluate` and `predict` (#17805)

* Add logits_processor parameter, used by `generate`, to `Seq2SeqTrainer` methods `evaluate` and `predict`

* Add all generate parameters to `Seq2SeqTrainer`, and also to `QuestionAnsweringSeq2SeqTrainer` which overrides it

* Remove `self._num_beams` from trainer classes

* - Run fixup
- Fix "Constraint" not exposed
- Fix synced_gpus to actually read from param

* Use kwargs

* Copy kwargs before making changes to it

* Fix style issues unused imports

13570381

21 Jun, 2022 1 commit

[CodeParrot] Near-deduplication with jaccard similarity (#17054) · da2bd2ae

Jia LI authored Jun 21, 2022



* deduplication draft

* update style

* update style test

* dummy test main

* rename modules

* rename functions

* return extremes in deduplicate_clusters

* update style

* cast str for gzip

* update doc string

* time processing

* use dataset map to compute minhash

* fill value for short token

* remove da map method

* update style

* use share object to multiprocess

* update style

* use f-string and minor fix
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>
Co-authored-by: Loubna Ben Allal <44069155+loubnabnl@users.noreply.github.com>

* update style

* use module parameters

* change ds_dedup to ds_filter

* save ds_dedup

* mv test to script tests

* make jaccard threshold a parameter of deduplicate_dataset

* update style

* add doc strings

* update style

* add doc string for DuplicationIndex

* save files into data dir

* update readme

* Update examples/research_projects/codeparrot/README.md
Co-authored-by: Loubna Ben Allal <44069155+loubnabnl@users.noreply.github.com>

* make near deduplication optional

* move near deduplication in README

* Update examples/research_projects/codeparrot/README.md
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* use f string
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>
Co-authored-by: Loubna Ben Allal <44069155+loubnabnl@users.noreply.github.com>

da2bd2ae

17 Jun, 2022 2 commits

Bump notebook in /examples/research_projects/lxmert (#17743) · e44a569f

dependabot[bot] authored Jun 17, 2022

Bumps [notebook](http://jupyter.org

) from 6.4.10 to 6.4.12.

---
updated-dependencies:
- dependency-name: notebook
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

e44a569f

Bump notebook in /examples/research_projects/visual_bert (#17742) · 5089a2d4

dependabot[bot] authored Jun 17, 2022

Bumps [notebook](http://jupyter.org

) from 6.4.10 to 6.4.12.

---
updated-dependencies:
- dependency-name: notebook
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

5089a2d4

16 Jun, 2022 1 commit
- v4.21.0.dev0 · 7c6ec195
  Sylvain Gugger authored Jun 16, 2022
  
  7c6ec195
15 Jun, 2022 1 commit
- Update requirements.txt (#17719) · 6ebeeeef
  Jeff Rasley authored Jun 15, 2022
  
  6ebeeeef
14 Jun, 2022 1 commit

Rag end2end new (#17650) · 9068fa6c

Shamane Siri authored Jun 15, 2022

* check

* update the RAG-end2end with new PL and RAY

* removed unwanted comments

9068fa6c

10 Jun, 2022 3 commits

update README.md (#17657) · 3114df41
Loubna Ben Allal authored Jun 10, 2022
```
- use CodeParrot scores of v1.1
- change evaluation command to use accelerate
```
3114df41

🐛

Properly raise `RepoNotFoundError` when not authenticated (#17651) · c99ddcc4

Simon Brandeis authored Jun 10, 2022

* Raise RepoNotFoundError in case of 401

* Include changes from revert-17646-skip_repo_not_found

* Add a comment

* 💄 Code quality

* 💚 Update `get_from_cache` test

* 💚 Code quality & skip failing test

c99ddcc4

Bump cookiecutter in /examples/research_projects/decision_transformer (#17645) · 1d463303

dependabot[bot] authored Jun 10, 2022

Bumps [cookiecutter](https://github.com/cookiecutter/cookiecutter) from 1.7.2 to 2.1.1.
- [Release notes](https://github.com/cookiecutter/cookiecutter/releases)
- [Changelog](https://github.com/cookiecutter/cookiecutter/blob/master/HISTORY.md)
- [Commits](https://github.com/cookiecutter/cookiecutter/compare/1.7.2...2.1.1

)

---
updated-dependencies:
- dependency-name: cookiecutter
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

1d463303

07 Jun, 2022 1 commit

Add examples telemetry (#17552) · 3cab9027

Sylvain Gugger authored Jun 07, 2022

* Add examples telemetry

* Alternative approach

* Add to all other examples

* Add to templates as well

* Put framework separately

* Same for TensorFlow

3cab9027

03 Jun, 2022 1 commit
- Update run_glue_no_trainer.py (#17546) · 254d9c06
  bhuang authored Jun 03, 2022
  
  254d9c06
01 Jun, 2022 2 commits
- Fix flakey no-trainer test (#17515) · 3766df4f
  Zachary Mueller authored Jun 01, 2022
  
  3766df4f
- Deal with the error when task is regression (#16330) · 028d4b7c
  fireindark707 authored Jun 01, 2022
  
  028d4b7c
27 May, 2022 1 commit

Improve notrainer examples (#17449) · d156898f

Sourab Mangrulkar authored May 28, 2022

* improve no-trainer examples

* Trigger CI

* adding comment to clarify tracker init on main process

* Trigger CI

* Trigger CI

* Trigger CI

d156898f

25 May, 2022 1 commit

Wav2vec2 finetuning shared file system (#17423) · a9eca743

Patrick von Platen authored May 25, 2022



* fix_torch_device_generate_test

* remove @

* [Fix shared file system]
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

a9eca743

24 May, 2022 2 commits

Bump tensorflow in /examples/research_projects/decision_transformer (#17400) · 1ef9a1ed

dependabot[bot] authored May 24, 2022

Bumps [tensorflow](https://github.com/tensorflow/tensorflow) from 2.8.0 to 2.8.1.
- [Release notes](https://github.com/tensorflow/tensorflow/releases)
- [Changelog](https://github.com/tensorflow/tensorflow/blob/master/RELEASE.md)
- [Commits](https://github.com/tensorflow/tensorflow/compare/v2.8.0...v2.8.1

)

---
updated-dependencies:
- dependency-name: tensorflow
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

1ef9a1ed

Add LayoutLMv3 (#17060) · 31ee80d5

NielsRogge authored May 24, 2022



* Make forward pass work

* More improvements

* Remove unused imports

* Remove timm dependency

* Improve loss calculation of token classifier

* Fix most tests

* Add docs

* Add model integration test

* Make all tests pass

* Add LayoutLMv3FeatureExtractor

* Improve integration test + make fixup

* Add example script

* Fix style

* Add LayoutLMv3Processor

* Fix style

* Add option to add visual labels

* Make more tokenizer tests pass

* Fix more tests

* Make more tests pass

* Fix bug and improve docs

* Fix import of processors

* Improve docstrings

* Fix toctree and improve docs

* Fix auto tokenizer

* Move tests to model folder

* Move tests to model folder

* change default behavior add_prefix_space

* add prefix space for fast

* add_prefix_spcae set to True for Fast

* no space before `unique_no_split` token

* add test to hightligh special treatment of added tokens

* fix `test_batch_encode_dynamic_overflowing` by building a long enough example

* fix `test_full_tokenizer` with add_prefix_token

* Fix tokenizer integration test

* Make the code more readable

* Add tests for LayoutLMv3Processor

* Fix style

* Add model to README and update init

* Apply suggestions from code review

* Replace asserts by value errors

* Add suggestion by @ducviet00

* Add model to doc tests

* Simplify script

* Improve README

* a step ahead to fix

* Update pair_input_test

* Make all tokenizer tests pass - phew

* Make style

* Add LayoutLMv3 to CI job

* Fix auto mapping

* Fix CI job name

* Make all processor tests pass

* Make tests of LayoutLMv2 and LayoutXLM consistent

* Add copied from statements to fast tokenizer

* Add copied from statements to slow tokenizer

* Remove add_visual_labels attribute

* Fix tests

* Add link to notebooks

* Improve docs of LayoutLMv3Processor

* Fix reference to section
Co-authored-by: SaulLu <lucilesaul.com@gmail.com>
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>

31ee80d5

23 May, 2022 1 commit

Fix CodeParrot training script (#17291) · b48ac1a0

Loubna Ben Allal authored May 23, 2022



* average loss over batches and accumulated steps for tracking

* fix layernorm weight decay

* use AdamW from Pytorch instead of Transformers

* add shuffling of sequences inside the batches

* add shuffling of sequences inside the batches

* add logging dir and reformat code

* fix lr tracking

* remove Mistral scaling

* keep Mistral scaling

* reformat code

* fix error

* fix error

* use shuffling function from Pytorch

* remove argument for shuffling batch sequences as it isn't optional

* update package versions and install accelerate from source

* remove unused package

* Update loss average over accumulated steps
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* Update loss average over accumulated steps
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* use one shuffle buffer argument

* compute avg_loss in one line
Co-authored-by: Loubna ben allal <loubnabenallal@gmail.com>
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

b48ac1a0

19 May, 2022 1 commit
- Fix bug in Wav2Vec2 pretrain example (#17326) · 48c22691
  ddobokki authored May 20, 2022
  
  48c22691
18 May, 2022 2 commits

Fix metric calculation in examples and setup tests to run on multi-gpu for... · 1762ded3

Zachary Mueller authored May 18, 2022

Fix metric calculation in examples and setup tests to run on multi-gpu for no_trainer scripts (#17331)

* Fix length in no_trainer examples

* Add setup and teardown

* Use new accelerator config generator to automatically make tests able to run based on environment

1762ded3

Fix style · 47107028
Sylvain Gugger authored May 18, 2022

47107028