Commits · 9129fd0377e4d46cb2d0ea28dc1eb91a15f65b77 · chenpangpang / transformers

06 Aug, 2022 2 commits

`transformers-cli login` => `huggingface-cli login` (#18490) · 9129fd03

Julien Chaumond authored Aug 06, 2022

* zero chance anyone's using that constant no?

* `transformers-cli login` => `huggingface-cli login`

* `transformers-cli repo create` => `huggingface-cli repo create`

* `make style`

9129fd03

Just re-reading the whole doc every couple of months

😬

(#18489) · 8d1f9039

Julien Chaumond authored Aug 06, 2022

* Delete valohai.yaml

* NLP => ML

* typo

* website supports https

* datasets

* 60k + modalities

* unrelated link fixing for accelerate

* Ok those links were actually broken

* Fix link

* Make `AutoTokenizer` auto-link

* wording tweak

* add at least one non-nlp task

8d1f9039

04 Aug, 2022 2 commits
- Update no trainer examples for QA and Semantic Segmentation (#18474) · 0bf1e1ac
  Kian Sierra McGettigan authored Aug 04, 2022
```
* swag_no_trainer updated for with gather_metrics

* Removed unused variable samples_seen

* updated examples with gather_for_metrics
```
  0bf1e1ac
- Update no trainer scripts for multiple-choice (#18468) · 330247ed
  Kian Sierra McGettigan authored Aug 04, 2022
```
* swag_no_trainer updated for with gather_metrics

* Removed unused variable samples_seen
```
  330247ed
03 Aug, 2022 2 commits

Fix torch version comparisons (#18460) · 02b176c4

LSinev authored Aug 03, 2022

Comparisons like
version.parse(torch.__version__) > version.parse("1.6")
are True for torch==1.6.0+cu101 or torch==1.6.0+cpu

version.parse(version.parse(torch.__version__).base_version) are preferred (and available in pytorch_utils.py

02b176c4

Update no trainer scripts for language modeling and image classification examples (#18443) · 3db4378b

Ritik Nandwal authored Aug 03, 2022

* Update no_trainer script for image-classification

* Update no_trainer scripts for language-modeling examples

* Remove unused variable

* Removing truncation from losses array for language modeling examples

3db4378b

02 Aug, 2022 1 commit
- fix run_clip README (#18332) · 5546fb61
  Yih-Dar authored Aug 02, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  5546fb61
01 Aug, 2022 6 commits

Add Flax BART pretraining script (#18297) · 3909d7f1

Duong A. Nguyen authored Aug 01, 2022



* add bart pretraining flax script

* fixup

* add bart pretraining flax script

* add BART to README

* add BART to README

* add BART to README

* add BART to README

* add BART to README

* add bos eos document

* Update README.md

* Update README.md

* Update examples/flax/language-modeling/run_bart_dlm_flax.py
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* final

* final

* final

* remove use_auth_token ing from_config
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

3909d7f1

Fix ROUGE add example check and update README (#18398) · 941d2331
Sylvain Gugger authored Aug 01, 2022
```
* Fix ROUGE add example check and update README

* Stay consistent in values
```
941d2331
Correct the spelling of bleu metric (#18375) · 679d68a1
Ogundepo Odunayo authored Aug 01, 2022

679d68a1
Migrate metric to Evaluate in Pytorch examples (#18369) · 1f843991
atturaioe authored Aug 01, 2022
```
* Migrate metric to Evaluate in pytorch examples

* Remove unused imports
```
1f843991

Bump mistune from 0.8.4 to 2.0.3 in /examples/research_projects/lxmert (#18370) · 25ec12ea

dependabot[bot] authored Aug 01, 2022

Bumps [mistune](https://github.com/lepture/mistune) from 0.8.4 to 2.0.3.
- [Release notes](https://github.com/lepture/mistune/releases)
- [Changelog](https://github.com/lepture/mistune/blob/master/docs/changes.rst)
- [Commits](https://github.com/lepture/mistune/compare/v0.8.4...v2.0.3

)

---
updated-dependencies:
- dependency-name: mistune
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

25ec12ea

Bump mistune in /examples/research_projects/visual_bert (#18371) · a7360385

dependabot[bot] authored Aug 01, 2022

Bumps [mistune](https://github.com/lepture/mistune) from 0.8.4 to 2.0.3.
- [Release notes](https://github.com/lepture/mistune/releases)
- [Changelog](https://github.com/lepture/mistune/blob/master/docs/changes.rst)
- [Commits](https://github.com/lepture/mistune/compare/v0.8.4...v2.0.3

)

---
updated-dependencies:
- dependency-name: mistune
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

a7360385

29 Jul, 2022 1 commit

Replace `as_target` context managers by direct calls (#18325) · 986526a0

Sylvain Gugger authored Jul 29, 2022



* Preliminary work on tokenizers

* Quality + fix tests

* Treat processors

* Fix pad

* Remove all uses of  in tests, docs and examples

* Replace all as_target_tokenizer

* Fix tests

* Fix quality

* Update examples/flax/image-captioning/run_image_captioning_flax.py
Co-authored-by: amyeroberts <amy@huggingface.co>

* Style
Co-authored-by: amyeroberts <amy@huggingface.co>

986526a0

28 Jul, 2022 3 commits

Migrate metrics used in flax examples to Evaluate (#18348) · da503ea0

Vijay S Kalmath authored Jul 28, 2022

Currently, tensorflow examples use the `load_metric` function from
Datasets library, commit migrates function call to `load` function
from Evaluate library.

da503ea0

Migrate metric to Evaluate library for tensorflow examples (#18327) · a2586795

Vijay S Kalmath authored Jul 28, 2022

* Migrate metric to Evaluate library in tf examples

Currently tensorflow examples use `load_metric` function from Datasets
library , commit migrates function call to `load` function to
Evaluate library.

Fix for #18306

* Migrate metric to Evaluate library in tf examples

Currently tensorflow examples use `load_metric` function from Datasets
library , commit migrates function call to `load` function to
Evaluate library.

Fix for #18306

* Migrate `metric` to Evaluate for all tf examples

Currently tensorflow examples use `load_metric` function from Datasets
library , commit migrates function call to `load` function to
Evaluate library.

a2586795

Fix codeparrot deduplication - ignore whitespaces (#18023) · 286a18fa
Loubna Ben Allal authored Jul 28, 2022
```
* ignore whitspaces for hash

* reformat code

* Update README.md
```
286a18fa

27 Jul, 2022 5 commits

Dev version · c89a592e
Lysandre authored Jul 27, 2022

c89a592e

[Flax] Fix incomplete batches in example scripts (#17863) · 7490a97c

Sanchit Gandhi authored Jul 27, 2022

* [Flax] Fix incomplete batches in example scripts

* fix dataloader batching

* convert jnp batch idxs to np array

* add missing `pad_shard_unpad` to final prediction generate step

* only `pad_shard_unpad` at inference time

* merge conflicts

* remove incomplete batch step from eval

* fix run_qa.py

* add `pad_shard_unpad` to run_flax_ner.py

* add `pad_shard_unpad` to run_flax_glue.py

* add `pad_shard_unpad` to run_image_classification.py

* make style

* fix mlm flax eval batches

* remove redundant imports

7490a97c

Remove all uses of six (#18318) · cf32b2ee
Sylvain Gugger authored Jul 27, 2022
```
* Remove all uses of six

* fix quality
```
cf32b2ee
Generalize decay_mask_fn to apply mask to all LayerNorm params (#18273) · 170fcaa6
Duong A. Nguyen authored Jul 27, 2022
```
* generalize decay_mask_fn to find all layernorm params

* fixup

* generalising decay_mask_fn
```
170fcaa6

Update CodeParrot readme to include training in Megatron (#17798) · 1d71ad89

Loubna Ben Allal authored Jul 27, 2022



* add info about megatron training

* upload models and datasets from CodeParrot organization

* upload models and datasets from CodeParrot organization

* Update examples/research_projects/codeparrot/README.md
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* Update examples/research_projects/codeparrot/README.md
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* Update examples/research_projects/codeparrot/README.md
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* Update examples/research_projects/codeparrot/README.md
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* Update examples/research_projects/codeparrot/README.md
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* fix typo and add comment about codeparrot vs megatron
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

1d71ad89

21 Jul, 2022 1 commit
- Fix `no_trainer` CI (#18242) · 99eb9b52
  Zachary Mueller authored Jul 21, 2022
```
* Fix all tests
```
  99eb9b52
19 Jul, 2022 1 commit
- Remove use_auth_token from the from_config method (#18192) · 4bea6584
  Duong A. Nguyen authored Jul 19, 2022
```
* remove use_auth_token from from_config

* restore use_auth_token from_pretrained run_t5_mlm_flax
```
  4bea6584
18 Jul, 2022 2 commits
- Fix incorrect type hint for lang (#18161) · a4f97e6c
  John Giorgi authored Jul 18, 2022
  
  a4f97e6c
- Fix check for falsey inputs in run_summarization (#18155) · c46d39f3
  John Giorgi authored Jul 18, 2022
  
  c46d39f3
13 Jul, 2022 1 commit
- Add summarization name mapping for MultiNews (#18117) · fde22c75
  John Giorgi authored Jul 13, 2022
```
* Add summarization name mapping for MultiNews

* Add summarization name mapping for MultiNews
```
  fde22c75
11 Jul, 2022 2 commits

Fix RESOURCE_EXHAUSTED error when dealing with large datasets in Flax example scripts (#18069) · 1e8140ca

Duong A. Nguyen authored Jul 11, 2022

* Fix RESOURCE_EXHAUSTED error for large datasets on Flax example scripts

* using np.permutation for creating batch_idx

* train_samples_idx -> training_samples_idx

* fix type hints

1e8140ca

Fix some typos. (#17560) · 95113d13

Yulv-git authored Jul 11, 2022



* Fix some typos.
Signed-off-by: Yulv-git <yulvchi@qq.com>

* Fix typo.
Signed-off-by: Yulv-git <yulvchi@qq.com>

* make fixup.

95113d13

06 Jul, 2022 1 commit
- Fix T5 incorrect weight decay in Trainer and official summarization example (#18002) · bf37e5c7
  ADAning authored Jul 06, 2022
```
* Add ALL_LAYERNORM_LAYERS for LayerNorm

* fix bug of appending layer norm
```
  bf37e5c7
29 Jun, 2022 1 commit
- Fix all is_torch_tpu_available issues (#17936) · 7c4c6f60
  Zachary Mueller authored Jun 29, 2022
```
* Fix all is_torch_tpu_available 
```
  7c4c6f60
28 Jun, 2022 1 commit
- Pin PyTorch in requirements as well · 5f1e67a5
  Sylvain Gugger authored Jun 28, 2022
  
  5f1e67a5
23 Jun, 2022 2 commits
- Properly calculate the total train iterations and recalculate num epochs in... · 75259b44
  Zachary Mueller authored Jun 23, 2022
```
Properly calculate the total train iterations and recalculate num epochs in no_trainer scripts (#17856)
```
  75259b44
- Change no trainer image_classification test (#17635) · acb709d5
  Zachary Mueller authored Jun 23, 2022
```
* Adjust test arguments and use a new example test
```
  acb709d5
22 Jun, 2022 3 commits

Bump numpy from 1.21.0 to 1.22.0 in /examples/research_projects/lxmert (#17817) · c366ce10

dependabot[bot] authored Jun 22, 2022

Bumps [numpy](https://github.com/numpy/numpy) from 1.21.0 to 1.22.0.
- [Release notes](https://github.com/numpy/numpy/releases)
- [Changelog](https://github.com/numpy/numpy/blob/main/doc/HOWTO_RELEASE.rst)
- [Commits](https://github.com/numpy/numpy/compare/v1.21.0...v1.22.0

)

---
updated-dependencies:
- dependency-name: numpy
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

c366ce10

Bump numpy in /examples/research_projects/visual_bert (#17816) · af0d21e7

dependabot[bot] authored Jun 22, 2022

Bumps [numpy](https://github.com/numpy/numpy) from 1.21.0 to 1.22.0.
- [Release notes](https://github.com/numpy/numpy/releases)
- [Changelog](https://github.com/numpy/numpy/blob/main/doc/HOWTO_RELEASE.rst)
- [Commits](https://github.com/numpy/numpy/compare/v1.21.0...v1.22.0

)

---
updated-dependencies:
- dependency-name: numpy
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

af0d21e7

Add logits_processor parameter, used by `generate`, to `Seq2SeqTrainer`... · 13570381

Eran Hirsch authored Jun 22, 2022

Add logits_processor parameter, used by `generate`, to `Seq2SeqTrainer` methods `evaluate` and `predict` (#17805)

* Add logits_processor parameter, used by `generate`, to `Seq2SeqTrainer` methods `evaluate` and `predict`

* Add all generate parameters to `Seq2SeqTrainer`, and also to `QuestionAnsweringSeq2SeqTrainer` which overrides it

* Remove `self._num_beams` from trainer classes

* - Run fixup
- Fix "Constraint" not exposed
- Fix synced_gpus to actually read from param

* Use kwargs

* Copy kwargs before making changes to it

* Fix style issues unused imports

13570381

21 Jun, 2022 1 commit

[CodeParrot] Near-deduplication with jaccard similarity (#17054) · da2bd2ae

Jia LI authored Jun 21, 2022



* deduplication draft

* update style

* update style test

* dummy test main

* rename modules

* rename functions

* return extremes in deduplicate_clusters

* update style

* cast str for gzip

* update doc string

* time processing

* use dataset map to compute minhash

* fill value for short token

* remove da map method

* update style

* use share object to multiprocess

* update style

* use f-string and minor fix
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>
Co-authored-by: Loubna Ben Allal <44069155+loubnabnl@users.noreply.github.com>

* update style

* use module parameters

* change ds_dedup to ds_filter

* save ds_dedup

* mv test to script tests

* make jaccard threshold a parameter of deduplicate_dataset

* update style

* add doc strings

* update style

* add doc string for DuplicationIndex

* save files into data dir

* update readme

* Update examples/research_projects/codeparrot/README.md
Co-authored-by: Loubna Ben Allal <44069155+loubnabnl@users.noreply.github.com>

* make near deduplication optional

* move near deduplication in README

* Update examples/research_projects/codeparrot/README.md
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* use f string
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>
Co-authored-by: Loubna Ben Allal <44069155+loubnabnl@users.noreply.github.com>

da2bd2ae

17 Jun, 2022 2 commits

Bump notebook in /examples/research_projects/lxmert (#17743) · e44a569f

dependabot[bot] authored Jun 17, 2022

Bumps [notebook](http://jupyter.org

) from 6.4.10 to 6.4.12.

---
updated-dependencies:
- dependency-name: notebook
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

e44a569f

Bump notebook in /examples/research_projects/visual_bert (#17742) · 5089a2d4

dependabot[bot] authored Jun 17, 2022

Bumps [notebook](http://jupyter.org

) from 6.4.10 to 6.4.12.

---
updated-dependencies:
- dependency-name: notebook
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

5089a2d4