Commits · e27707488911a4bae5936a1bdad0cfdb2018cebd · chenpangpang / transformers

28 Jun, 2021 8 commits

Tensorflow LM examples (#12358) · 7e22609e

Matt authored Jun 28, 2021

* Tensorflow MLM example

* Add CLM example

* Style fixes, adding missing checkpoint code from the CLM example

* Fix TPU training, avoid massive dataset warnings

* Fix incorrect training length calculation for multi-GPU training

* Fix incorrect training length calculation for multi-GPU training

* Refactors and nitpicks from the review

* Style pass

* Adding README

7e22609e

[Flax] Adapt flax examples to include `push_to_hub` (#12391) · 2d70c912

Patrick von Platen authored Jun 28, 2021



* fix_torch_device_generate_test

* remove @

* finish

* correct summary writer

* correct push to hub

* fix indent

* finish

* finish

* finish

* finish

* finish
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

2d70c912

Fix copies · 276bc149
Sylvain Gugger authored Jun 28, 2021

276bc149
Update README.md · 27b6ac46
Patrick von Platen authored Jun 28, 2021

27b6ac46

[Flax community event] Add more description to readme (#12398) · 89b57a66

Patrick von Platen authored Jun 28, 2021



* fix_torch_device_generate_test

* remove @

* boom boom

* correct typos

* Apply suggestions from code review
Co-authored-by: Suraj Patil <surajp815@gmail.com>

* Apply suggestions from code review
Co-authored-by: Suzana Ilić <io.suzanai@gmail.com>

* Apply suggestions from code review
Co-authored-by: Suraj Patil <surajp815@gmail.com>
Co-authored-by: Suzana Ilić <io.suzanai@gmail.com>

89b57a66

[Examples] Added context manager to datasets map (#12367) · 04dbea31

Bhadresh Savani authored Jun 28, 2021

* added context manager to datasets map

* fixed style and spaces

* fixed warning of deprecation

* changed desc

04dbea31

Add possibility to maintain full copies of files (#12312) · 57461ac0
Sylvain Gugger authored Jun 28, 2021

57461ac0

Update run_mlm.py (#12344) · 9490d668

Taha ValizadehAslani authored Jun 28, 2021

Previously, the script could not be used for validation only because of this line:
extension = data_args.train_file.split(".")[-1]
It assumed that the extension must be extracted from the training dataset, and it ran regardless of whether the user had selected training or validation. This raised an error when the user wanted to run evaluation only, without training, because the training file does not exist. I changed it to extract the extension from the training file if the user wants to train, and from the validation file if the user wants to run eval. The script can now be used for training and for validation separately.
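
The change described above can be sketched roughly as follows. This is a minimal illustration, not the actual patch: `infer_extension` is a hypothetical helper, and the `validation_file` attribute and the `do_train` flag are assumed names that do not appear in the commit message.

```python
from types import SimpleNamespace


def infer_extension(data_args, do_train):
    """Return the file extension used to choose a dataset loader.

    Hypothetical helper: reads the training file when training is
    requested, otherwise falls back to the validation file.
    """
    # Assumed attribute names: `train_file` comes from the commit message,
    # `validation_file` is an assumption made for this sketch.
    source_file = data_args.train_file if do_train else data_args.validation_file
    if source_file is None:
        raise ValueError("No data file was provided for the requested mode.")
    return source_file.split(".")[-1]


# Evaluation-only run: no training file is needed any more.
eval_only_args = SimpleNamespace(train_file=None, validation_file="dev.json")
eval_extension = infer_extension(eval_only_args, do_train=False)

# Training run: the extension still comes from the training file.
train_args = SimpleNamespace(train_file="train.csv", validation_file=None)
train_extension = infer_extension(train_args, do_train=True)
```

With the original single line, the evaluation-only case would fail because `train_file` is `None` and has no `split` method.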

9490d668

26 Jun, 2021 1 commit
- replace print with logger (#12368) · ff5cdc08
  Bhadresh Savani authored Jun 26, 2021
  
  ff5cdc08
25 Jun, 2021 6 commits
- [Examples] Replicates the new --log_level feature to all trainer-based pytorch (#12359) · 539ee456
  Bhadresh Savani authored Jun 25, 2021
```
* added log_level

* fix comment

* fixed log_level

* Trigger CI

* Unified logging

* simplified args for log_level
```
  539ee456
- [trainer] add main_process_first context manager (#12351) · 64e60980
  Stas Bekman authored Jun 25, 2021
```
* main_process_first context manager

* handle multi-node, add context description

* sync desc
```
  64e60980
- remove extra white space from log format (#12360) · 4a872cae
  Stas Bekman authored Jun 25, 2021
  
  4a872cae
- Add FlaxBigBird QuestionAnswering script (#12233) · 332a2458
  Vasudev Gupta authored Jun 25, 2021
```
* port bigbird script

* adapt script a bit

* change location

* adapt more

* save progress

* init commit

* style

* dataset script tested

* readme add
```
  332a2458
- fixed typo (#12356) · d4ce31e8
  michal pitr authored Jun 25, 2021
  
  d4ce31e8
- Update README.md · aa550c4a
  Patrick von Platen authored Jun 25, 2021
  
  aa550c4a
24 Jun, 2021 2 commits
- Add flax/jax quickstart (#12342) · f2c4ce7e
  Marc van Zee authored Jun 24, 2021
  
  f2c4ce7e
- [examples/Flax] move the examples table up (#12341) · aef3823e
  Suraj Patil authored Jun 24, 2021
  
  aef3823e
23 Jun, 2021 4 commits

v4.9.0.dev0 · 2150dfed
Sylvain Gugger authored Jun 23, 2021

2150dfed
Release: v4.8.0 · 9252a512
Sylvain Gugger authored Jun 23, 2021

9252a512
[Flax/JAX] Add how to propose projects markdown (#12311) · 44739c81
Patrick von Platen authored Jun 23, 2021
```
* fix_torch_device_generate_test

* remove @

* finish

* make style
```
44739c81

Flax summarization script (#12230) · c0fe3c9a

Suraj Patil authored Jun 23, 2021

* add summarization script

* fix arguments, preprocessing, metrics

* add generation and metrics

* auto model, prediction loop

* prettify

* label smoothing

* address Sylvain's and Patrick's suggestions

* dynamically import shift_tokens_right

* fix shift_tokens_right_fn call

c0fe3c9a

22 Jun, 2021 3 commits

[trainer] 2 bug fixes and a rename (#12309) · ebe54135
Stas Bekman authored Jun 22, 2021
```
* bug fixes and a rename

* add extended DDP test
```
ebe54135

[Flax] Main doc for event orga (#12305) · 64029abe

Patrick von Platen authored Jun 22, 2021

* fix_torch_device_generate_test

* remove @

* push

* finish

* some typos

* add more info on communication

* add suggestions

64029abe

[trainer + examples] set log level from CLI (#12276) · dad414d5

Stas Bekman authored Jun 21, 2021



* set log level from CLI

* add log_level_replica + test + extended docs

* cleanup

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* rename datasets objects to allow datasets module

* improve the doc

* style

* doc improve
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

dad414d5

21 Jun, 2021 2 commits

Tensorflow QA example (#12252) · e3cb7a0b

Matt authored Jun 21, 2021



* New Tensorflow QA example!

* Style pass

* Updating README.md for the new example

* flake8 fixes

* Update examples/tensorflow/question-answering/README.md
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

e3cb7a0b

Fix for making student ProphetNet for Seq2Seq Distillation (#12130) · b53bc55b
Vishal Burman authored Jun 21, 2021
```
* make_student.py: fix to make student ProphetNet

* reformat
```
b53bc55b

17 Jun, 2021 3 commits
- update desc for map in all examples (#12226) · e43e1126
  Bhavitvya Malik authored Jun 18, 2021
```
* update desc for map in all examples

* added plm

* suggestions
```
  e43e1126
- Docs for v4.8.0 · 0daadc19
  Lysandre authored Jun 17, 2021
  
  0daadc19
- Release: v4.7.0 · 7a6c9fab
  Lysandre authored Jun 17, 2021
  
  7a6c9fab
15 Jun, 2021 3 commits

Model card defaults (#12122) · 7d7ceca3

Sylvain Gugger authored Jun 15, 2021



* [WIP] Model card defaults

* finetuned_from default value

* Add all mappings to the mapping file

* Be more defensive on finetuned_from arg

* Add default task tag

* Separate tags from tasks

* Edge case for dataset

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

7d7ceca3

Enable add_prefix_space if model_type is roberta or gpt2 (#12116) · 955b2b97
kumapo authored Jun 15, 2021

955b2b97
Use a released version of optax rather than installing from Git. (#12173) · 9b393240
Avital Oliver authored Jun 15, 2021
```
Use a released version of optax rather than installing from Git
```
9b393240

14 Jun, 2021 7 commits

[style] consistent nn. and nn.functional: part 4 `examples` (#12156) · 88e84186
Stas Bekman authored Jun 14, 2021
```
* consistent nn. and nn.functional: p4 examples

* restore
```
88e84186

[lm examples] Replicate --config_overrides addition to other LM examples (#12135) · 9de62cfb

Kumar Abhishek authored Jun 14, 2021



* [lm examples] Replicate --config_overrides addition to other LM examples

* Removing no trainer files changes

* Update README
Co-authored-by: Kumar Abhishek <kabhishek@expedia.com>

9de62cfb

Use text_column_name variable instead of "text" (#12132) · cd7961b6

Nicholas Broad authored Jun 14, 2021



* Use text_column_name variable instead of "text"

`text_column_name` was already defined above the lines I changed, and it was already used below them.

This is a very minor change. If a dataset does not use "text" as the column name, then the `tokenize_function` will now use whatever column is assigned to `text_column_name`. `text_column_name` is just the first column name if "text" is not a column name. It makes the function a little more robust, though I would assume that 90%+ of datasets use "text" anyway.

* black formatting

* make style
Co-authored-by: Nicholas Broad <nicholas@nmbroad.com>

cd7961b6
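
The column selection described in this commit message can be sketched as below. This is a hedged illustration under stated assumptions: `pick_text_column`, `make_tokenize_function`, and `whitespace_tokenizer` are hypothetical names introduced here, and the toy tokenizer only stands in for a real one so that the snippet runs on its own.

```python
def pick_text_column(column_names):
    """Prefer a column named "text", otherwise use the first column."""
    return "text" if "text" in column_names else column_names[0]


def make_tokenize_function(tokenizer, text_column_name):
    """Build a tokenize function that reads the selected column."""

    def tokenize_function(examples):
        # Index by the variable instead of the hard-coded string "text".
        return tokenizer(examples[text_column_name])

    return tokenize_function


def whitespace_tokenizer(texts):
    """Stand-in tokenizer (an assumption for this sketch): splits on whitespace."""
    return {"input_ids": [text.split() for text in texts]}


# A dataset whose text lives in a column called "sentence".
column_names = ["sentence", "label"]
text_column_name = pick_text_column(column_names)
tokenize_function = make_tokenize_function(whitespace_tokenizer, text_column_name)
batch = {"sentence": ["hello world", "flax"], "label": [0, 1]}
tokenized = tokenize_function(batch)
```

A function that indexed `examples["text"]` directly would raise a `KeyError` on this batch, which is the case the commit makes more robust.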

Don't log anything before logging is setup in examples (#12121) · b8ab5413
Sylvain Gugger authored Jun 14, 2021
```
* Don't log anything before logging is setup in examples

* Last example
```
b8ab5413
[Flax] Add links to google colabs (#12146) · 7566fefa
Patrick von Platen authored Jun 14, 2021
```
* fix_torch_device_generate_test

* remove @

* add colab links
```
7566fefa

add readme for flax clm (#12111) · d36fce82

Suraj Patil authored Jun 14, 2021



* add readme for flax clm

* use section link for tokenizer

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* update metrics
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

d36fce82

Add mlm pretraining xla torch readme (#12011) · 16c0efca

Patrick von Platen authored Jun 14, 2021



* fix_torch_device_generate_test

* remove @

* upload

* Apply suggestions from code review

* Apply suggestions from code review

* Apply suggestions from code review

* Update examples/flax/language-modeling/README.md

* add more info

* finish

* fix
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

16c0efca

11 Jun, 2021 1 commit

Flax CLM script (#12023) · 15b498f3

Suraj Patil authored Jun 11, 2021

* first draft

* max_seq_length => block_size

* fix arg names

* fix typos

* fix loss calculation

* add max examples, fix train eval steps, metrics

* optimizer mask

* fix perplexity, metric logging

* fix logging

* data_collator => data_loader

* refactor loss_fn

* support single GPU

* pass distributed to write_metric

* fix jitting

* fix single device training

* fix single device metrics

* close inner progress bars once finished

* add overwrite_cache arg

* fix dataset caching issue

* add more logs

* few small fixes

* address Nicholas's suggestions

* fix docstr

* address Patrick's suggestions

* make flake happy

* pass new new_dropout_rng to apply_gradients

* reset train metrics after every epoch

* remove distributed logic, small fixes

15b498f3