Commits · 9fa299599328b1dabb60d3e9791a35e5f02fbd65 · chenpangpang / transformers

13 Apr, 2021 1 commit
- added cache_dir=model_args.cache_dir to all example with cache_dir arg (#11220) · 9fa29959
  Philipp Schmid authored Apr 13, 2021
  
  9fa29959
09 Apr, 2021 1 commit
- [examples run_clm] fix _LazyModule hasher error (#11168) · 07f0bb69
  Stas Bekman authored Apr 09, 2021
```
* fix _LazyModule hasher error

* reword
```
  07f0bb69
08 Apr, 2021 2 commits

Run mlm pad to multiple for fp16 (#11128) · 6c40e497
Andrea Cappelli authored Apr 08, 2021
```
* Add mlm collator pad to multiple option (#10627)

* Use padding to 8x in run mlm (#10627)
```
6c40e497

[run_clm] clarify why we get the tokenizer warning on long input (#11145) · acc851e1

Stas Bekman authored Apr 08, 2021



* clarify why we get the warning here

* Update examples/language-modeling/run_clm.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* wording

* style
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

acc851e1

07 Apr, 2021 2 commits
- [examples] fix white space (#11099) · 424419f5
  Stas Bekman authored Apr 07, 2021
```
these get concatenated without whitespace, so fix it
```
  424419f5
- fix: The 'warn' method is deprecated (#11105) · c9035e45
  Stas Bekman authored Apr 07, 2021
```
* The 'warn' method is deprecated

* fix test
```
  c9035e45
06 Apr, 2021 3 commits
- Development on v4.6.0dev0 · 9853c5dd
  Lysandre authored Apr 06, 2021
  
  9853c5dd
- Release v4.5.0 · 4906a29f
  Lysandre authored Apr 06, 2021
  
  4906a29f
- Add Readme for language modeling scripts with accelerate (#11073) · 6ab7d1a4
  Hemil Desai authored Apr 06, 2021
  
  6ab7d1a4
05 Apr, 2021 1 commit

Add `examples/language_modeling/run_clm_no_trainer.py` (#11026) · b51b87c4

Hemil Desai authored Apr 05, 2021



* Initial draft for clm no trainer

* Remove unwanted args

* Fix bug

* Update examples/language-modeling/run_clm_no_trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

b51b87c4

31 Mar, 2021 2 commits

Add `examples/language_modeling/run_mlm_no_trainer.py` (#11001) · 838f83d8

Hemil Desai authored Apr 01, 2021



* Add initial script for finetuning MLM models with accelerate

* Add evaluation metric calculation

* Fix bugs

* Use no_grad on evaluation

* update script docstring

* Update examples/language-modeling/run_mlm_no_trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* PR feedback

* Fix CI failure

* Update examples/language-modeling/run_mlm_no_trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

838f83d8

Enforce string-formatting with f-strings (#10980) · acc3bd9d

Sylvain Gugger authored Mar 31, 2021



* First third

* Styling and fix mistake

* Quality

* All the rest

* Treat %s and %d

* typo

* Missing )

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

acc3bd9d

19 Mar, 2021 1 commit

Expand a bit the presentation of examples (#10799) · 946400fb

Sylvain Gugger authored Mar 19, 2021



* Expand a bit the presentation of examples

* Apply suggestions from code review
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Address review comments
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

946400fb

16 Mar, 2021 2 commits
- Development on v4.5.0dev0 · 1b5ce1e6
  Lysandre authored Mar 16, 2021
  
  1b5ce1e6
- Release v4.4.0 · c988db5a
  Lysandre authored Mar 16, 2021
  
  c988db5a
15 Mar, 2021 1 commit

Add minimum version check in examples (#10724) · 4c379daf

Sylvain Gugger authored Mar 15, 2021

* Add minimum version check in examples

* Style

* No need for new line maybe?

* Add helpful comment

4c379daf

08 Mar, 2021 1 commit

Added max_sample_ arguments (#10551) · dfd16af8

Bhadresh Savani authored Mar 09, 2021

* reverted changes of logging and saving metrics

* added max_sample arguments

* fixed code

* white space diff

* reformetting code

* reformatted code

dfd16af8

27 Feb, 2021 1 commit
- updated logging and saving metrics (#10436) · aca6288f
  Bhadresh Savani authored Feb 27, 2021
```
* updated logging and saving metrics

* space removal
```
  aca6288f
08 Feb, 2021 1 commit
- Truncate max length if needed in all examples (#10034) · b01483fa
  Sylvain Gugger authored Feb 08, 2021
  
  b01483fa
05 Feb, 2021 1 commit
- [examples] make run scripts executable (#10037) · 8ea412a8
  Stas Bekman authored Feb 05, 2021
```
* make executable

* make executable

* same for the template

* cleanup
```
  8ea412a8
03 Feb, 2021 1 commit
- [run_clm.py] fix getting extention · bca0dd5e
  Suraj Patil authored Feb 03, 2021
  
  bca0dd5e
01 Feb, 2021 1 commit

Fit chinese wwm to new datasets (#9887) · 1682804e

wlhgtc authored Feb 01, 2021



* MOD: fit chinese wwm to new datasets

* MOD: move wwm to new folder

* MOD: formate code

* Styling

* MOD add param and recover trainer
Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>

1682804e

28 Jan, 2021 1 commit
- Deprecate model_path in Trainer.train (#9854) · b4e559cf
  Sylvain Gugger authored Jan 28, 2021
  
  b4e559cf
27 Jan, 2021 1 commit
- Setup logging with a stdout handler (#9816) · f2fabedb
  Sylvain Gugger authored Jan 27, 2021
  
  f2fabedb
25 Jan, 2021 1 commit

Auto-resume training from checkpoint (#9776) · caf4abf7

Sylvain Gugger authored Jan 25, 2021



* Auto-resume training from checkpoint

* Update examples/text-classification/run_glue.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Roll out to other examples
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

caf4abf7

20 Jan, 2021 1 commit
- Restrain tokenizer.model_max_length default (#9681) · a1ad16a4
  Sylvain Gugger authored Jan 20, 2021
```
* Restrain tokenizer.model_max_length default

* Fix indent
```
  a1ad16a4
06 Jan, 2021 1 commit
- Allow example to use a revision and work with private models (#9407) · 453a70d4
  Sylvain Gugger authored Jan 06, 2021
```
* Allow example to use a revision and work with private models

* Copy to other examples and template

* Styling
```
  453a70d4
22 Dec, 2020 2 commits
- Add speed metrics to all example scripts + template (#9260) · ab177588
  Sylvain Gugger authored Dec 22, 2020
  
  ab177588
- Fix link to old language modeling script (#9254) · 189c1b91
  Manuel Romero authored Dec 22, 2020
  
  189c1b91
16 Dec, 2020 1 commit

[Flax] Align FlaxBertForMaskedLM with BertForMaskedLM, implement from_pretrained, init (#9054) · 640e6fe1

Patrick von Platen authored Dec 16, 2020



* save intermediate

* save intermediate

* save intermediate

* correct flax bert model file

* new module / model naming

* make style

* almost finish BERT

* finish roberta

* make fix-copies

* delete keys file

* last refactor

* fixes in run_mlm_flax.py

* remove pooled from run_mlm_flax.py`

* fix gelu | gelu_new

* remove Module from inits

* splits

* dirty print

* preventing warmup_steps == 0

* smaller splits

* make fix-copies

* dirty print

* dirty print

* initial_evaluation argument

* declaration order fix

* proper model initialization/loading

* proper initialization

* run_mlm_flax improvements: improper model inputs bugfix + automatic dataset splitting + tokenizers parallelism warning + avoiding warmup_steps=0 bug

* removed tokenizers warning hack, fixed model re-initialization

* reverted training_args.py changes

* fix flax from pretrained

* improve test in flax

* apply sylvains tips

* update init

* make 0.3.0 compatible

* revert tevens changes

* revert tevens changes 2

* finalize revert

* fix bug

* add docs

* add pretrained to init

* Update src/transformers/modeling_flax_utils.py

* fix copies

* final improvements
Co-authored-by: TevenLeScao <teven.lescao@gmail.com>

640e6fe1

15 Dec, 2020 1 commit

[Examples] Add automatic dataset splitting in language-modeling examples (#9133) · 2a7e8e16

Teven authored Dec 15, 2020

* replaced jnp.split + removing textual model inputs + ensuring warmup_steps > 0

* Add automatic dataset splitting in language-modeling examples

2a7e8e16

11 Dec, 2020 1 commit

Reorganize examples (#9010) · 783d7d26

Sylvain Gugger authored Dec 11, 2020



* Reorganize example folder

* Continue reorganization

* Change requirements for tests

* Final cleanup

* Finish regroup with tests all passing

* Copyright

* Requirements and readme

* Make a full link for the documentation

* Address review comments

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Add symlink

* Reorg again

* Apply suggestions from code review
Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>

* Adapt title

* Update to new strucutre

* Remove test

* Update READMEs
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>

783d7d26

10 Dec, 2020 1 commit

Fix typo #9012 (#1) (#9038) · 91ab02af

NatLun137 authored Dec 10, 2020

There is a tiny typo in the code "transformers/examples/language-modeling/run_mlm_wwm.py" at line 284. [Details.](https://github.com/huggingface/transformers/issues/9012)

91ab02af

09 Dec, 2020 1 commit

Flax Masked Language Modeling training example (#8728) · 75627148

Funtowicz Morgan authored Dec 09, 2020



* Remove "Model" suffix from Flax models to look more :hugs:
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

* Initial working (forward + backward) for Flax MLM training example.
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

* Simply code
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

* Addressing comments, using module and moving to LM task.
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

* Restore parameter name "module" wrongly renamed model.
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

* Restore correct output ordering...
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

* Actually commit the example 😅

Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

* Add FlaxBertModelForMaskedLM after rebasing.
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Make it possible to initialize the training from scratch
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Reuse flax linen example of cross entropy loss
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Added specific data collator for flax
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Remove todo for data collator
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Added evaluation step
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Added ability to provide dtype to support bfloat16 on TPU
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Enable flax tensorboard output
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Enable jax.pmap support.
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Ensure batches are correctly sized to be dispatched with jax.pmap
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Enable bfloat16 with --fp16 cmdline args
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Correctly export metrics to tensorboard
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Added dropout and ability to use it.
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Effectively enable & disable during training and evaluation steps.
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Oops.
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Enable specifying kernel initializer scale
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Style.
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Added warmup step to the learning rate scheduler.
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Fix typo.
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Print training loss
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Make style
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* fix linter issue (flake8)
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Fix model matching
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Fix dummies
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Fix non default dtype on Flax models
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Use the same create_position_ids_from_input_ids for FlaxRoberta
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Make Roberta attention as Bert
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* fix copy
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Wording.
Co-authored-by: Marc van Zee <marcvanzee@gmail.com>
Co-authored-by: Marc van Zee <marcvanzee@gmail.com>

75627148

07 Dec, 2020 1 commit
- Small fix to the run clm script (#8973) · 62d30e05
  Sylvain Gugger authored Dec 07, 2020
  
  62d30e05
23 Nov, 2020 1 commit
- Fix max length in run_plm script (#8738) · 367f497d
  Sylvain Gugger authored Nov 23, 2020
  
  367f497d
19 Nov, 2020 1 commit

fix small typo (#8644) · a79a96dd

Matthias authored Nov 19, 2020

Fixed a small typo on the XLNet and permutation language modelling section

a79a96dd

18 Nov, 2020 2 commits
- Update README.md (#8635) · 28d16e7a
  Tim Isbister authored Nov 19, 2020
  
  28d16e7a
- Fix training from scratch in new scripts (#8623) · a0c62d24
  Sylvain Gugger authored Nov 18, 2020
  
  a0c62d24
17 Nov, 2020 1 commit

Tokenizers: ability to load from model subfolder (#8586) · 042a6aa7

Julien Chaumond authored Nov 17, 2020



* <small>tiny typo</small>

* Tokenizers: ability to load from model subfolder

* use subfolder for local files as well

* Uniformize model shortcut name => model id

* from s3 => from huggingface.co
Co-authored-by: Quentin Lhoest <lhoest.q@gmail.com>

042a6aa7