Commits · b9a768b3ffa80c4c19d024f9f42d5917e7d8109e · chenpangpang / transformers

30 Mar, 2022 1 commit
- [examples] max samples can't be bigger than the len of dataset (#16501) · a73281e3
  Stas Bekman authored Mar 30, 2022
```
* [examples] max samples can't be bigger than then len of dataset

* do tf and flax
```
  a73281e3
23 Mar, 2022 1 commit

Reorganize file utils (#16264) · 4975002d

Sylvain Gugger authored Mar 23, 2022

* Split file_utils in several submodules

* Fixes

* Add back more objects

* More fixes

* Who exactly decided to import that from there?

* Second suggestion to code with code review

* Revert wront move

* Fix imports

* Adapt all imports

* Adapt all imports everywhere

* Revert this import, will fix in a separate commit

4975002d

14 Dec, 2021 1 commit
- Make data shuffling in `run_clm_flax.py` respect global seed (#13410) · 2a606f99
  Benjamin Minixhofer authored Dec 14, 2021
```
* use jax and jnp instead of numpy in data_loader

* return batches as np.ndarray
```
  2a606f99
12 Dec, 2021 1 commit
- [Flax examples] remove dependancy on pytorch training args (#14636) · 6a025487
  Suraj Patil authored Dec 12, 2021
```
* use custom training arguments

* update tests
```
  6a025487
06 Dec, 2021 2 commits

[urls to hub] Replace outdated model tags with their now-canonical pipeline types (#14617) · 6cdc3a78
Julien Chaumond authored Dec 06, 2021
```
* Replace outdated model tags with their now-canonical pipeline types

* spam the CI till it's green
```
6cdc3a78

Add Flax example tests (#14599) · c5bd732a

Suraj Patil authored Dec 06, 2021

* add test for glue

* add tests for clm

* fix clm test

* add summrization tests

* more tests

* fix few tests

* add test for t5 mlm

* fix t5 mlm test

* fix tests for multi device

* cleanup

* ci job

* fix metric file name

* make t5 more robust

c5bd732a

22 Nov, 2021 1 commit

Switch from using sum for flattening lists of lists in group_texts (#14472) · 69e16abf

Nicholas Broad authored Nov 22, 2021



* remove sum for list flattening

* change to chain(*)

* make chain object a list

* delete empty lines

per sgugger's suggestions
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Nicholas Broad <nicholas@nmbroad.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

69e16abf

30 Sep, 2021 1 commit

[examples/flax] use Repository API for push_to_hub (#13672) · 7db2a79b

Suraj Patil authored Sep 30, 2021

* use Repository for push_to_hub

* update readme

* update other flax scripts

* update readme

* update qa example

* fix push_to_hub call

* fix typo

* fix more typos

* update readme

* use abosolute path to get repo name

* fix glue script

7db2a79b

28 Aug, 2021 1 commit

examples: only use keep_linebreaks when reading TXT files (#13320) · 4046e66e

Stefan Schweter authored Aug 28, 2021

* examples: only use keep_linebreaks when reading TXT files for all CLM examples

* examples: only use keep_linebreaks when reading TXT files for all CLM examples

* examples: only use keep_linebreaks when reading TXT files for all CLM examples

4046e66e

27 Aug, 2021 1 commit

examples: add keep_linebreaks option to CLM examples (#13150) · 319d840b

Stefan Schweter authored Aug 27, 2021

* examples: add keep_linebreaks option to text dataset loader for all CLM examples

* examples: introduce new keep_linebreaks option as data argument in CLM examples

319d840b

09 Aug, 2021 1 commit

[Flax] Refactor gpt2 & bert example docs (#13024) · 13a9c9a3

Patrick von Platen authored Aug 09, 2021



* fix_torch_device_generate_test

* remove @

* improve docs for clm

* speed-ups

* correct t5 example as well

* push final touches

* Update examples/flax/language-modeling/README.md

* correct docs for mlm

* Update examples/flax/language-modeling/README.md
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

13a9c9a3

27 Jul, 2021 1 commit

[FLAX] Minor fixes in CLM example (#12914) · d3c3e722

Stefan Schweter authored Jul 27, 2021

* readme: fix retrieval of vocab size for flax clm example

* examples: fix flax clm example when using training/evaluation files

d3c3e722

20 Jul, 2021 1 commit

Flax MLM: Allow validation split when loading dataset from local file (#12689) · 66197adc

fgaim authored Jul 20, 2021

* Allow validation split when loading dataset from local file

* Flax clm & t5, enable validation split for datasets loaded from local file

66197adc

09 Jul, 2021 1 commit
- [Flax] Fix cur step flax examples (#12608) · deecdd49
  Patrick von Platen authored Jul 09, 2021
```
* fix_torch_device_generate_test

* remove @

* fix save problem
```
  deecdd49
08 Jul, 2021 1 commit
- Fix group_lengths for short datasets (#12558) · 6f1adc43
  Sylvain Gugger authored Jul 08, 2021
  
  6f1adc43
07 Jul, 2021 1 commit

[examples/flax] add adafactor optimizer (#12544) · 2d42915a

Suraj Patil authored Jul 07, 2021



* add adafactor

* Update examples/flax/language-modeling/run_mlm_flax.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

2d42915a

06 Jul, 2021 1 commit

[Flax] Adapt examples to be able to use eval_steps and save_steps (#12543) · 208df208

Patrick von Platen authored Jul 06, 2021



* fix_torch_device_generate_test

* remove @

* up

* up

* correct

* upload
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

208df208

05 Jul, 2021 2 commits
- [Flax] Correct logging steps flax (#12515) · d0f7508a
  Patrick von Platen authored Jul 05, 2021
```
* fix_torch_device_generate_test

* remove @

* push
```
  d0f7508a
- [Flax] Correct flax training scripts (#12514) · bb4ac2b5
  Patrick von Platen authored Jul 05, 2021
```
* fix_torch_device_generate_test

* remove @

* add logging steps

* correct training scripts

* correct readme

* correct
```
  bb4ac2b5
29 Jun, 2021 1 commit
- [Flax] Example scripts - correct weight decay (#12409) · 81332868
  Patrick von Platen authored Jun 29, 2021
```
* fix_torch_device_generate_test

* remove @

* finish

* finish

* correct style
```
  81332868
28 Jun, 2021 1 commit

[Flax] Adapt flax examples to include `push_to_hub` (#12391) · 2d70c912

Patrick von Platen authored Jun 28, 2021



* fix_torch_device_generate_test

* remove @

* finish

* correct summary writer

* correct push to hub

* fix indent

* finish

* finish

* finish

* finish

* finish
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

2d70c912

25 Jun, 2021 1 commit
- remove extra white space from log format (#12360) · 4a872cae
  Stas Bekman authored Jun 25, 2021
  
  4a872cae
11 Jun, 2021 1 commit

Flax CLM script (#12023) · 15b498f3

Suraj Patil authored Jun 11, 2021

* first draft

* max_seq_length => block_size

* fix arg names

* fix typos

* fix loss calculation

* add max examples, fix  train eval steps, metrics

* optimizer mask

* fix perpelexity, metric logging

* fix logging

* data_collator = > data_loader

* refactor loss_fn

* support single GPU

* pass distributed to write_metric

* fix jitting

* fix single device training

* fix single device metrics

* close inner progress bars once finished

* add overwrite_cache arg

* ifx dataset caching issue

* add more logs

* few small fixes,

* address nicholas suggestions

* fix docstr

* address patricks suggestions

* make flake happy

* pass new new_dropout_rng to apply_gradients

* reset train metrics after every epoc

* remove distributed logis, small fixes

15b498f3

08 Jun, 2021 1 commit
- Properly indent block_size (#12070) · fd690283
  Sylvain Gugger authored Jun 08, 2021
  
  fd690283
25 May, 2021 3 commits

[Examples] create model with custom config on the fly (#11798) · 1b653010

Stas Bekman authored May 25, 2021



* create custom model on the flight

* better wording

* add update_from_string

* cleanup

* cleanup

* Update src/transformers/configuration_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* more bool options

* style

* fix logger

* add test

* add the doc

* assert on conflict of options
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

1b653010

[lm examples] fix overflow in perplexity calc (#11855) · 6287c929
Stas Bekman authored May 25, 2021
```
* fix overflow in perplexity calc

* use inf

* fix
```
6287c929
Add option to log only once in multinode training (#11819) · f086652b
Sylvain Gugger authored May 25, 2021
```
* Add option to long only once in multinode training

* Use an alternate property
```
f086652b

12 May, 2021 2 commits
- Docs for v4.7.0.dev0 · d77eb0cf
  Lysandre authored May 12, 2021
  
  d77eb0cf
- Release: v4.6.0 · 64e78564
  Lysandre authored May 12, 2021
  
  64e78564
11 May, 2021 1 commit

Auto modelcard (#11599) · a135f595

Sylvain Gugger authored May 11, 2021



* Autogenerate model cards from the Trainer

* ModelCard deprecated

* Fix test

* Style

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Address review comments

* Quality

* With all metadata

* Metadata

* Post-merge conflict mess

* Data args and all examples

* Default license and languages when possible
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

a135f595

29 Apr, 2021 1 commit
- Split checkpoint from model_name_or_path in examples (#11492) · b29eb247
  Sylvain Gugger authored Apr 29, 2021
```
* Split checkpoint from model_name_or_path in examples

* Address review comments

* Address review comments
```
  b29eb247
26 Apr, 2021 1 commit

[Examples] Fixes inconsistency around eval vs val and predict vs test (#11380) · 1d30ec95

Bhadresh Savani authored Apr 26, 2021

* added changes for uniformity

* modified files

* corrected typo

* fixed qa scripts

* fix typos

* fixed predict typo in qa no trainer

* fixed test file

* reverted trainer changes

* reverted trainer changes in custom exmaples

* updated readme

* added changes in deepspeed test

* added changes for predict and eval

1d30ec95

23 Apr, 2021 1 commit

Trainer push to hub (#11328) · bf2e0cf7

Sylvain Gugger authored Apr 23, 2021



* Initial support for upload to hub

* push -> upload

* Fixes + examples

* Fix torchhub test

* Torchhub test I hate you

* push_model_to_hub -> push_to_hub

* Apply mixin to other pretrained models

* Remove ABC inheritance

* Add tests

* Typo

* Run tests

* Install git-lfs

* Change approach

* Add push_to_hub to all

* Staging test suite

* Typo

* Maybe like this?

* More deps

* Cache

* Adapt name

* Quality

* MOAR tests

* Put it in testing_utils

* Docs + torchhub last hope

* Styling

* Wrong method

* Typos

* Update src/transformers/file_utils.py
Co-authored-by: Julien Chaumond <julien@huggingface.co>

* Address review comments

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Julien Chaumond <julien@huggingface.co>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

bf2e0cf7

21 Apr, 2021 1 commit

Examples reorg (#11350) · dabeb152

Sylvain Gugger authored Apr 21, 2021



* Base move

* Examples reorganization

* Update references

* Put back test data

* Move conftest

* More fixes

* Move test data to test fixtures

* Update path

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Address review comments and clean
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

dabeb152

13 Apr, 2021 1 commit
- added cache_dir=model_args.cache_dir to all example with cache_dir arg (#11220) · 9fa29959
  Philipp Schmid authored Apr 13, 2021
  
  9fa29959
09 Apr, 2021 1 commit
- [examples run_clm] fix _LazyModule hasher error (#11168) · 07f0bb69
  Stas Bekman authored Apr 09, 2021
```
* fix _LazyModule hasher error

* reword
```
  07f0bb69
08 Apr, 2021 1 commit

[run_clm] clarify why we get the tokenizer warning on long input (#11145) · acc851e1

Stas Bekman authored Apr 08, 2021



* clarify why we get the warning here

* Update examples/language-modeling/run_clm.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* wording

* style
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

acc851e1

07 Apr, 2021 2 commits
- [examples] fix white space (#11099) · 424419f5
  Stas Bekman authored Apr 07, 2021
```
these get concatenated without whitespace, so fix it
```
  424419f5
- fix: The 'warn' method is deprecated (#11105) · c9035e45
  Stas Bekman authored Apr 07, 2021
```
* The 'warn' method is deprecated

* fix test
```
  c9035e45
06 Apr, 2021 1 commit
- Development on v4.6.0dev0 · 9853c5dd
  Lysandre authored Apr 06, 2021
  
  9853c5dd