- 28 Jul, 2021 4 commits
Sylvain Gugger authored
Sylvain Gugger authored
Buddhi Chathuranga Senarathna authored
Elysium1436 authored
* Fixed train_test_split test_size argument
* `Seq2SeqTrainer` set max_length and num_beams only when non None (#12899)
  * set max_length and num_beams only when non None
  * fix instance variables
  * fix code style
* [FLAX] Minor fixes in CLM example (#12914)
  * readme: fix retrieval of vocab size for flax clm example
  * examples: fix flax clm example when using training/evaluation files
* Fix module path for symbolic_trace example

Co-authored-by: cchen-dialpad <47165889+cchen-dialpad@users.noreply.github.com>
Co-authored-by: Stefan Schweter <stefan@schweter.it>
Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>

- 27 Jul, 2021 3 commits
Sylvain Gugger authored
Stefan Schweter authored
* readme: fix retrieval of vocab size for flax clm example
* examples: fix flax clm example when using training/evaluation files

cchen-dialpad authored
* set max_length and num_beams only when non None
* fix instance variables
* fix code style
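
The fix boils down to overriding the stored generation settings only when the caller actually passes a value, so `None` no longer clobbers a previously configured `max_length` or `num_beams`. A minimal sketch with illustrative names (not the exact `Seq2SeqTrainer` code):

```python
from typing import Optional

class TrainerSketch:
    """Illustrative stand-in for Seq2SeqTrainer's generation settings."""

    def __init__(self):
        self._max_length: Optional[int] = None
        self._num_beams: Optional[int] = None

    def evaluate(self, max_length: Optional[int] = None, num_beams: Optional[int] = None):
        # Only override when the caller passed a value; None keeps the old setting.
        self._max_length = max_length if max_length is not None else self._max_length
        self._num_beams = num_beams if num_beams is not None else self._num_beams
        return self._max_length, self._num_beams

t = TrainerSketch()
t.evaluate(max_length=128, num_beams=4)
print(t.evaluate())  # (128, 4): None arguments no longer reset the values
```
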
- 26 Jul, 2021 10 commits
Sylvain Gugger authored
Sylvain Gugger authored
Sylvain Gugger authored
Nicolas Patry authored
* Better heuristic for the token-classification pipeline. Relooking at the problem makes things much simpler: when we look at ids from a tokenizer, we have no way in **general** to recover whether some substring is part of a word or not. However, within the pipeline we still have access to the original string through the offsets, so we can simply check whether the character preceding a token (if it exists) is a space. This will obviously be wrong for tokenizers that contain spaces within tokens, and for tokenizers whose offsets include spaces (there don't seem to be many). This heuristic is hopefully fully backward compatible and can still handle non-word-based tokenizers.
* Updating test with real values.
* We still need the older "correct" heuristic to prevent fusing punctuation.
* Adding a real warning when important.
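
A minimal sketch of the space-before-token heuristic described above, assuming (start, end) character offsets from a fast tokenizer; the function name is illustrative, not the pipeline's actual internals:

```python
def is_word_continuation(sentence: str, start: int) -> bool:
    """Heuristic: a token continues the previous word when the character
    right before its start offset exists and is not a space."""
    return start > 0 and sentence[start - 1] != " "

sentence = "Hello huggingface"
offsets = [(0, 5), (6, 10), (10, 17)]  # (start, end) pairs per token
for start, end in offsets:
    print(sentence[start:end], is_word_continuation(sentence, start))
# Hello False / hugg False / ingface True
```
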
Matt authored
* Add new multiple-choice example, remove old one
Sylvain Gugger authored
Sylvain Gugger authored
Sylvain Gugger authored
* Add possibility to ignore imports in test_fetcher
* Style

Sylvain Gugger authored
Philip May authored
* add classifier_dropout to Electra
* no type annotations yet
* add classifier_dropout to Electra
* add classifier_dropout to Electra ForTokenClass.
* add classifier_dropout to bert
* add classifier_dropout to roberta
* add classifier_dropout to big_bird
* add classifier_dropout to mobilebert
* empty commit to trigger CI
* add classifier_dropout to reformer
* add classifier_dropout to ConvBERT
* add classifier_dropout to Albert
* add classifier_dropout to Albert

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
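
The recurring change adds a `classifier_dropout` attribute to each config, with the classification head falling back to the model's existing dropout value when it is unset. A minimal sketch of that fallback, assuming a simplified config object (this mirrors the pattern, not any one model's exact code):

```python
class ConfigSketch:
    """Simplified config: None means classifier_dropout was not set."""
    def __init__(self, hidden_dropout_prob=0.1, classifier_dropout=None):
        self.hidden_dropout_prob = hidden_dropout_prob
        self.classifier_dropout = classifier_dropout

config = ConfigSketch()
classifier_dropout = (
    config.classifier_dropout
    if config.classifier_dropout is not None
    else config.hidden_dropout_prob  # backward-compatible fallback
)
print(classifier_dropout)  # 0.1
```
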
- 24 Jul, 2021 2 commits
Lysandre authored
Thibault FEVRY authored
* Faster list concat for trainer_pt_utils.get_length_grouped_indices() (#11825). get_length_grouped_indices() in LengthGroupedSampler and DistributedLengthGroupedSampler is prohibitively slow for a large number of megabatches (in the test case it takes hours for ~270k megabatches with 100 items each) due to slow list concatenation with sum(megabatches, []). Resolves: #11795
* Replace double occurrences as the last step (#11367)
* [Flax] Fix PyTorch import error (#11839)
  * fix_torch_device_generate_test
  * remove @
  * change pytorch import to flax import
* Fix reference to XLNet (#11846)
* Switch mem metrics flag (#11851)
  * Switch mem metrics flag
  * Update src/transformers/training_args.py
* Fix flos single node (#11844)
  * fixing flos bug/typo ...

Co-authored-by: ctheodoris <cvtheodo@ds.dfci.harvard.edu>
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
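
The slow pattern named above, `sum(megabatches, [])`, is quadratic because every `+` builds a fresh copy of the accumulated list; any linear flatten avoids that. A self-contained comparison (a sketch of the idea, not the upstream code):

```python
import itertools

# 1,000 megabatches of 100 indices each (the real case had ~270k megabatches)
megabatches = [[i] * 100 for i in range(1000)]

# Quadratic: each implicit `+` in sum() copies the whole accumulated list again.
flat_slow = sum(megabatches, [])

# Linear alternatives; either form avoids the repeated copying:
flat_chain = list(itertools.chain.from_iterable(megabatches))
flat_comp = [idx for megabatch in megabatches for idx in megabatch]

assert flat_slow == flat_chain == flat_comp
```
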
- 23 Jul, 2021 3 commits
Patrick von Platen authored
* fix_torch_device_generate_test
* remove @
* add truncate
* finish
* correct test
* Apply suggestions from code review
* clean tests
* correct normalization for truncation
* remove casting
* up
* save intermed
* finish
* finish
* correct

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

Stas Bekman authored
Patrick von Platen authored
* fix_torch_device_generate_test
* remove @
* pin git python
* make style
* typo

- 22 Jul, 2021 4 commits
Nicolas Patry authored
* Proposal
* Testing pipelines slightly better.
  - Overall same design
  - Metaclass to get proper different tests instead of subTest (not well supported by Pytest)
  - Added ANY meta object to make output checking more readable.
  - Skipping architectures either without tiny_config or without architecture.
* Small fix.
* Fixing the tests in case of None value.
* Oups.
* Rebased with more architectures.
* Fixing reformer tests (no override anymore).
* Adding more options for model tester config.

Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
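
A minimal sketch of what an `ANY` meta object for output checking can look like; the real test helper may differ in details:

```python
class ANY:
    """Compares equal to any value of the given type(s)."""
    def __init__(self, *expected_types):
        self.expected_types = expected_types

    def __eq__(self, other):
        return isinstance(other, self.expected_types)

    def __repr__(self):
        return f"ANY({', '.join(t.__name__ for t in self.expected_types)})"

# An expected pipeline output where the exact score doesn't matter, its type does:
expected = {"label": "POSITIVE", "score": ANY(float)}
actual = {"label": "POSITIVE", "score": 0.9971}
assert actual == expected
```
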
Lysandre authored
Lysandre authored
Maxwell Forbes authored
- 21 Jul, 2021 9 commits
Stas Bekman authored
* document Deepspeed-Inference and parallelformers
* Apply suggestions from code review

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

Stas Bekman authored
* [Deepspeed] warmup_ratio docs
* Update docs/source/main_classes/deepspeed.rst
* style
* Update docs/source/main_classes/deepspeed.rst
* style

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

Sylvain Gugger authored
Stas Bekman authored
Lysandre Debut authored
* Add _CHECKPOINT_FOR_DOC
* Update src/transformers/models/funnel/modeling_funnel.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

Sylvain Gugger authored
* Add versioning system to fast tokenizer files
* Deal with offline mode
* Use staging env in tests
* Style
* Apply suggestions from code review
* Style

Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
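
One way such a versioning scheme can work, sketched under the assumption that versioned file names like `tokenizer.0.10.0.json` map to minimum `tokenizers` versions (the file-name scheme and helper here are illustrative, not necessarily the exact implementation):

```python
from packaging import version

def pick_tokenizer_file(available, installed):
    """Return the newest versioned tokenizer file whose minimum required
    `tokenizers` version is still <= the installed version."""
    best_file, best_version = "tokenizer.json", None  # unversioned legacy fallback
    for min_version, filename in available.items():
        candidate = version.parse(min_version)
        if candidate <= version.parse(installed) and (
            best_version is None or candidate > best_version
        ):
            best_file, best_version = filename, candidate
    return best_file

files = {"0.9.0": "tokenizer.0.9.0.json", "0.10.0": "tokenizer.0.10.0.json"}
print(pick_tokenizer_file(files, "0.10.3"))  # tokenizer.0.10.0.json
print(pick_tokenizer_file(files, "0.8.1"))   # tokenizer.json (legacy fallback)
```
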
Masatoshi TSUCHIYA authored
* Refer to warmup_ratio when setting warmup_num_steps.
* Add a method to get the number of warmup steps to the TrainingArguments class.
* Fix.
* Fix.
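
A minimal sketch of the helper described here, deriving warmup steps from `warmup_ratio` when `warmup_steps` is not explicitly set (mirrors the described behavior, not necessarily the exact upstream method):

```python
import math

def get_warmup_steps(num_training_steps, warmup_steps=0, warmup_ratio=0.0):
    # An explicitly set warmup_steps wins; otherwise derive from the ratio.
    if warmup_steps > 0:
        return warmup_steps
    return math.ceil(num_training_steps * warmup_ratio)

print(get_warmup_steps(1000, warmup_ratio=0.1))  # 100
print(get_warmup_steps(1000, warmup_steps=250))  # 250
```
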
Philip May authored
Lysandre Debut authored
* Expose get_config() on ModelTesters
* Typo

- 20 Jul, 2021 5 commits
Stas Bekman authored
* [trainer] fix % 0
* sanity checks
* fix logging_strategy
* correction
* Update src/transformers/training_args.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
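
The `% 0` fix amounts to guarding step-interval checks against a zero divisor; a minimal illustrative sanity check (not the exact Trainer code):

```python
def should_log(global_step: int, logging_steps: int) -> bool:
    # `global_step % 0` raises ZeroDivisionError, so check the interval first.
    return logging_steps > 0 and global_step % logging_steps == 0

print(should_log(50, 10))  # True
print(should_log(50, 0))   # False, instead of crashing on % 0
```
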
Patrick von Platen authored
Suraj Patil authored
Patrick von Platen authored
Patrick von Platen authored
* fix_torch_device_generate_test
* remove @
* correct longformer docs

Co-authored-by: Patrick von Platen <patrick@huggingface.co>