- 30 Jul, 2021 3 commits
-
-
wulu473 authored
Co-authored-by: Lukas Wutschitz <lukas.wutschitz@microsoft.com>
-
harshithapv authored
* minor change to log azureml only for rank 0
* fix typo
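The rank-0 gating mentioned above can be sketched as follows. This is a minimal illustration with hypothetical names (`log_to_azureml` is not the actual integration): only the rank-0 process records metrics, so a distributed run does not log the same values once per process.

```python
# Hypothetical sketch of rank-gated metric logging.
logged = []

def log_to_azureml(metrics, rank):
    # Every non-zero rank skips reporting; only rank 0 records.
    if rank != 0:
        return
    logged.append(metrics)

# Simulate a 4-process distributed run: only one entry is recorded.
for rank in range(4):
    log_to_azureml({"loss": 0.5}, rank)
```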
-
21jun authored
help for `ModelArguments.gradient_checkpointing` should be "If True, use gradient checkpointing to save memory at the expense of slower backward pass." not "Whether to freeze the feature extractor layers of the model." (which is duplicated from `freeze_feature_extractor` arg)
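A minimal sketch of the corrected dataclass, assuming the usual `field(metadata={"help": ...})` pattern for argument help strings (the field defaults here are illustrative): each argument carries its own help text instead of one duplicated from `freeze_feature_extractor`.

```python
from dataclasses import dataclass, field, fields

@dataclass
class ModelArguments:
    # Hypothetical trimmed-down version of the arguments dataclass.
    freeze_feature_extractor: bool = field(
        default=True,
        metadata={"help": "Whether to freeze the feature extractor layers of the model."},
    )
    gradient_checkpointing: bool = field(
        default=False,
        metadata={
            "help": "If True, use gradient checkpointing to save memory "
                    "at the expense of slower backward pass."
        },
    )

# Collect help strings per field name to show they are now distinct.
help_texts = {f.name: f.metadata["help"] for f in fields(ModelArguments)}
```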
-
- 29 Jul, 2021 3 commits
-
-
Kevin Canwen Xu authored
* Add CpmTokenizerFast
* Fix isort
* Overwrite _batch_encode_plus
-
Nicolas Patry authored
* Update feature extraction pipeline.
* Leaving 1 small model for actual values check.
* Fixes tests:
  - Better support for tokenizers with no pad token
  - Increasing PegasusModelTesterConfig for pipelines
  - Tests of feature extraction are more permissive + don't test multimodal models + encoder-decoder.
* Fixing model loading with incorrect shape (+ model with HEAD).
* Update tests/test_pipelines_common.py
* Revert modeling_utils modification.
* Some corrections.
* Update tests/test_pipelines_common.py
* Update tests/test_pipelines_feature_extraction.py
* Syntax.
* Fixing text-classification tests.
* Don't modify this file.

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Funtowicz Morgan authored
* Raise an issue if the pytorch version is < 1.8.0
* Attempt to add a test to ensure it correctly raises.
* Missing docstring.
* Second attempt, patch with string absolute import.
* Let's do the call before checking it was called ...
* Use the correct function ... 🤦
* Raise ImportError and AssertionError respectively when unable to find torch and torch version is not sufficient.
* Correct path mock patching
* Relax constraint for torch_onnx_dict_inputs to ge instead of eq.
* Style.
* Split each version requirement for torch.
* Let's compare versions directly.
* Import torch_version after checking pytorch is installed.
* @require_torch
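The "compare version directly" step can be sketched without importing torch at all. This is a hedged illustration (the helper name and parsing are assumptions; the real check reads `torch.__version__` in the ONNX export code): strip any local build tag such as `+cu111`, then compare the numeric components.

```python
def torch_version_sufficient(installed: str, minimum: str = "1.8.0") -> bool:
    # Compare dotted version strings numerically, ignoring a local
    # build suffix like "+cu111" (hypothetical helper, not the real API).
    parse = lambda v: tuple(int(part) for part in v.split("+")[0].split("."))
    return parse(installed) >= parse(minimum)
```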
-
- 28 Jul, 2021 12 commits
-
-
Will Frey authored
Change `PreTrainedConfig` -> `PretrainedConfig` in the docstring for `AutoTokenizer.from_pretrained(...)`.
-
Will Frey authored
Fix `config.decoder.__class` -> `config.decoder.__class__`
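The reason `__class` fails is Python's name mangling: inside a class body, any identifier with two leading underscores and at most one trailing underscore is rewritten to `_ClassName__name`, even on attribute access of another object. A small self-contained demonstration (class and method names are illustrative):

```python
class Wrapper:
    def broken_type_of(self, obj):
        # `obj.__class` is mangled to `obj._Wrapper__class` here,
        # which the wrapped object does not have -> AttributeError.
        return obj.__class

    def type_of(self, obj):
        # `__class__` (with trailing underscores) is the real attribute.
        return obj.__class__

w = Wrapper()
```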
-
Will Frey authored
Change `torch.Tensor` -> `torch.FloatTensor` in `TemperatureLogitsWarper` to be consistent with the `LogitsWarper` ABC signature annotation.
-
Will Frey authored
While `Iterable[Iterable[int]]` is a nicer annotation (it's covariant!), the defensive statements parsing out `bad_words_ids` in `__init__(...)` force the caller to pass in `List[List[int]]`. I've changed the annotation to make that clear.
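The point about the defensive statements can be made concrete with a trimmed-down sketch (the class name and messages here are hypothetical, not the library's exact code): the `isinstance(..., list)` checks reject anything that is merely iterable, so the honest annotation is `List[List[int]]`.

```python
from typing import List

class NoBadWordsSketch:
    # Hypothetical, simplified version of the __init__ validation.
    def __init__(self, bad_words_ids: List[List[int]]):
        if not isinstance(bad_words_ids, list) or len(bad_words_ids) == 0:
            raise ValueError("`bad_words_ids` has to be a non-empty list")
        if any(not isinstance(ids, list) for ids in bad_words_ids):
            raise ValueError("`bad_words_ids` has to be a list of lists")
        self.bad_words_ids = bad_words_ids
```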
-
chutaklee authored
* fix distiller
* fix style
-
Will Frey authored
`_BaseAutoModelClass` was missing `classmethod` decorators on the `from_config(...)` and `from_pretrained(...)` methods.
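Why the missing decorator matters: without `@classmethod`, calling the method on the class itself binds the first positional argument to `cls`, so alternate constructors break. A minimal demonstration with illustrative class names:

```python
class WithDecorator:
    @classmethod
    def from_config(cls, config):
        # `cls` is the class, so WithDecorator.from_config(cfg) works.
        return cls

class WithoutDecorator:
    def from_config(cls, config):
        # Plain function: calling on the class passes the config as
        # `cls` and leaves `config` missing.
        return cls
```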
-
Will Frey authored
Change `score` -> `scores` because the argument is not positional-only, so you need consistently named parameters for the subclasses. The subclasses appear to favor `scores` over `score`.
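The consistency requirement is easy to see in a sketch (illustrative class names, not the library's): since the parameter is not positional-only, callers may pass it by keyword, and a subclass that renames it breaks those calls.

```python
class BaseProcessor:
    # Base signature uses `scores`.
    def process(self, scores):
        raise NotImplementedError

class Renamed(BaseProcessor):
    def process(self, score):  # inconsistent name (hypothetical bug)
        return score

class Consistent(BaseProcessor):
    def process(self, scores):
        return scores
```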
-
Sylvain Gugger authored
-
Sylvain Gugger authored
-
Sylvain Gugger authored
-
Buddhi Chathuranga Senarathna authored
-
Elysium1436 authored
* Fixed train_test_split test_size argument
* `Seq2SeqTrainer` set max_length and num_beams only when non None (#12899)
  - set max_length and num_beams only when non None
  - fix instance variables
  - fix code style
* [FLAX] Minor fixes in CLM example (#12914)
  - readme: fix retrieval of vocab size for flax clm example
  - examples: fix flax clm example when using training/evaluation files
* Fix module path for symbolic_trace example

Co-authored-by: cchen-dialpad <47165889+cchen-dialpad@users.noreply.github.com>
Co-authored-by: Stefan Schweter <stefan@schweter.it>
Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>
-
- 27 Jul, 2021 3 commits
-
-
Sylvain Gugger authored
-
Stefan Schweter authored
* readme: fix retrieval of vocab size for flax clm example
* examples: fix flax clm example when using training/evaluation files
-
cchen-dialpad authored
* set max_length and num_beams only when non None
* fix instance variables
* fix code style
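The "only when non None" fix boils down to not letting an unset argument clobber the model-config defaults. A hedged sketch with a hypothetical helper (the real logic lives inside `Seq2SeqTrainer`):

```python
def resolve_generation_args(config_max_length, config_num_beams,
                            max_length=None, num_beams=None):
    # Explicit caller values win, but a None argument must not
    # overwrite the defaults coming from the model config.
    return {
        "max_length": max_length if max_length is not None else config_max_length,
        "num_beams": num_beams if num_beams is not None else config_num_beams,
    }
```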
-
- 26 Jul, 2021 10 commits
-
-
Sylvain Gugger authored
-
Sylvain Gugger authored
-
Sylvain Gugger authored
-
Nicolas Patry authored
* Better heuristic for token-classification pipeline.

  Relooking at the problem makes things actually much simpler: when we look at ids from a tokenizer, we have no way in **general** to recover whether some substring is part of a word or not. However, within the pipeline, with offsets, we still have access to the original string, so we can simply check whether the character preceding a token (if it exists) is a space. This will obviously be wrong for tokenizers that contain spaces within tokens, or where offsets include spaces too (don't think there are a lot of those). This heuristic is hopefully fully backward compatible and can still handle non-word-based tokenizers.
* Updating test with real values.
* We still need the older "correct" heuristic to prevent fusing punctuation.
* Adding a real warning when important.
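The heuristic described above can be sketched in a few lines (the function name is hypothetical; the pipeline works on offset mappings rather than a bare index): given the original sentence and a token's start offset, a token preceded by a space starts a new word, otherwise it continues the previous one.

```python
def starts_new_word(sentence: str, token_start: int) -> bool:
    # First token of the string always starts a word.
    if token_start == 0:
        return True
    # Otherwise, a space right before the offset means a word boundary.
    return sentence[token_start - 1] == " "
```

As noted in the commit, this misfires for tokenizers whose tokens or offsets include spaces, which is why the older heuristic is kept for punctuation.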
-
Matt authored
* Add new multiple-choice example, remove old one
-
Sylvain Gugger authored
-
Sylvain Gugger authored
-
Sylvain Gugger authored
* Add possibility to ignore imports in test_fetcher
* Style
-
Sylvain Gugger authored
-
Philip May authored
* add classifier_dropout to Electra
* no type annotations yet
* add classifier_dropout to Electra ForTokenClass.
* add classifier_dropout to bert
* add classifier_dropout to roberta
* add classifier_dropout to big_bird
* add classifier_dropout to mobilebert
* empty commit to trigger CI
* add classifier_dropout to reformer
* add classifier_dropout to ConvBERT
* add classifier_dropout to Albert

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
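A common pattern for adding such a head-specific dropout without breaking older configs is to fall back to the generic hidden dropout when the new value is unset. This sketch uses hypothetical minimal classes (`SimpleConfig`, `resolve_classifier_dropout` are illustrative names, not the library API):

```python
class SimpleConfig:
    # Minimal stand-in for a model config: classifier_dropout defaults
    # to None so existing serialized configs stay valid.
    def __init__(self, hidden_dropout_prob=0.1, classifier_dropout=None):
        self.hidden_dropout_prob = hidden_dropout_prob
        self.classifier_dropout = classifier_dropout

def resolve_classifier_dropout(config):
    # Use the head-specific value when set; otherwise keep the old
    # behavior of reusing the generic hidden dropout.
    return (
        config.classifier_dropout
        if config.classifier_dropout is not None
        else config.hidden_dropout_prob
    )
```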
-
- 24 Jul, 2021 2 commits
-
-
Lysandre authored
-
Thibault FEVRY authored
* Faster list concat for trainer_pt_utils.get_length_grouped_indices() (#11825)

  get_length_grouped_indices() in LengthGroupedSampler and DistributedLengthGroupedSampler is prohibitively slow for a large number of megabatches (in a test case it takes hours for ~270k megabatches with 100 items each) due to slow list concatenation with sum(megabatches, []). Resolves: #11795
* Replace double occurrences as the last step (#11367)
* [Flax] Fix PyTorch import error (#11839)
  - fix_torch_device_generate_test
  - remove @
  - change pytorch import to flax import
* Fix reference to XLNet (#11846)
* Switch mem metrics flag (#11851)
  - Switch mem metrics flag
  - Update src/transformers/training_args.py
* Fix flos single node (#11844)
  - fixing flos bug/typo ...

Co-authored-by: ctheodoris <cvtheodo@ds.dfci.harvard.edu>
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
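The slowness of `sum(megabatches, [])` and the standard fix can be shown directly (the data here is synthetic; the commit's actual replacement may differ in detail): each `+` in the `sum` copies the whole accumulated list, giving quadratic cost, while `itertools.chain.from_iterable` walks each sublist exactly once.

```python
from itertools import chain

# Synthetic megabatches: 1000 sublists of 2 indices each.
megabatches = [[2 * i, 2 * i + 1] for i in range(1000)]

# O(n^2): sum() builds a new list on every addition, which is what
# made ~270k megabatches take hours.
slow = sum(megabatches, [])

# O(n): chain.from_iterable concatenates in a single pass.
fast = list(chain.from_iterable(megabatches))
```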
-
- 23 Jul, 2021 3 commits
-
-
Patrick von Platen authored
* fix_torch_device_generate_test
* remove @
* add truncate
* finish
* correct test
* Apply suggestions from code review
* clean tests
* correct normalization for truncation
* remove casting
* up
* save intermed
* finish
* finish
* correct

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Stas Bekman authored
-
Patrick von Platen authored
* fix_torch_device_generate_test
* remove @
* pin git python
* make style
* typo
-
- 22 Jul, 2021 4 commits
-
-
Nicolas Patry authored
* Proposal
* Testing pipelines slightly better.
  - Overall same design
  - Metaclass to get proper different tests instead of subTest (not well supported by Pytest)
  - Added ANY meta object to make output checking more readable.
  - Skipping architectures either without tiny_config or without architecture.
* Small fix.
* Fixing the tests in case of None value.
* Oups.
* Rebased with more architectures.
* Fixing reformer tests (no override anymore).
* Adding more options for model tester config.

Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
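The "ANY meta object" idea can be sketched as a placeholder that compares equal to any value of the given types (this is a simplified illustration, assuming the shape of such a helper rather than the library's exact implementation): expected pipeline outputs stay readable when exact scores are not deterministic.

```python
class ANY:
    # Placeholder that equals any value of the given types.
    def __init__(self, *types):
        self.types = types

    def __eq__(self, other):
        # Equality delegates to an isinstance check, so comparing a
        # dict of real outputs against a dict of ANY(...) templates works.
        return isinstance(other, self.types)

    def __repr__(self):
        return f"ANY({', '.join(t.__name__ for t in self.types)})"
```

Usage: an expected output like `{"label": "POSITIVE", "score": ANY(float)}` matches any float score while still pinning the label.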
-
Lysandre authored
-
Lysandre authored
-
Maxwell Forbes authored
-