Commits · 04a9709c2700e2f2cf4245f389f3c6e86d314e26 · chenpangpang / transformers

31 May, 2021 1 commit
- Remove `datasets` submodule · 04a9709c
  Lysandre authored May 31, 2021
  
  04a9709c
28 May, 2021 3 commits

Test optuna and ray (#11924) · 8d171628
Lysandre Debut authored May 28, 2021

8d171628

[Flax] Return Attention from BERT, ELECTRA, RoBERTa and GPT2 (#11918) · af1a10bf

Jayendra authored May 28, 2021



* Added logic to return attention from flax-bert model and added test cases to check that

* Added new line at the end of file to test_modeling_flax_common.py

* fixing code style

* Fixing Roberta and Elextra models too from cpoying bert

* Added temporary hack to not run test_attention_outputs for FlaxGPT2

* Returning attention weights from GPT2 and changed the tests accordingly.

* last fixes

* bump flax dependency
Co-authored-by: jayendra <jayendra@infocusp.in>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

af1a10bf

Added Sequence Classification class in GPTNeo (#11906) · e1205e47
Bhadresh Savani authored May 28, 2021
```
* seq classification changes

* fix tests
```
e1205e47

27 May, 2021 3 commits

Adding new argument `max_new_tokens` for generate. (#11476) · 80d712fa

Nicolas Patry authored May 27, 2021

* Adding new argument `max_new_tokens` for generate.

This is a proposal to add a new argument `max_new_tokens` to `generate`.
This include a `MaxNewTokensCriteria` that enables callers that don't
know about the token length ahead (like pipelines callers) to manage
more easily the length of their generated output.

* Adding a test for the user warning when both`max_length` and
`max_new_tokens` are used together.

* Removed redundant `no_grad`.

80d712fa

Update deepspeed config to reflect hyperparameter search parameters (#11896) · 2dd6fb25
Josh Tanner authored May 27, 2021
```
* rebuild deepspeed config for hyperparameter search

* reformat code to fix style issues
```
2dd6fb25
Add Emotion Speech Noteboook (#11900) · 42fe0dc2
Patrick von Platen authored May 27, 2021

42fe0dc2

26 May, 2021 7 commits

Flax Generate (#11777) · 996a315e

Patrick von Platen authored May 27, 2021



* fix_torch_device_generate_test

* remove @

* add

* indexing

* correct a couple of tests

* fix tests

* add logits processor

* finish top_k, top_p, temp

* add docs

* correct flax prng key default

* improve generate

* add generation docs

* add docs

* make style

* revert model outputs change

* make style

* correct typo

* fix tests

* fix slow test

* add raise

* finish generation
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

996a315e

Link official Cloud TPU JAX docs (#11892) · 2df54691
Avital Oliver authored May 26, 2021

2df54691

changing find_batch_size to work with tokenizer outputs (#11890) · 1530384e

joerenner authored May 26, 2021



* changing find_batch_size to work with tokenizer outputs

trainer_pt_utils.find_batch_size does not recognize the batch size of BatchEncoding objects. This can cause an error when a trainer relies on find_batch_size to report the number of observed examples in the evaluation loop.

* Trigger CI
Co-authored-by: jrenner <joseph.renner@inria.fr>

1530384e

[Flax] Allow dataclasses to be jitted (#11886) · d5a72b6e

Patrick von Platen authored May 26, 2021

* fix_torch_device_generate_test

* remove @

* change dataclasses to flax ones

* fix typo

* fix jitted tests

* fix bert & electra

d5a72b6e

Correcting comments in T5Stack to reflect correct tuple order (#11330) · e6126e19

talkhaldi authored May 26, 2021



* Correcting comments to reflect correct tuple order

In order to match the actual order (line 513 and 516, and as accessed in 968), I've changed the order mentioned in comments L962 and L966-967.

* Update modeling_t5.py

Updating another comment as well

* Removing extra space

* Fixing style and quality

* style & quality

* Update src/transformers/models/t5/modeling_t5.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

e6126e19

Fix usage of head masks by TF encoder-decoder models' `generate()` function (#11775) · 0b933584

Daniel Stancl authored May 26, 2021

* Fix Bart

* Fix Blenderbot{,_small}

* Fix LED

* Fix Marian

* Fix MBart

* Fix Pegasus

* Fix T5

* Add test for generation with head_mask

* Add a common TF test

* Override a test for the LED model as head masking is not yet properly implemented

* Remove all head_masks from input preparation for LED

* Drop masking for T5 as it needs a bit of refactor

0b933584

Ensure input tensor are on device. (#11874) · 0b0a5984

francescorubbo authored May 26, 2021

The feature extractor does not create tensors on the appropriate device,
so we call `ensure_tensor_on_device` before feeding the processed inputs
to the model.

0b0a5984

25 May, 2021 9 commits
- [Wav2Vec2ForCTC] example typo fixed (#11878) · a9c797f9
  Ahmet Akkoç authored May 26, 2021
  
  a9c797f9
- [Examples] create model with custom config on the fly (#11798) · 1b653010
  Stas Bekman authored May 25, 2021
```
* create custom model on the flight

* better wording

* add update_from_string

* cleanup

* cleanup

* Update src/transformers/configuration_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* more bool options

* style

* fix logger

* add test

* add the doc

* assert on conflict of options
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
```
  1b653010
- [lm examples] fix overflow in perplexity calc (#11855) · 6287c929
  Stas Bekman authored May 25, 2021
```
* fix overflow in perplexity calc

* use inf

* fix
```
  6287c929
- [Wav2Vec2] SpecAugment Fast (#11764) · 7630c11f
  Patrick von Platen authored May 25, 2021
```
* first try

* finish
```
  7630c11f
- Add option to log only once in multinode training (#11819) · f086652b
  Sylvain Gugger authored May 25, 2021
```
* Add option to long only once in multinode training

* Use an alternate property
```
  f086652b
- typo (#11858) · b8344a27
  Wang Ran (汪然) authored May 25, 2021
  
  b8344a27
- fixed a small typo in the doc (#11856) · f9880f62
  Shiro T authored May 25, 2021
  
  f9880f62
- Enable memory metrics in tests that need it (#11859) · 6da129cb
  Lysandre Debut authored May 25, 2021
  
  6da129cb
- Add some tests to the slow suite #11860 · db0b2477
  Lysandre Debut authored May 25, 2021
  
  db0b2477
24 May, 2021 7 commits

[Trainer] Report both steps and num samples per second (#11818) · afe479ad

Sylvain Gugger authored May 24, 2021



* [Trainer] Report both steps and num samples per second

* Fix batch number

* Update src/transformers/trainer_utils.py
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Address review comments
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

afe479ad

Fix two typos in docs (#11852) · eaab9397
Nick Lane-Smith authored May 24, 2021
```
* typo2

* fix typo
```
eaab9397

Fix flos single node (#11844) · 8a2a3a25

Teven authored May 24, 2021

* fixing flos bug/typo in non-distributed setting

* storing flos every logging_interval

8a2a3a25

Switch mem metrics flag (#11851) · adb785b0

Sylvain Gugger authored May 24, 2021



* Switch mem metrics flag

* Update src/transformers/training_args.py
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

adb785b0

Fix reference to XLNet (#11846) · fcdb85e9
Sylvain Gugger authored May 24, 2021

fcdb85e9
[Flax] Fix PyTorch import error (#11839) · f5806041
Patrick von Platen authored May 24, 2021
```
* fix_torch_device_generate_test

* remove @

* change pytorch import to flax import
```
f5806041
Replace double occurrences as the last step (#11367) · 0cbddfb1
Lysandre Debut authored May 24, 2021

0cbddfb1

22 May, 2021 1 commit

Faster list concat for trainer_pt_utils.get_length_grouped_indices() (#11825) · 73fde1de

ctheodoris authored May 22, 2021

get_length_grouped_indices() in LengthGroupedSampler and DistributedLengthGroupedSampler
is prohibitively slow for large number of megabatches (in test case takes hours for ~270k
megabatches with 100 items each) due to slow list concatenation with sum(megabatches, []).

Resolves: #11795
Co-authored-by: ctheodoris <cvtheodo@ds.dfci.harvard.edu>

73fde1de

21 May, 2021 7 commits
- Add flax text class colab (#11824) · da22245e
  Patrick von Platen authored May 21, 2021
```
* fix_torch_device_generate_test

* remove @

* add flax glue link
```
  da22245e
- [Deepspeed] support `zero.Init` in `from_config` (#11805) · a26f4d62
  Stas Bekman authored May 21, 2021
```
* support zero.Init in from_config

* no need for eval test
```
  a26f4d62
- [Flax] Small fixes in `run_flax_glue.py` (#11820) · 82335185
  Patrick von Platen authored May 21, 2021
```
* fix_torch_device_generate_test

* remove @

* correct best seed for flax fine-tuning
Co-authored-by: Patrick von Platen <patrick@huggingface.co>
```
  82335185
- Avoid TensorFlow import in Trainer · b8697bc6
  Sylvain Gugger authored May 21, 2021
  
  b8697bc6
- fix roformer config doc (#11813) · e2c1dd09
  yujun authored May 21, 2021
  
  e2c1dd09
- Patch recursive import (#11812) · 1b652295
  Lysandre Debut authored May 21, 2021
  
  1b652295
- [Flax] Align GLUE training script with mlm training script (#11778) · bd987165
  Patrick von Platen authored May 21, 2021
```
* speed up flax glue

* remove unnecessary line

* remove folder

* remove run in loop
Co-authored-by: Patrick von Platen <patrick@huggingface.co>
```
  bd987165
20 May, 2021 2 commits

Fix failing test on Windows Platform (#11589) · 22394387

Keren Fuentes authored May 20, 2021

* add separator for windows

* fixes test_is_copy_consistent on Windows

* fixing writing encoding issue on extended test (for Windows)

* resolving comments

22394387

A cleaner and more scalable implementation of symbolic tracing (#11763) · f4a0d6ff

Michael Benayoun authored May 20, 2021



Cleaner and more scalable implementation of symbolic tracing with torch.fx, and provides support for new architectures:
- ALBERT
- DistilBERT
- MobileBERT
- MegatronBERT
- GPT2
- GPT Neo
Co-authored-by: Michael Benayoun <michael@huggingface.co>

f4a0d6ff