Commits · b7fc043fce413e0a01da0e0dc0b2df23e9298a62 · chenpangpang / transformers

23 Apr, 2021 1 commit

Fix cross-attention head mask for Torch encoder-decoder models (#10605) · e3ff165a

Daniel Stancl authored Apr 23, 2021

* Fix cross-attention head mask for Torch BART models

* Fix head masking for cross-attention module for the following
models: BART, Blenderbot, Blenderbot_small, M2M_100, Marian, MBart,
Pegasus

* Enable test_headmasking for M2M_100 model

* Fix cross_head_mask for FSMT, LED and T5

* This commit fixes `head_mask` for cross-attention modules
in the following models: FSMT, LED, T5

* It also contains some smaller changes in doc so that
it is be perfectly clear the shape of `cross_head_mask`
is the same as of `decoder_head_mask`

* Update template

* Fix template for BartForCausalLM

* Fix cross_head_mask for Speech2Text models

* Fix cross_head_mask in templates

* Fix args order in BartForCausalLM template

* Fix doc in BART templates

* Make more explicit naming

* `cross_head_mask` -> `cross_attn_head_mask`

* `cross_layer_head_mask` -> `cross_attn_layer_head_mask`

* Fix doc

* make style quality

* Fix speech2text docstring

e3ff165a

08 Apr, 2021 1 commit
- Add support for multiple models for one config in auto classes (#11150) · ba8b1f47
  Sylvain Gugger authored Apr 08, 2021
```
* Add support for multiple models for one config in auto classes

* Use get_values everywhere

* Prettier doc
```
  ba8b1f47
02 Feb, 2021 1 commit

Add head_mask and decoder_head_mask to PyTorch LED (#9856) · 71bdc076

Daniel Stancl authored Feb 02, 2021

* Add {decoder_,}head_mask to LED

* Fix create_custom_forward signatue in encoder

* Add head_mask to longformer

* Add head_mask to longformer to fix dependencies
of LED on Longformer.

* Not working yet

* Add mising one input in longofrmer_modeling.py

* make fix-copies

71bdc076

21 Jan, 2021 1 commit
- reduce led memory (#9723) · c8ea582e
  Patrick von Platen authored Jan 21, 2021
  
  c8ea582e
07 Jan, 2021 1 commit
- [LED Test] fix common inputs pt for flaky pt-tf led test (#9459) · a400fe89
  Patrick von Platen authored Jan 07, 2021
```
* fix common inputs pt flakey led

* fix other tests correspondingly
```
  a400fe89
06 Jan, 2021 1 commit

[GenerationOutputs] Fix GenerationOutputs Tests (#9443) · b8462b5b

Patrick von Platen authored Jan 06, 2021

* fix generation models

* fix led

* fix docs

* add is_decoder

* fix last docstrings

* make style

* fix t5 cross attentions

* correct t5

b8462b5b

05 Jan, 2021 1 commit

LED (#9278) · 189387e9

Patrick von Platen authored Jan 05, 2021

* create model

* add integration

* save current state

* make integration tests pass

* add one more test

* add explanation to tests

* remove from bart

* add padding

* remove unnecessary test

* make all tests pass

* re-add cookie cutter tests

* finish PyTorch

* fix attention test

* Update tests/test_modeling_common.py

* revert change

* remove unused file

* add string to doc

* save intermediate

* make tf integration tests pass

* finish tf

* fix doc

* fix docs again

* add led to doctree

* add to auto tokenizer

* added tips for led

* make style

* apply jplus statements

* correct tf longformer

* apply lysandres suggestions

* apply sylvains suggestions

* Apply suggestions from code review

189387e9