- 10 Jun, 2020 3 commits
-
Sylvain Gugger authored
-
Sylvain Gugger authored
-
Patrick von Platen authored
* fix doc
* add format file
* add output attentions to all docs
* add also for bart
* fix naming
* re-add doc to config
- 09 Jun, 2020 1 commit
-
Bharat Raghunathan authored
* DOC: Replace instances of ``config.output_attentions`` with function argument ``output_attentions``
* DOC: Apply Black Formatting
* Fix errors where output_attentions was undefined
* Remove output_attentions in classes per review
* Fix regressions on tests having `output_attention`
* Fix further regressions in tests relating to `output_attentions`
  Ensure proper propagation of `output_attentions` as a function parameter to all model subclasses
* Fix more regressions in `test_output_attentions`
* Fix issues with BertEncoder
* Rename related variables to `output_attentions`
* fix pytorch tests
* fix bert and gpt2 tf
* Fix most TF tests for `test_output_attentions`
* Fix linter errors and more TF tests
* fix conflicts
* DOC: Apply Black Formatting
* Fix errors where output_attentions was undefined
* Remove output_attentions in classes per review
* Fix regressions on tests having `output_attention`
* fix conflicts
* fix conflicts
* fix conflicts
* fix conflicts
* fix pytorch tests
* fix conflicts
* fix conflicts
* Fix linter errors and more TF tests
* fix tf tests
* make style
* fix isort
* improve output_attentions
* improve tensorflow

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
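This commit turns `output_attentions` into a per-call function argument instead of a `config.output_attentions` flag. A minimal sketch of the resulting call pattern, using `bert-base-uncased` and a throwaway sentence purely as illustrative inputs:

```python
import torch
from transformers import BertModel, BertTokenizer

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertModel.from_pretrained("bert-base-uncased")

inputs = tokenizer("Attention masks are fun.", return_tensors="pt")

# Attentions are now requested per forward call, not via config.output_attentions.
with torch.no_grad():
    outputs = model(**inputs, output_attentions=True)

# One attention tensor per layer, each of shape (batch_size, num_heads, seq_len, seq_len).
attentions = outputs[-1]
print(len(attentions), attentions[0].shape)
```

Depending on the library version, the attention weights are the last element of the returned tuple or available as `outputs.attentions`.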
- 08 Jun, 2020 2 commits
-
ZhuBaohe authored
-
Sylvain Gugger authored
* Clean documentation
- 03 Jun, 2020 1 commit
-
Sylvain Gugger authored
* Deprecate masked_lm_labels argument
* Apply to all models
* Better error message
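Since this commit deprecates `masked_lm_labels` in favor of the unified `labels` argument, here is a small sketch of the updated call; the checkpoint and sentence are chosen only for illustration:

```python
import torch
from transformers import BertForMaskedLM, BertTokenizer

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertForMaskedLM.from_pretrained("bert-base-uncased")

inputs = tokenizer("The capital of France is [MASK].", return_tensors="pt")

# `masked_lm_labels=...` now triggers a deprecation warning; the unified `labels`
# argument is used instead (positions set to -100 are ignored by the loss).
outputs = model(**inputs, labels=inputs["input_ids"])
loss = outputs[0]
print(float(loss))
```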
- 02 Jun, 2020 1 commit
-
Julien Chaumond authored
* Kill model archive maps
* Fixup
* Also kill model_archive_map for MaskedBertPreTrainedModel
* Unhook config_archive_map
* Tokenizers: align with model id changes
* make style && make quality
* Fix CI
- 29 May, 2020 4 commits
-
Wei Fang authored
* Fix longformer attention mask casting when using apex
* remove extra type casting
-
Patrick von Platen authored
* better api
* improve automatic setting of global attention mask
* fix longformer bug
* fix global attention mask in test
* fix global attn mask flatten
* fix slow tests
* update docstring
* update docs and make more robust
* improve attention mask
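A rough sketch of the `global_attention_mask` convention this commit settles on (0 for local sliding-window attention, 1 for global attention); the checkpoint and the choice of globally attending token are assumptions for illustration:

```python
import torch
from transformers import LongformerModel, LongformerTokenizer

tokenizer = LongformerTokenizer.from_pretrained("allenai/longformer-base-4096")
model = LongformerModel.from_pretrained("allenai/longformer-base-4096")

inputs = tokenizer("A very long document " * 200, return_tensors="pt")

# 0 = local sliding-window attention, 1 = global attention.
# Here only the first (<s>) token attends globally; the task-specific heads can
# pick a sensible default automatically when no mask is provided.
global_attention_mask = torch.zeros_like(inputs["input_ids"])
global_attention_mask[:, 0] = 1

with torch.no_grad():
    outputs = model(**inputs, global_attention_mask=global_attention_mask)
print(outputs[0].shape)  # last hidden state: (batch_size, sequence_length, hidden_size)
```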
-
Patrick von Platen authored
* add multiple choice for longformer
* add models to docs
* adapt docstring
* add test to longformer
* add longformer for mc in init and modeling auto
* fix tests
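A hedged sketch of the new multiple-choice head; the checkpoint, prompt, and choices below are invented, and in practice the randomly initialized classification head would be fine-tuned first:

```python
import torch
from transformers import LongformerForMultipleChoice, LongformerTokenizer

tokenizer = LongformerTokenizer.from_pretrained("allenai/longformer-base-4096")
model = LongformerForMultipleChoice.from_pretrained("allenai/longformer-base-4096")

prompt = "The Longformer attention pattern scales linearly because"
choices = ["it uses a sliding window plus a few global tokens.",
           "it is a kind of cheese."]

# Multiple-choice models expect tensors of shape (batch_size, num_choices, seq_len),
# so the prompt is paired with each choice and the batch axis is added afterwards.
encoding = tokenizer([prompt, prompt], choices, return_tensors="pt", padding=True)
inputs = {k: v.unsqueeze(0) for k, v in encoding.items()}

with torch.no_grad():
    logits = model(**inputs)[0]  # (batch_size, num_choices)
print(logits.argmax(dim=-1))
```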
-
Iz Beltagy authored
* fix longformer model names in examples
* a better name for the notebook
- 28 May, 2020 2 commits
-
Suraj Patil authored
-
Iz Beltagy authored
* adding freeze roberta models
* model cards
* lint
- 27 May, 2020 1 commit
-
Suraj Patil authored
* LongformerForSequenceClassification
* better naming x => hidden_states, fix typo in doc
* Update src/transformers/modeling_longformer.py
* Update src/transformers/modeling_longformer.py

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
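The commit above adds `LongformerForSequenceClassification`; a minimal sketch of calling it, with an illustrative checkpoint and input and a classifier head that would normally be fine-tuned before use:

```python
import torch
from transformers import LongformerForSequenceClassification, LongformerTokenizer

tokenizer = LongformerTokenizer.from_pretrained("allenai/longformer-base-4096")
model = LongformerForSequenceClassification.from_pretrained(
    "allenai/longformer-base-4096", num_labels=2
)

inputs = tokenizer("A very long review " * 300, return_tensors="pt", truncation=True)

# When no global_attention_mask is passed, the head places global attention
# on the <s> token automatically.
with torch.no_grad():
    logits = model(**inputs)[0]  # (batch_size, num_labels)
print(logits.softmax(dim=-1))
```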
- 26 May, 2020 1 commit
-
Patrick von Platen authored
* add new longformer for question answering model
* add new config as well
* fix links
* fix links part 2
- 25 May, 2020 1 commit
-
Suraj Patil authored
* added LongformerForQuestionAnswering
* add LongformerForQuestionAnswering
* fix import for LongformerForMaskedLM
* add LongformerForQuestionAnswering
* hardcoded sep_token_id
* compute attention_mask if not provided
* combine global_attention_mask with attention_mask when provided
* update example in docstring
* add assert error messages, better attention combine
* add test for LongformerForQuestionAnswering
* typo
* cast global_attention_mask to long
* make style
* Update src/transformers/configuration_longformer.py
* Update src/transformers/configuration_longformer.py
* fix the code quality
* Merge branch 'longformer-for-question-answering' of https://github.com/patil-suraj/transformers into longformer-for-question-answering

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
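A sketch of extractive question answering with the `LongformerForQuestionAnswering` head added here, assuming the `allenai/longformer-large-4096-finetuned-triviaqa` weights as an example checkpoint:

```python
import torch
from transformers import LongformerForQuestionAnswering, LongformerTokenizer

name = "allenai/longformer-large-4096-finetuned-triviaqa"
tokenizer = LongformerTokenizer.from_pretrained(name)
model = LongformerForQuestionAnswering.from_pretrained(name)

question = "Who wrote the play Hamlet?"
context = "Hamlet is a tragedy written by William Shakespeare sometime between 1599 and 1601."

# Encoding question and context as a pair lets the model place global attention
# on the question tokens automatically (it locates them via sep_token_id) when
# no global_attention_mask is supplied.
inputs = tokenizer(question, context, return_tensors="pt")

with torch.no_grad():
    outputs = model(**inputs)
start = int(outputs[0].argmax())  # start_logits
end = int(outputs[1].argmax())    # end_logits
answer = tokenizer.decode(inputs["input_ids"][0][start : end + 1])
print(answer)
```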
- 19 May, 2020 2 commits
-
Patrick von Platen authored
* add longformer docs
* improve docs
-
Iz Beltagy authored
* first commit
* bug fixes
* better examples
* undo padding
* remove wrong VOCAB_FILES_NAMES
* License
* make style
* make isort happy
* unit tests
* integration test
* make `black` happy by undoing `isort` changes!!
* lint
* no need for the padding value
* batch_size not bsz
* remove unused type casting
* seqlen not seq_len
* staticmethod
* `bert` selfattention instead of `n2`
* uint8 instead of bool + lints
* pad inputs_embeds using embeddings not a constant
* black
* unit test with padding
* fix unit tests
* remove redundant unit test
* upload model weights
* resolve todo
* simpler _mask_invalid_locations without lru_cache + backward compatible masked_fill_
* increase unittest coverage
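The initial implementation above computes local sliding-window attention whose width is set by `attention_window` in the config (one value per layer, or a single int applied to all layers). A small sketch of inspecting and overriding it; the checkpoint name and the 256-token window are illustrative assumptions, not recommended settings:

```python
from transformers import LongformerConfig, LongformerModel

# The pretrained config stores the per-layer window sizes.
config = LongformerConfig.from_pretrained("allenai/longformer-base-4096")
print(config.attention_window)

# A smaller window trades context for speed and memory; passing the override
# to from_pretrained updates the config without changing any weight shapes.
model = LongformerModel.from_pretrained(
    "allenai/longformer-base-4096", attention_window=256
)
print(model.config.attention_window)
```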