Commits · 3b3619a327df3c273050a5bc1d1fd7a710cf979a · chenpangpang / transformers

10 Jun, 2020 1 commit
- [All models] fix docs after adding output attentions to all forward functions (#4909) · 3b3619a3
  Patrick von Platen authored Jun 10, 2020
```
* fix doc

* add format file

* add output attentions to all docs

* add also for bart

* fix naming

* re-add doc to config
```
  3b3619a3
09 Jun, 2020 1 commit

[All models] Extend config.output_attentions with output_attentions function arguments (#4538) · 6e603cb7

Bharat Raghunathan authored Jun 10, 2020



* DOC: Replace instances of ``config.output_attentions`` with function argument ``output_attentions``

* DOC: Apply Black Formatting

* Fix errors where output_attentions was undefined

* Remove output_attentions in classes per review

* Fix regressions on tests having `output_attention`

* Fix further regressions in tests relating to `output_attentions`

Ensure proper propagation of `output_attentions` as a function parameter
to all model subclasses

* Fix more regressions in `test_output_attentions`

* Fix issues with BertEncoder

* Rename related variables to `output_attentions`

* fix pytorch tests

* fix bert and gpt2 tf

* Fix most TF tests for `test_output_attentions`

* Fix linter errors and more TF tests

* fix conflicts

* DOC: Apply Black Formatting

* Fix errors where output_attentions was undefined

* Remove output_attentions in classes per review

* Fix regressions on tests having `output_attention`

* fix conflicts

* fix conflicts

* fix conflicts

* fix conflicts

* fix pytorch tests

* fix conflicts

* fix conflicts

* Fix linter errors and more TF tests

* fix tf tests

* make style

* fix isort

* improve output_attentions

* improve tensorflow
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

6e603cb7

04 Jun, 2020 1 commit

Tensorflow improvements (#4530) · f9414f75

Julien Plu authored Jun 05, 2020



* Better None gradients handling

* Apply Style

* Apply Style

* Create a loss class per task to compute its respective loss

* Add loss classes to the ALBERT TF models

* Add loss classes to the BERT TF models

* Add question answering and multiple choice to TF Camembert

* Remove prints

* Add multiple choice model to TF DistilBERT + loss computation

* Add question answering model to TF Electra + loss computation

* Add token classification, question answering and multiple choice models to TF Flaubert

* Add multiple choice model to TF Roberta + loss computation

* Add multiple choice model to TF XLM + loss computation

* Add multiple choice and question answering models to TF XLM-Roberta

* Add multiple choice model to TF XLNet + loss computation

* Remove unused parameters

* Add task loss classes

* Reorder TF imports + add new model classes

* Add new model classes

* Bugfix in TF T5 model

* Bugfix for TF T5 tests

* Bugfix in TF T5 model

* Fix TF T5 model tests

* Fix T5 tests + some renaming

* Fix inheritance issue in the AutoX tests

* Add tests for TF Flaubert and TF XLM Roberta

* Add tests for TF Flaubert and TF XLM Roberta

* Remove unused piece of code in the TF trainer

* bugfix and remove unused code

* Bugfix for TF 2.2

* Apply Style

* Divide TFSequenceClassificationAndMultipleChoiceLoss into their two respective name

* Apply style

* Mirror the PT Trainer in the TF one: fp16, optimizers and tb_writer as class parameter and better dataset handling

* Fix TF optimizations tests and apply style

* Remove useless parameter

* Bugfix and apply style

* Fix TF Trainer prediction

* Now the TF models return the loss such as their PyTorch couterparts

* Apply Style

* Ignore some tests output

* Take into account the SQuAD cls_index, p_mask and is_impossible parameters for the QuestionAnswering task models.

* Fix names for SQuAD data

* Apply Style

* Fix conflicts with 2.11 release

* Fix conflicts with 2.11

* Fix wrongname

* Add better documentation on the new create_optimizer function

* Fix isort

* logging_dir: use same default as PyTorch
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

f9414f75

02 Jun, 2020 1 commit

Kill model archive maps (#4636) · d4c2cb40

Julien Chaumond authored Jun 02, 2020

* Kill model archive maps

* Fixup

* Also kill model_archive_map for MaskedBertPreTrainedModel

* Unhook config_archive_map

* Tokenizers: align with model id changes

* make style && make quality

* Fix CI

d4c2cb40

29 May, 2020 1 commit

[Longformer] Multiple choice for longformer (#4645) · 9c172564

Patrick von Platen authored May 29, 2020

* add multiple choice for longformer

* add models to docs

* adapt docstring

* add test to longformer

* add longformer for mc in init and modeling auto

* fix tests

9c172564

28 May, 2020 1 commit
- gpt2 typo (#4629) · b5015a2a
  flozi00 authored May 28, 2020
```
* gpt2 typo

* Add files via upload
```
  b5015a2a
12 May, 2020 1 commit

Add MultipleChoice to TFTrainer [WIP] (#4270) · e4512aab

Viktor Alm authored May 12, 2020



* catch gpu len 1 set to gpu0

* Add mpc to trainer

* Add MPC for TF

* fix TF automodel for MPC and add Albert

* Apply style

* Fix import

* Note to self: double check

* Make shape None, None for datasetgenerator output shapes

* Add from_pt bool which doesnt seem to work

* Original checkpoint dir

* Fix docstrings for automodel

* Update readme and apply style

* Colab should probably not be from users

* Colabs should probably not be from users

* Add colab

* Update README.md

* Update README.md

* Cleanup __intit__

* Cleanup flake8 trailing comma

* Update src/transformers/training_args_tf.py

* Update src/transformers/modeling_tf_auto.py
Co-authored-by: Viktor Alm <viktoralm@pop-os.localdomain>
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

e4512aab

07 May, 2020 2 commits
- Add AlbertForPreTraining and TFAlbertForPreTraining models. (#4057) · 8bf73126
  Jared T Nielsen authored May 07, 2020
```
* Add AlbertForPreTraining and TFAlbertForPreTraining models.

* PyTorch conversion

* TensorFlow conversion

* style
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
```
  8bf73126
- [doc] Fix broken links + remove crazy big notebook · c99fe038
  Julien Chaumond authored May 07, 2020
  
  c99fe038
30 Apr, 2020 1 commit
- Fix TF input docstrings to refer to tf.Tensor rather than torch.FloatTensor. (#4051) · 64070cbb
  Jared T Nielsen authored Apr 30, 2020
  
  64070cbb
29 Apr, 2020 1 commit

CDN urls (#4030) · 455c6390

Julien Chaumond authored Apr 28, 2020

* [file_utils] use_cdn + documentation

* Move to cdn. urls for weights

* [urls] Hotfix for bert-base-japanese

455c6390

23 Apr, 2020 1 commit

Fix TFAlbertForSequenceClassification classifier dropout probability. It was... · a79a9e12

Jared T Nielsen authored Apr 23, 2020

Fix TFAlbertForSequenceClassification classifier dropout probability. It was set to config.hidden_dropout_prob, but should be config.classifier_dropout_prob. (#3928)

a79a9e12

17 Apr, 2020 1 commit

Question Answering support for Albert and Roberta in TF (#3812) · 6d00033e

Pierric Cistac authored Apr 17, 2020

* Add TFAlbertForQuestionAnswering

* Add TFRobertaForQuestionAnswering

* Update TFAutoModel with Roberta/Albert for QA

* Clean `super` TF Albert calls

6d00033e

08 Apr, 2020 1 commit
- Updating the TensorFlow models to work as expected with tokenizers v3.0.0 (#3684) · 6435b9f9
  Lysandre Debut authored Apr 08, 2020
```
* Updating modeling tf files; adding tests

* Merge `encode_plus` and `batch_encode_plus`
```
  6435b9f9
04 Mar, 2020 1 commit
- Explicit config_class instead of module inspection · 4f338ed4
  Gunnlaugur Thor Briem authored Mar 04, 2020
  
  4f338ed4
03 Mar, 2020 3 commits

fix: passing config as Layer trainable param · b1116fd6
Gunnlaugur Thor Briem authored Mar 03, 2020
```
Lurking bugs discovered while working on other stuff.
```
b1116fd6

Use class decorator instead of superclass · 0c716ede

Gunnlaugur Thor Briem authored Mar 03, 2020

When supplied by Keras deserialization, the config parameter to initializers
will be a dict. So intercept it and convert to PretrainedConfig object (and
store in instance attribute for get_config to get at it) before passing to the
actual initializer. To accomplish this, and repeat as little code as possible,
use a class decorator on TF*MainLayer classes.

0c716ede

Support keras JSON/HDF5 serialization of main layers · ba281707
Gunnlaugur Thor Briem authored Mar 03, 2020
```
Fixes #3101
```
ba281707

19 Feb, 2020 2 commits
- Patch ALBERT with heads in TensorFlow · 2708b44e
  Lysandre authored Feb 19, 2020
  
  2708b44e
- Patch ALBERT with heads in TensorFlow · 1abd53b1
  Lysandre authored Feb 19, 2020
  
  1abd53b1
23 Jan, 2020 5 commits
- Tips + whitespaces · 9ddf60b6
  Lysandre authored Jan 21, 2020
  
  9ddf60b6
- Fixes · 0e9899f4
  Lysandre authored Jan 20, 2020
  
  0e9899f4
- PyTorch CTRL + Style · 7511f3dd
  Lysandre authored Jan 20, 2020
  
  7511f3dd
- BERT TensorFlow · dbeb7fb4
  Lysandre authored Jan 16, 2020
  
  dbeb7fb4
- TF ALBERT + TF Utilities + Fix warnings · 3922a249
  Lysandre authored Jan 15, 2020
  
  3922a249
15 Jan, 2020 1 commit
- 💄 super · 83a41d39
  Julien Chaumond authored Jan 15, 2020
  
  83a41d39
07 Jan, 2020 1 commit
- Fix typograpical errors (#2438) · d6a677b1
  Genta Indra Winata authored Jan 08, 2020
  
  d6a677b1
06 Jan, 2020 2 commits
- GPU text generation: mMoved the encoded_prompt to correct device · 81d6841b
  alberduris authored Dec 31, 2019
  
  81d6841b
- Moved the encoded_prompts to correct device · dd4df80f
  alberduris authored Dec 31, 2019
  
  dd4df80f
22 Dec, 2019 7 commits
- Remove sys.version_info[0] == 2 or 3. · 798b3b38
  Aymeric Augustin authored Dec 22, 2019
  
  798b3b38
- Remove __future__ imports. · c824d15a
  Aymeric Augustin authored Dec 22, 2019
  
  c824d15a
- Move source code inside a src subdirectory. · 6be7cdda
  Aymeric Augustin authored Dec 22, 2019
```
This prevents transformers from being importable simply because the CWD
is the root of the git repository, while not being importable from other
directories. That led to inconsistent behavior, especially in examples.

Once you fetch this commit, in your dev environment, you must run:

    $ pip uninstall transformers
    $ pip install -e .
```
  6be7cdda
- Fix F821 flake8 warning (x47). · 2ab78325
  Aymeric Augustin authored Dec 21, 2019
```
Ignore warnings related to Python 2, because it's going away soon.
```
  2ab78325
- Fix E714 flake8 warning (x8). · fd2f17a7
  Aymeric Augustin authored Dec 21, 2019
  
  fd2f17a7
- Remove trailing whitespace from all Python files. · 28e608a2
  Aymeric Augustin authored Dec 21, 2019
```
Fixes flake8 warning W291 (x224).
```
  28e608a2
- Sort imports with isort. · 158e82e0
  Aymeric Augustin authored Dec 21, 2019
```
This is the result of:

    $ isort --recursive examples templates transformers utils hubconf.py setup.py
```
  158e82e0
21 Dec, 2019 1 commit

Reformat source code with black. · fa84ae26

Aymeric Augustin authored Dec 21, 2019

This is the result of:

    $ black --line-length 119 examples templates transformers utils hubconf.py setup.py

There's a lot of fairly long lines in the project. As a consequence, I'm
picking the longest widely accepted line length, 119 characters.

This is also Thomas' preference, because it allows for explicit variable
names, to make the code easier to understand.

fa84ae26

19 Dec, 2019 1 commit
- Fix albert example · 33adab2b
  Lysandre authored Dec 19, 2019
  
  33adab2b
02 Dec, 2019 1 commit
- Fix ALBERT exports with pretraining + sp classifier; Fix naming for ALBERT TF models · e85855f2
  Lysandre authored Dec 02, 2019
  
  e85855f2
29 Nov, 2019 1 commit
- update all tf.shape and tensor.shape to shape_list · adb5c79f
  thomwolf authored Nov 28, 2019
  
  adb5c79f