Commits · ee5de0ba449d638da704e1c03ffcc20a930f5589 · chenpangpang / transformers

11 Feb, 2020 1 commit

BERT decoder: Fix causal mask dtype. · ee5de0ba

Oleksiy Syvokon authored Feb 06, 2020

PyTorch < 1.3 requires multiplication operands to be of the same type.
This was violated when using default attention mask (i.e.,
attention_mask=None in arguments) given BERT in the decoder mode.

In particular, this was breaking Model2Model and made tutorial
from the quickstart failing.

ee5de0ba

07 Feb, 2020 2 commits
- omission · d8b43600
  VictorSanh authored Feb 07, 2020
  
  d8b43600
- distilbert-base-cased weights + Readmes + omissions · ee5a6856
  VictorSanh authored Feb 07, 2020
  
  ee5a6856
04 Feb, 2020 8 commits
- RoBERTa TensorFlow Tests · 2184f870
  Lysandre authored Feb 03, 2020
  
  2184f870
- Correct slow test · e615269c
  Lysandre authored Feb 03, 2020
  
  e615269c
- Style · 5f96ebc0
  Lysandre authored Feb 03, 2020
  
  5f96ebc0
- Flaubert PyTorch tests · 950c6a4f
  Lysandre authored Feb 03, 2020
  
  950c6a4f
- RoBERTa Pytorch tests · d28b81dc
  Lysandre authored Feb 03, 2020
  
  d28b81dc
- fix default getattr · 9e5b549b
  sshleifer authored Feb 04, 2020
  
  9e5b549b
- double quotes · 25848a60
  sshleifer authored Feb 04, 2020
  
  25848a60
- minor cleanup of test_attention_outputs · cbcb83f2
  sshleifer authored Feb 03, 2020
  
  cbcb83f2
31 Jan, 2020 1 commit
- Flaubert auto tokenizer + tests · 1e82cd84
  Lysandre authored Jan 31, 2020
```
cc @julien-c
```
  1e82cd84
30 Jan, 2020 2 commits

fill_mask helper (#2576) · 9fa836a7

Julien Chaumond authored Jan 30, 2020

* fill_mask helper

* [poc] FillMaskPipeline

* Revert "[poc] FillMaskPipeline"

This reverts commit 67eeea55b0f97b46c2b828de0f4ee97d87338335.

* Revert "fill_mask helper"

This reverts commit cacc17b884e14bb6b07989110ffe884ad9e36eaa.

* README: clarify that Pipelines can also do text-classification

cf. question at the AI&ML meetup last week, @mfuntowicz

* Fix test: test feature-extraction pipeline

* Test tweaks

* Slight refactor of existing pipeline (in preparation of new FillMaskPipeline)

* Extraneous doc

* More robust way of doing this

@mfuntowicz as we don't rely on the model name anymore (see AutoConfig)

* Also add RobertaConfig as a quickfix for wrong token_type_ids

* cs

* [BIG] FillMaskPipeline

9fa836a7

Rename test_examples to test_doc_samples · df27648b
Lysandre authored Jan 30, 2020

df27648b

29 Jan, 2020 2 commits
- Style · e63a81dd
  Lysandre authored Jan 29, 2020
  
  e63a81dd
- Copy object instead of passing the reference · 21734901
  Lysandre authored Jan 29, 2020
  
  21734901
28 Jan, 2020 1 commit
- Absolute definitive HeisenDistilBug solve · ea2600bd
  Lysandre authored Jan 27, 2020
```
cc @julien-c @thomwolf
```
  ea2600bd
27 Jan, 2020 2 commits
- Add AutoModelForPreTraining · 0e31e06a
  thomwolf authored Jan 24, 2020
  
  0e31e06a
- Definitive HeisenDistilBug fix · 875c4ae4
  Lysandre authored Jan 27, 2020
```
cc @julien-c @@thomwolf
```
  875c4ae4
23 Jan, 2020 6 commits
- Run the examples in slow · 24d5ad1d
  Lysandre authored Jan 22, 2020
  
  24d5ad1d
- Flake8 violation · f81b6c95
  Lysandre authored Jan 15, 2020
  
  f81b6c95
- Can test examples spread over multiple blocks · 632675ea
  Lysandre authored Jan 14, 2020
  
  632675ea
- Require Torch when testing examples · eaa6b9af
  Lysandre authored Jan 14, 2020
  
  eaa6b9af
- Multi-line examples can be tested + ALBERT patch for CircleCI · 64abd3e0
  Lysandre authored Jan 14, 2020
```
All tests should now work fine.
```
  64abd3e0
- Automatic testing of examples · 83757725
  Lysandre authored Jan 14, 2020
```
The CircleCI test should fail.
```
  83757725
17 Jan, 2020 1 commit
- Fix BasicTokenizer to respect `never_split` parameters (#2557) · 65a89a89
  Mark Neumann authored Jan 17, 2020
```
* add failing test

* fix call to _run_split_on_punc

* format with black
```
  65a89a89
16 Jan, 2020 3 commits
- Tokenizer.from_pretrained: fetch all possible files remotely · 23a2cea8
  Julien Chaumond authored Jan 15, 2020
  
  23a2cea8
- tokenizer.save_pretrained: only save file if non-empty · 9d8fd2d4
  Julien Chaumond authored Jan 15, 2020
  
  9d8fd2d4
- Fix failing torchscript test for xlnet · d9fa1bad
  Julien Chaumond authored Jan 15, 2020
```
model.parameters() order is apparently not stable (only for xlnet, for some reason)
```
  d9fa1bad
15 Jan, 2020 3 commits
- 💄 super · 83a41d39
  Julien Chaumond authored Jan 15, 2020
  
  83a41d39
- Graduate sst-2 to a canonical one · eb59e9f7
  Julien Chaumond authored Jan 15, 2020
  
  eb59e9f7
- Close #2392 · e184ad13
  Julien Chaumond authored Jan 15, 2020
  
  e184ad13
14 Jan, 2020 5 commits
- Bias should be resized with the weights · 100e3b6f
  Lysandre authored Jan 14, 2020
```
Created a link between the linear layer bias and the model attribute bias. This does not change anything for the user nor for the conversion scripts, but allows the `resize_token_embeddings` method to resize the bias as well as the weights of the decoder.

Added a test.
```
  100e3b6f
- Update test_tokenization_auto.py · 764f836d
  Julien Chaumond authored Jan 13, 2020
  
  764f836d
- Update test_tokenization_auto.py · d5831acb
  Julien Chaumond authored Jan 13, 2020
  
  d5831acb
- Update test_tokenization_auto.py · ed6cd597
  Julien Chaumond authored Jan 13, 2020
  
  ed6cd597
- Update test_tokenization_auto.py · 5cb463a7
  Julien Chaumond authored Jan 13, 2020
  
  5cb463a7
13 Jan, 2020 2 commits
- Map configs to models and tokenizers · 03046285
  Julien Chaumond authored Jan 13, 2020
  
  03046285
- [tests] Safety checks on CONFIG_MAPPING · 1fc855e4
  Julien Chaumond authored Jan 13, 2020
  
  1fc855e4
11 Jan, 2020 1 commit
- More AutoConfig tests · cf8a70bf
  Julien Chaumond authored Jan 11, 2020
  
  cf8a70bf