- 31 Jan, 2020 18 commits
- Julien Chaumond authored
  Co-Authored-By: Stefan Schweter <stefan-it@users.noreply.github.com>
- Julien Chaumond authored
  Co-Authored-By: HenrykBorzymowski <henrykborzymowski@users.noreply.github.com>
- Julien Chaumond authored
  Co-Authored-By: Loreto Parisi <loretoparisi@gmail.com>
  Co-Authored-By: Simone Francia <francia.simone1@gmail.com>
- Julien Chaumond authored
- Lysandre authored
- Lysandre authored
  cc @julien-c
- Lysandre authored
- Lysandre authored
  The FlauBERT configuration class inherits from XLMConfig, so a FlaubertConfig is recognized as an XLMConfig when loading through the Auto classes, because XLMConfig is checked before FlaubertConfig. Checking FlaubertConfig first solves the problem, but a test should be added.
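A minimal sketch of the dispatch-order issue, assuming an ordered table scanned with isinstance checks; the class and function names are illustrative stand-ins, not the library's actual code:

```python
# Sketch: when dispatch scans an ordered table with isinstance, a subclass
# entry must come before its parent, or the parent entry shadows it.

class XLMConfig:
    pass

class FlaubertConfig(XLMConfig):  # FlaubertConfig subclasses XLMConfig
    pass

class XLMModel:
    pass

class FlaubertModel:
    pass

def model_class_for(config, mapping):
    for config_class, model_class in mapping:
        if isinstance(config, config_class):
            return model_class
    raise ValueError(f"Unrecognized config: {config!r}")

# Parent listed first: every FlaubertConfig matches the XLM entry.
broken = [(XLMConfig, XLMModel), (FlaubertConfig, FlaubertModel)]
# Subclass listed first: dispatch resolves correctly.
fixed = [(FlaubertConfig, FlaubertModel), (XLMConfig, XLMModel)]

config = FlaubertConfig()
assert model_class_for(config, broken) is XLMModel      # wrong model picked
assert model_class_for(config, fixed) is FlaubertModel  # correct model
```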
- Lysandre authored
- Arnaud authored
- Lysandre authored
- Lysandre authored
- Lysandre authored
- Julien Chaumond authored
  * [Umberto] model shortcuts cc @loretoparisi @simonefrancia see #2485
  * Ensure that tokenizers will be correctly configured
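A hedged usage sketch for the new shortcuts, loading an UmBERTo checkpoint through the Auto classes; the model identifier below is an assumption for illustration and is not confirmed by the commit itself:

```python
# Usage sketch for the UmBERTo model shortcuts; the identifier below is an
# assumed example name, check the model hub for the exact shortcuts.
from transformers import AutoModel, AutoTokenizer

model_name = "Musixmatch/umberto-commoncrawl-cased-v1"  # assumed shortcut name
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModel.from_pretrained(model_name)
```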
- Julien Chaumond authored
- Julien Chaumond authored
- Julien Chaumond authored
- Julien Chaumond authored
  cc @lysandrejik
- 30 Jan, 2020 11 commits
- Jared Nielsen authored
- Lysandre authored
- Julien Chaumond authored
  * fill_mask helper
  * [poc] FillMaskPipeline
  * Revert "[poc] FillMaskPipeline"
    This reverts commit 67eeea55b0f97b46c2b828de0f4ee97d87338335.
  * Revert "fill_mask helper"
    This reverts commit cacc17b884e14bb6b07989110ffe884ad9e36eaa.
  * README: clarify that Pipelines can also do text-classification
    cf. question at the AI&ML meetup last week, @mfuntowicz
  * Fix test: test feature-extraction pipeline
  * Test tweaks
  * Slight refactor of existing pipeline (in preparation of new FillMaskPipeline)
  * Extraneous doc
  * More robust way of doing this @mfuntowicz as we don't rely on the model name anymore (see AutoConfig)
  * Also add RobertaConfig as a quickfix for wrong token_type_ids
  * cs
  * [BIG] FillMaskPipeline
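A short usage sketch of the new FillMaskPipeline through the pipeline factory; the example sentence and printed output keys are illustrative, and the mask token ("<mask>" here) depends on the tokenizer of the underlying model:

```python
# Sketch: fill-mask pipeline usage. The default model and the exact output
# keys may vary by version; treat this as an illustration, not a spec.
from transformers import pipeline

fill_mask = pipeline("fill-mask")
predictions = fill_mask("HuggingFace is creating a <mask> that the community uses.")
for prediction in predictions:
    # Each candidate comes with the completed sequence and a confidence score.
    print(prediction["sequence"], prediction["score"])
```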
- Hang Le authored
- Lysandre authored
- Lysandre authored
- Lysandre authored
- Lysandre authored
- Lysandre authored
- Hang Le authored
- Peter Izsak authored
- 29 Jan, 2020 11 commits
- Bram Vanroy authored
  Requesting pad_token_id triggers an error message when it is None; use the private _pad_token instead.
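A minimal sketch of the pattern this commit describes, assuming a public property that logs an error when the pad token is unset; the class below is an illustrative stand-in, not the library's actual code:

```python
import logging

logger = logging.getLogger(__name__)

class Tokenizer:
    """Illustrative stand-in for the tokenizer pattern described above."""

    def __init__(self, pad_token=None):
        self._pad_token = pad_token  # legitimately None for e.g. gpt2

    @property
    def pad_token(self):
        # The public accessor complains when no pad token is configured.
        if self._pad_token is None:
            logger.error("Using pad_token, but it is not set yet.")
        return self._pad_token

# Internal code that only needs to know whether a pad token exists can read
# the private attribute and avoid triggering the error message:
tok = Tokenizer()
has_pad_token = tok._pad_token is not None  # quiet check, no error logged
```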
- BramVanroy authored
  In batch_encode_plus we have to ensure that the tokenizer has a pad_token_id so that, when padding, no None values are added as padding. That would otherwise happen with gpt2, openai and transfoxl. Closes https://github.com/huggingface/transformers/issues/2640
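A hedged sketch of the guard described above, not the library's exact code; pad_batch is a hypothetical helper:

```python
def pad_batch(sequences, pad_token_id, max_length):
    """Right-pad each id sequence to max_length with pad_token_id."""
    if pad_token_id is None:
        # Without this guard, padding with None would corrupt the batch for
        # models that ship without a pad token (gpt2, openai, transfoxl).
        raise ValueError(
            "The tokenizer has no pad_token_id; set a pad token "
            "(e.g. tokenizer.pad_token = tokenizer.eos_token) before padding."
        )
    return [ids + [pad_token_id] * (max_length - len(ids)) for ids in sequences]

# Example: pad two sequences of token ids to a common length.
padded = pad_batch([[5, 7], [5, 7, 9]], pad_token_id=0, max_length=3)
assert padded == [[5, 7, 0], [5, 7, 9]]
```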
- Lysandre authored
- Lysandre authored
- Jared Nielsen authored
- Lysandre authored
- Julien Plu authored
- Julien Plu authored
- Julien Plu authored
- Julien Plu authored
- Lysandre authored