Commits · 8deff3acf2fd34e2a2161a4b833f1ff78a0d5d52 · chenpangpang / transformers

30 Mar, 2020 10 commits
- [bart-tiny-random] Put a 5MB model on S3 to allow faster exampl… (#3488) · 8deff3ac
  Sam Shleifer authored Mar 30, 2020
  
  8deff3ac
- [BART] Update encoder and decoder on set_input_embedding (#3501) · 1f728657
  dougian authored Mar 30, 2020
```
Co-authored-by: Ioannis Douratsos <ioannisd@amazon.com>
```
  1f728657
- [InputExample] Unfreeze for now, cf. #3423 · cc598b31
  Julien Chaumond authored Mar 30, 2020
  
  cc598b31
- Update the NER TF script (#3511) · d38bbb22
  Julien Plu authored Mar 30, 2020
```
* Update the NER TF script to remove the softmax and make the pad token label id to -1

* Reformat the quality and style
Co-authored-by: Julien Plu <julien.plu@adevinta.com>
```
  d38bbb22
- Re-pin isort version · eff757f2
  LysandreJik authored Mar 30, 2020
  
  eff757f2
- Un-pin isort for v2.7.0 pypi · a009d751
  LysandreJik authored Mar 30, 2020
  
  a009d751
- Release: v2.7.0 · 6f5a12a5
  LysandreJik authored Mar 30, 2020
  
  6f5a12a5
- fix lm lables in docstring (#3529) · 296252c4
  Patrick von Platen authored Mar 30, 2020
  
  296252c4
- [T5] make decoder input ids optional for t5 training (#3521) · 75ec6c9e
  Patrick von Platen authored Mar 30, 2020
```
* make decoder input ids optional for t5 training

* lm_lables should not be shifted in t5

* add tests

* finish shift right functionality for PT T5

* move shift right to correct class

* cleaner code

* replace -100 values with pad token id

* add assert statement

* remove unnecessary for loop

* make style
```
  75ec6c9e
- [T5] Add training documenation (#3507) · 5b44e0a3
  Patrick von Platen authored Mar 30, 2020
```
* Add clear description of how to train T5

* correct docstring in T5

* correct typo

* correct docstring format

* update t5 model docs

* implement collins feedback

* fix typo and add more explanation for sentinal tokens

* delete unnecessary todos
```
  5b44e0a3
29 Mar, 2020 2 commits
- [Docs] examples/summarization/bart: Simplify CNN/DM preprocessi… (#3516) · 33ef7002
  Sam Shleifer authored Mar 29, 2020
  
  33ef7002
- [BART] add bart-large-xsum weights (#3422) · f6a23d19
  Sam Shleifer authored Mar 29, 2020
  
  f6a23d19
27 Mar, 2020 10 commits

[model_cards]: use MIT license for all dbmdz models · 601ac5b1
Stefan Schweter authored Mar 27, 2020

601ac5b1

Fix circle ci flaky fail of wmt example (#3485) · 17dceae7

Patrick von Platen authored Mar 27, 2020

* force bleu

* fix wrong file name

* rename file

* different filenames for each example test

* test files should clean up after themselves

* test files should clean up after themselves

* do not force bleu

* correct typo

* fix isort

17dceae7

add summarization and translation to notebook (#3478) · 00ea100e
Patrick von Platen authored Mar 27, 2020

00ea100e

run_ner.py / bert-base-multilingual-cased can output empty tokens (#2991) · b08259a1

Funtowicz Morgan authored Mar 27, 2020



* Use tokenizer.num_added_tokens to count number of added special_tokens instead of hardcoded numbers.
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

* run_ner.py - Do not add a label to the labels_ids if word_tokens is empty.

This can happen when using bert-base-multilingual-cased with an input containing an unique space.
In this case, the tokenizer will output just an empty word_tokens thus leading to an non-consistent behavior
over the labels_ids tokens adding one more tokens than tokens vector.
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

b08259a1

Rename `t5-large` to `t5-base` in README.md · f4f49468
Patrick von Platen authored Mar 27, 2020

f4f49468

Add T5 to docs (#3461) · fa9af246

Patrick von Platen authored Mar 27, 2020

* add t5 docs basis

* improve docs

* add t5 docs

* improve t5 docstring

* add t5 tokenizer docstring

* finish docstring

* make style

* add pretrained models

* correct typo

* make examples work

* finalize docs

fa9af246

Add option to choose T5 model size. (#3480) · ff80b731
Lysandre Debut authored Mar 27, 2020
```
T5-small in test


isort
```
ff80b731
Correct indentation in docstring · e2c05f06
LysandreJik authored Mar 27, 2020
```
For some reason Sphinx extremely dislikes this and crashes.
```
e2c05f06
[Bart/Memory] Two separate, smaller decoder attention masks (#3371) · 3ee431dd
Sam Shleifer authored Mar 26, 2020

3ee431dd
Model Cards: Fix grammar error (#3467) · 53fe7338
Manuel Romero authored Mar 27, 2020

53fe7338

26 Mar, 2020 15 commits

[Bart: example] drop columns that are exclusively pad_token_id… (#3400) · c10decf7

Sam Shleifer authored Mar 26, 2020

* trim seq_len below 1024 if there are columns full of pad_token_id
* Centralize trim_batch so SummarizationDataset can use it too

c10decf7

[Bart/Memory] SelfAttention only returns weights if config.outp… (#3369) · 63f4d8ca
Sam Shleifer authored Mar 26, 2020

63f4d8ca
[Bart] Fix: put dummy_inputs on correct device (#3398) · 2b2a2f8d
Sam Shleifer authored Mar 26, 2020
```
* Dummy inputs to model.device

* Move self.device to ModuleUtilsMixin
```
2b2a2f8d
[Seq2Seq Generation] Call encoder before expanding input_ids (#3370) · 1a5aefc9
Sam Shleifer authored Mar 26, 2020

1a5aefc9
[Bart/Memory] don't create lm_head (#3323) · 39371ee4
Sam Shleifer authored Mar 26, 2020
```
* delete lm_head, skips weight tying
* Fixed s3
```
39371ee4

Add wmt translation example (#3428) · 5ad2ea06

Patrick von Platen authored Mar 26, 2020

* add translation example

* make style

* adapt docstring

* add gpu device as input for example

* small renaming

* better README

5ad2ea06

revert unpin isort commit · b4fb94fe
Patrick von Platen authored Mar 26, 2020

b4fb94fe

Add t5 summarization example (#3411) · e703e923

Patrick von Platen authored Mar 26, 2020

* rebase to master

* change tf to pytorch

* change to pytorch

* small fix

* renaming

* add gpu training possibility

* renaming

* improve README

* incoorporate collins feedback

* better Readme

* better README.md

e703e923

Add missing token classification for XLM (#3277) · 1a6c546c

sakares saengkaew authored Mar 26, 2020



* Add the missing token classification for XLM

* fix styling

* Add XLMForTokenClassification to AutoModelForTokenClassification class

* Fix docstring typo for non-existing class

* Add the missing token classification for XLM

* fix styling

* fix styling

* Add XLMForTokenClassification to AutoModelForTokenClassification class

* Fix docstring typo for non-existing class

* Add missing description for AlbertForTokenClassification

* fix styling

* Add missing docstring for AlBert

* Slow tests should be slow
Co-authored-by: Sakares Saengkaew <s.sakares@gmail.com>
Co-authored-by: LysandreJik <lysandre.debut@reseau.eseo.fr>

1a6c546c

rename string in pipeline · 31197054
Patrick von Platen authored Mar 26, 2020

31197054
Create card for model GPT-2-finetuned-CORD19 · 7420a6a9
Manuel Romero authored Mar 26, 2020

7420a6a9

Adds translation pipeline (#3419) · 022e8fab

Patrick von Platen authored Mar 26, 2020

* fix merge conflicts

* add t5 summarization example

* change parameters for t5 summarization

* make style

* add first code snippet for translation

* only add prefixes

* add prefix patterns

* make style

* renaming

* fix conflicts

* remove unused patterns

* solve conflicts

* fix merge conflicts

* remove translation example

* remove summarization example

* make sure tensors are in numpy for float comparsion

* re-add t5 config

* fix t5 import config typo

* make style

* remove unused numpy statements

* update doctstring

* import translation pipeline

022e8fab

Update model card huseinzol05/bert-base-bahasa-cased (#3425) · 3c5c5675
HUSEIN ZOLKEPLI authored Mar 26, 2020
```
* add bert bahasa readme

* update readme

* update readme

* added xlnet
```
3c5c5675

Add t5 to pipeline(task='summarization') (#3413) · 9c683ef0

Patrick von Platen authored Mar 26, 2020

* solve conflicts

* move warnings below

* incorporate changes

* add pad_to_max_length to pipelines

* add bug fix for T5 beam search

* add prefix patterns

* make style

* fix conflicts

* adapt pipelines for task specific parameters

* improve docstring

* remove unused patterns

9c683ef0

Force the return of token type IDs (#3439) · ffcffebe
Lysandre Debut authored Mar 26, 2020

ffcffebe

25 Mar, 2020 3 commits

Updated/added model cards (#3435) · 010e0460
Travis McGuire authored Mar 25, 2020

010e0460
Extend config with task specific configs. (#3433) · ffa17fe3
Patrick von Platen authored Mar 25, 2020
```
* add new default configs

* change prefix default to None
```
ffa17fe3

Experiment w/ dataclasses (including Py36) (#3423) · 83272a38

Julien Chaumond authored Mar 25, 2020

* [ci] Also run test_examples in py37

(will revert at the end of the experiment)

* InputExample: use immutable dataclass

* [deps] Install dataclasses for Py<3.7

* [skip ci] Revert "[ci] Also run test_examples in py37"

This reverts commit d29afd9959786b77759b0b8fa4e6b4335b952015.

83272a38