Commits · 5f50d619ddf7dbee24f7a63be8aba48b459545d9 · chenpangpang / transformers

11 May, 2020 4 commits
- Fix XTREME link + add number of eval documents + fix usage code (#4280) · 5f50d619
  Julien Plu authored May 11, 2020
  
  5f50d619
- fix reformer apex scaling issue (#4242) · 7751be7c
  theblackcat102 authored May 11, 2020
  
  7751be7c
- [Reformer] Add Enwiki8 Reformer Model - Adapt convert script (#4282) · ac7d5f67
  Patrick von Platen authored May 11, 2020
```
* adapt convert script

* update convert script

* finish

* fix marian pretrained docs
```
  ac7d5f67
- Reformer enwik8 - Model card (#4286) · 336116d9
  Patrick von Platen authored May 11, 2020
  
  336116d9
10 May, 2020 3 commits
- [docs] fix typo (#4249) · b290c32e
  flozi00 authored May 10, 2020
  
  b290c32e
- [Marian] documentation and AutoModel support (#4152) · 3487be75
  Sam Shleifer authored May 10, 2020
```
- MarianSentencepieceTokenizer - > MarianTokenizer
- Start using unk token.
- add docs page
- add better generation params to MarianConfig
- more conversion utilities
```
  3487be75
- [README] Corrected some grammatical mistakes (#4199) · 9d2f467b
  Girishkumar authored May 10, 2020
  
  9d2f467b
08 May, 2020 9 commits
- [TPU] Doc, fix xla_spawn.py, only preprocess dataset once (#4223) · 7b75aa9f
  Julien Chaumond authored May 08, 2020
```
* [TPU] Doc, fix xla_spawn.py, only preprocess dataset once

* Update examples/README.md

* [xla_spawn] Add `_mp_fn` to other Trainer scripts

* [TPU] Fix: eval dataloader was None
```
  7b75aa9f
- Fix #4098 · 274d850d
  Julien Chaumond authored May 08, 2020
  
  274d850d
- example updated to use generation pipeline (#4230) · 26dad0a9
  Lorenzo De Mattei authored May 08, 2020
```
* example updated to use generation pipeline

* Update model_cards/LorenzoDeMattei/GePpeTto/README.md
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
```
  26dad0a9
- Model card for allegro/herbert-klej-cased-tokenizer-v1 (#4184) · 9ebb5b2a
  rmroczkowski authored May 08, 2020
  
  9ebb5b2a
- Model card for allegro/herbert-klej-cased-v1 (#4183) · 9e54efd0
  rmroczkowski authored May 08, 2020
  
  9e54efd0
- Model card for spanish electra small (#4196) · a8b798e6
  Manuel Romero authored May 08, 2020
  
  a8b798e6
- Create README.md (#4132) · 242005d7
  Savaş Yıldırım authored May 08, 2020
```
* Create README.md

* Adding code fence around code block
```
  242005d7
- Create README.md (#4179) · 5940c73b
  Manuel Romero authored May 08, 2020
```
model card for my De Novo Drug discovery model using MLM
```
  5940c73b
- [Pipeline, Generation] tf generation pipeline bug (#4217) · cf08830c
  Patrick von Platen authored May 08, 2020
```
* fix PR

* move tests to correct place
```
  cf08830c
07 May, 2020 17 commits

Add AlbertForPreTraining and TFAlbertForPreTraining models. (#4057) · 8bf73126

Jared T Nielsen authored May 07, 2020



* Add AlbertForPreTraining and TFAlbertForPreTraining models.

* PyTorch conversion

* TensorFlow conversion

* style
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>

8bf73126

[doc] Fix broken links + remove crazy big notebook · c99fe038
Julien Chaumond authored May 07, 2020

c99fe038
Create README.md (#4202) · 66113bd6
Savaş Yıldırım authored May 08, 2020

66113bd6
[examples] Add column for pytorch-lightning support · 6669915b
Julien Chaumond authored May 07, 2020

6669915b
Examples readme.md (#4215) · 612fa1b1
Julien Chaumond authored May 07, 2020
```
* README

* Update README.md
```
612fa1b1
Pin isort and tf <= 2.1.0 · 2e578243
Lysandre authored May 07, 2020

2e578243
Release: v2.9.0 · e7cfc1a3
Lysandre authored May 07, 2020

e7cfc1a3

BIG Reorganize examples (#4213) · 0ae96ff8

Julien Chaumond authored May 07, 2020

* Created using Colaboratory

* [examples] reorganize files

* remove run_tpu_glue.py as superseded by TPU support in Trainer

* Bugfix: int, not tuple

* move files around

0ae96ff8

[Trainer] Ability to specify optimizer/scheduler at init · cafa6a9e
Julien Chaumond authored May 07, 2020
```
cc @patrickvonplaten @thomwolf
```
cafa6a9e
Use with_extension to change the extension (#4203) · e4fd5e39
Bram Vanroy authored May 07, 2020
```
As per https://github.com/huggingface/transformers/pull/3934#discussion_r421307659
```
e4fd5e39

Tpu trainer (#4146) · ebf80e2e

Lysandre Debut authored May 07, 2020



* wip

* wip

* a last wip

* Better logging when using TPUs

* Correct argument name

* Tests

* fix

* Metrics in evaluation

* Update src/transformers/training_args.py

* [tpu] Use launcher script instead

* [tpu] lots of tweaks

* Fix formatting
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

ebf80e2e

Ensure fast tokenizer can construct tensor without pad token if only one... · 026097b9
Funtowicz Morgan authored May 07, 2020
```
Ensure fast tokenizer can construct tensor without pad token if only one sample is provided. (#4201)
```
026097b9

Rewritten batch support in pipelines. (#4154) · 0a6cbea0

Funtowicz Morgan authored May 07, 2020



* Rewritten batch support in pipelines.
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

* Fix imports sorting 🔧

Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

* Set pad_to_max_length=True by default on Pipeline.

* Set pad_to_max_length=False for generation pipelines.

Most of generation models doesn't have padding token.

* Address @joeddav review comment: Uniformized *args.
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

* Address @joeddav review comment: Uniformized *args (second).
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

0a6cbea0

fix examples (#4192) · 99d1a694
Patrick von Platen authored May 07, 2020

99d1a694
[Reformer] Fix example and error message (#4191) · 74ffc9ea
Patrick von Platen authored May 07, 2020
```
* fix example reformer

* fix error message and example docstring

* improved error message
```
74ffc9ea
fix docstring reformer (#4190) · 96c78396
Patrick von Platen authored May 07, 2020

96c78396

Reformer (#3351) · dca34695

Patrick von Platen authored May 07, 2020

* first copy & past commit from Bert and morgans LSH code

* add easy way to compare to trax original code

* translate most of function

* make trax lsh self attention deterministic with numpy seed + copy paste code

* add same config

* add same config

* make layer init work

* implemented hash_vectors function for lsh attention

* continue reformer translation

* hf LSHSelfAttentionLayer gives same output as trax layer

* refactor code

* refactor code

* refactor code

* refactor

* refactor + add reformer config

* delete bogus file

* split reformer attention layer into two layers

* save intermediate step

* save intermediate step

* make test work

* add complete reformer block layer

* finish reformer layer

* implement causal and self mask

* clean reformer test and refactor code

* fix merge conflicts

* fix merge conflicts

* update init

* fix device for GPU

* fix chunk length init for tests

* include morgans optimization

* improve memory a bit

* improve comment

* factorize num_buckets

* better testing parameters

* make whole model work

* make lm model work

* add t5 copy paste tokenizer

* add chunking feed forward

* clean config

* add improved assert statements

* make tokenizer work

* improve test

* correct typo

* extend config

* add complexer test

* add new axial position embeddings

* add local block attention layer

* clean tests

* refactor

* better testing

* save intermediate progress

* clean test file

* make shorter input length work for model

* allow variable input length

* refactor

* make forward pass for pretrained model work

* add generation possibility

* finish dropout and init

* make style

* refactor

* add first version of RevNet Layers

* make forward pass work and add convert file

* make uploaded model forward pass work

* make uploaded model forward pass work

* refactor code

* add namedtuples and cache buckets

* correct head masks

* refactor

* made reformer more flexible

* make style

* remove set max length

* add attention masks

* fix up tests

* fix lsh attention mask

* make random seed optional for the moment

* improve memory in reformer

* add tests

* make style

* make sure masks work correctly

* detach gradients

* save intermediate

* correct backprob through gather

* make style

* change back num hashes

* rename to labels

* fix rotation shape

* fix detach

* update

* fix trainer

* fix backward dropout

* make reformer more flexible

* fix conflict

* fix

* fix

* add tests for fixed seed in reformer layer

* fix trainer typo

* fix typo in activations

* add fp16 tests

* add fp16 training

* support fp16

* correct gradient bug in reformer

* add fast gelu

* re-add dropout for embedding dropout

* better naming

* better naming

* renaming

* finalize test branch

* finalize tests

* add more tests

* finish tests

* fix

* fix type trainer

* fix fp16 tests

* fix tests

* fix tests

* fix tests

* fix issue with dropout

* fix dropout seeds

* correct random seed on gpu

* finalize random seed for dropout

* finalize random seed for dropout

* remove duplicate line

* correct half precision bug

* make style

* refactor

* refactor

* docstring

* remove sinusoidal position encodings for reformer

* move chunking to modeling_utils

* make style

* clean config

* make style

* fix tests

* fix auto tests

* pretrained models

* fix docstring

* update conversion file

* Update pretrained_models.rst

* fix rst

* fix rst

* update copyright

* fix test path

* fix test path

* fix small issue in test

* include reformer in generation tests

* add docs for axial position encoding

* finish docs

* Update convert_reformer_trax_checkpoint_to_pytorch.py

* remove isort

* include sams comments

* remove wrong comment in utils

* correct typos

* fix typo

* Update reformer.rst

* applied morgans optimization

* make style

* make gpu compatible

* remove bogus file

* big test refactor

* add example for chunking

* fix typo

* add to README

dca34695

06 May, 2020 7 commits

change order pytorch/tf in readme (#4167) · 877fc564
Clement authored May 06, 2020

877fc564

TF version of the trainer (#4017) · aad50151

Julien Plu authored May 06, 2020

* First commit to add a TF version of the trainer.

* Make the TF trainer closer to what looks the PT trainer

* Refactoring common code between the PT and TF trainer into an util file.

* Some bugfix + better similarity with the PT trainer

* Add missing class in transformers init

* Bugfix over prediction + use classification report instead of simple metrics

* Fix name error

* Fix optimization tests + style

* Apply style

* Several bugfix for multi-gpu training

* Apply style

* Apply style

* Add glue example for the TF trainer

* Several bugix + address the reviews

* Fix on the TF training args file

* Add a debug mode

* Bugfix in utils_ner.py when segment_ids is None

* Apply style

* Apply style

* Add TPU strategy

* Fix selection strategy

aad50151

Fix overwrite_cache behaviour for pytorch lightning examples (#4093) · 25296b12
Simone Primarosa authored May 06, 2020

25296b12
Include ElectraPreTrainedModel into __init__ (#4173) · 9972562d
kumapo authored May 07, 2020

9972562d
Camembert-large-fquad model card (#4143) · ff8ed52d
martindh authored May 06, 2020
```
Description for the model card describing the camembert-large-fquad model.
```
ff8ed52d
Add model card for the NER model (#4162) · 4c3be2e7
Julien Plu authored May 06, 2020

4c3be2e7
Fix markdown to show the results table properly (#4119) · 17ae0363
Manuel Romero authored May 06, 2020

17ae0363