- 05 Jun, 2020 3 commits
-
-
Sylvain Gugger authored
* Add badges for models and docs
-
Sylvain Gugger authored
-
Sylvain Gugger authored
* Add model summary * Add link to pretrained models
-
- 04 Jun, 2020 1 commit
-
-
Sylvain Gugger authored
-
- 03 Jun, 2020 1 commit
-
-
Julien Chaumond authored
* [hf_api] Attach all unknown attributes for future-proof compatibility * [Pipeline] NerPipeline is really a TokenClassificationPipeline * modelcard.py: I don't think we need to force the download * Remove config, tokenizer from SUPPORTED_TASKS as we're moving to one model = one weight + one tokenizer * FillMaskPipeline: also output token in string form * TextClassificationPipeline: option to return all scores, not just the argmax * Update docs/source/main_classes/pipelines.rst
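A minimal sketch of the user-facing changes in this commit (hedged: parameter plumbing and output keys are assumed from this era's API; `return_all_scores` is taken as a pipeline keyword and the string-form token as a `token_str` field):

```python
from transformers import pipeline

# "ner" still works and now resolves to the renamed TokenClassificationPipeline.
ner = pipeline("ner")

# TextClassificationPipeline: optionally return a score per label, not just the argmax.
classifier = pipeline("sentiment-analysis", return_all_scores=True)
print(classifier("This library keeps getting better."))

# FillMaskPipeline: predictions now also carry the token in string form.
fill_mask = pipeline("fill-mask")
for prediction in fill_mask(f"Paris is the {fill_mask.tokenizer.mask_token} of France."):
    print(prediction["token_str"], prediction["score"])
```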
-
- 02 Jun, 2020 3 commits
-
-
Julien Chaumond authored
* 🐛 Fix model ids for BART and Flaubert
-
Lysandre authored
-
Julien Chaumond authored
* Kill model archive maps * Fixup * Also kill model_archive_map for MaskedBertPreTrainedModel * Unhook config_archive_map * Tokenizers: align with model id changes * make style && make quality * Fix CI
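With the archive maps killed, a checkpoint resolves straight from its model id, matching the one model = one weight + one tokenizer layout. A hedged sketch (`facebook/bart-large` used as an example of the new org/name ids):

```python
from transformers import AutoModel, AutoTokenizer

# No hardcoded archive map: the identifier is resolved on the model hub.
tokenizer = AutoTokenizer.from_pretrained("facebook/bart-large")
model = AutoModel.from_pretrained("facebook/bart-large")
```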
-
- 29 May, 2020 2 commits
-
-
Patrick von Platen authored
* better api * improve automatic setting of global attention mask * fix longformer bug * fix global attention mask in test * fix global attn mask flatten * fix slow tests * update docstring * update docs and make more robust * improve attention mask
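A sketch of the improved API (hedged: this assumes the separate `global_attention_mask` forward argument the "better api" bullet refers to; earlier code encoded global attention as 2s inside `attention_mask`):

```python
import torch
from transformers import LongformerModel, LongformerTokenizer

tokenizer = LongformerTokenizer.from_pretrained("allenai/longformer-base-4096")
model = LongformerModel.from_pretrained("allenai/longformer-base-4096")

inputs = tokenizer.encode_plus("Long documents need sparse attention.", return_tensors="pt")

# 1 marks tokens with global attention; if omitted, the model now sets a
# sensible default automatically.
global_attention_mask = torch.zeros_like(inputs["input_ids"])
global_attention_mask[:, 0] = 1  # e.g. global attention on the <s> token
outputs = model(inputs["input_ids"], global_attention_mask=global_attention_mask)
```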
-
Patrick von Platen authored
* add multiple choice for longformer * add models to docs * adapt docstring * add test to longformer * add longformer for mc in init and modeling auto * fix tests
-
- 27 May, 2020 1 commit
-
-
Lysandre Debut authored
* per_device instead of per_gpu/error thrown when argument unknown * [docs] Restore examples.md symlink * Correct absolute links so that symlink to the doc works correctly * Update src/transformers/hf_argparser.py Co-authored-by: Julien Chaumond <chaumond@gmail.com> * Warning + reorder * Docs * Style * not for squad Co-authored-by: Julien Chaumond <chaumond@gmail.com>
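The renamed arguments in use (a minimal sketch; `per_device_*` also covers TPU cores, which is the point of the rename):

```python
from transformers import TrainingArguments

# per_gpu_* variants now emit a warning; unknown CLI arguments raise an error
# instead of being silently dropped.
args = TrainingArguments(
    output_dir="./out",
    per_device_train_batch_size=8,
    per_device_eval_batch_size=8,
)
```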
-
- 26 May, 2020 1 commit
-
-
Patrick von Platen authored
* add new longformer for question answering model * add new config as well * fix links * fix links part 2
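A hedged usage sketch of the new question-answering head (assumes the TriviaQA checkpoint from the Longformer release and this era's tuple-style outputs):

```python
import torch
from transformers import LongformerForQuestionAnswering, LongformerTokenizer

model_id = "allenai/longformer-large-4096-finetuned-triviaqa"
tokenizer = LongformerTokenizer.from_pretrained(model_id)
model = LongformerForQuestionAnswering.from_pretrained(model_id)

question = "Who wrote Crime and Punishment?"
context = "Crime and Punishment is a novel by Fyodor Dostoevsky."
inputs = tokenizer.encode_plus(question, context, return_tensors="pt")

start_logits, end_logits = model(**inputs)[:2]  # tuple outputs in this era
start, end = torch.argmax(start_logits), torch.argmax(end_logits) + 1
print(tokenizer.decode(inputs["input_ids"][0][start:end]))
```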
-
- 25 May, 2020 1 commit
-
-
Patrick von Platen authored
* fix reformer num buckets * fix * adapt docs * set num buckets in config
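Where the setting now lives (a sketch; leaving `num_buckets` unset lets the model derive it from the sequence length at runtime, which is what the fix enables):

```python
from transformers import ReformerConfig

# num_buckets for LSH attention is a config field; None means "choose a good
# value based on the input length" instead of a wrong hardcoded one.
config = ReformerConfig(num_buckets=None, num_hashes=4)
```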
-
- 22 May, 2020 2 commits
-
-
Alexander Measure authored
changed from https://https://arxiv.org/abs/2001.04451.pdf to https://arxiv.org/abs/2001.04451.pdf
-
Lysandre authored
-
- 19 May, 2020 2 commits
-
-
Patrick von Platen authored
* add longformer docs * improve docs
-
Iz Beltagy authored
* first commit * bug fixes * better examples * undo padding * remove wrong VOCAB_FILES_NAMES * License * make style * make isort happy * unit tests * integration test * make `black` happy by undoing `isort` changes!! * lint * no need for the padding value * batch_size not bsz * remove unused type casting * seqlen not seq_len * staticmethod * `bert` selfattention instead of `n2` * uint8 instead of bool + lints * pad inputs_embeds using embeddings not a constant * black * unit test with padding * fix unit tests * remove redundant unit test * upload model weights * resolve todo * simpler _mask_invalid_locations without lru_cache + backward compatible masked_fill_ * increase unittest coverage
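A small configuration sketch of the sliding-window ("bert-style" local) self-attention this PR implements (hedged: `attention_window` accepts one value for all layers or one per layer):

```python
from transformers import LongformerConfig, LongformerModel

# One local attention window size per layer; inputs are padded internally to
# a multiple of the window using the pad token's embedding, not a constant.
config = LongformerConfig(attention_window=[64] * 12, num_hidden_layers=12)
model = LongformerModel(config)
```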
-
- 18 May, 2020 1 commit
-
-
Soham Chatterjee authored
-
- 13 May, 2020 3 commits
-
-
Lysandre authored
-
Sam Shleifer authored
[Marian Fixes] prevent predicting pad_token_id before softmax, support language codes, name multilingual models (#4290)
-
Patrick von Platen authored
* add first text for generation * add generation pipeline to usage * Created using Colaboratory * correct docstring * finish
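The kind of usage the new generation docs walk through; a minimal sketch with GPT-2 (model id and sampling parameters chosen for illustration):

```python
from transformers import AutoModelWithLMHead, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelWithLMHead.from_pretrained("gpt2")

input_ids = tokenizer.encode("Today the weather is really nice and I am", return_tensors="pt")
output = model.generate(input_ids, max_length=40, do_sample=True, top_k=50, top_p=0.95)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```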
-
- 11 May, 2020 5 commits
-
-
Lysandre Debut authored
-
Guo, Quan authored
"Migrating from pytorch-transformers to transformers" is missing in the main document. It is available in the main `readme` thought. Just move it to the document.
-
fgaim authored
* Add ALBERT to convert command of transformers-cli * Document ALBERT tf to pytorch model conversion
-
Stefan Schweter authored
* docs: fix link to token classification (NER) example * examples: fix links to NER scripts
-
Patrick von Platen authored
* adapt convert script * update convert script * finish * fix marian pretrained docs
-
- 10 May, 2020 2 commits
-
-
Sam Shleifer authored
- MarianSentencepieceTokenizer -> MarianTokenizer - Start using unk token. - add docs page - add better generation params to MarianConfig - more conversion utilities
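A hedged translation sketch showing the renamed tokenizer and the language codes (assumes a multilingual Helsinki-NLP checkpoint and this era's `prepare_translation_batch` helper, later superseded by calling the tokenizer directly):

```python
from transformers import MarianMTModel, MarianTokenizer

model_id = "Helsinki-NLP/opus-mt-en-ROMANCE"  # multilingual target side
tokenizer = MarianTokenizer.from_pretrained(model_id)
model = MarianMTModel.from_pretrained(model_id)

# ">>fr<<" selects French among the model's target languages.
src = [">>fr<< this is a sentence in english that we want to translate to french"]
batch = tokenizer.prepare_translation_batch(src)
translated = model.generate(**batch)
print(tokenizer.decode(translated[0], skip_special_tokens=True))
```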
-
Girishkumar authored
-
- 07 May, 2020 5 commits
-
-
Julien Chaumond authored
-
Julien Chaumond authored
* README * Update README.md
-
Lysandre authored
-
Julien Chaumond authored
* Created using Colaboratory * [examples] reorganize files * remove run_tpu_glue.py as superseded by TPU support in Trainer * Bugfix: int, not tuple * move files around
-
Patrick von Platen authored
* first copy & paste commit from Bert and Morgan's LSH code * add easy way to compare to trax original code * translate most of function * make trax lsh self attention deterministic with numpy seed + copy paste code * add same config * add same config * make layer init work * implemented hash_vectors function for lsh attention * continue reformer translation * hf LSHSelfAttentionLayer gives same output as trax layer * refactor code * refactor code * refactor code * refactor * refactor + add reformer config * delete bogus file * split reformer attention layer into two layers * save intermediate step * save intermediate step * make test work * add complete reformer block layer * finish reformer layer * implement causal and self mask * clean reformer test and refactor code * fix merge conflicts * fix merge conflicts * update init * fix device for GPU * fix chunk length init for tests * include Morgan's optimization * improve memory a bit * improve comment * factorize num_buckets * better testing parameters * make whole model work * make lm model work * add t5 copy paste tokenizer * add chunking feed forward * clean config * add improved assert statements * make tokenizer work * improve test * correct typo * extend config * add more complex test * add new axial position embeddings * add local block attention layer * clean tests * refactor * better testing * save intermediate progress * clean test file * make shorter input length work for model * allow variable input length * refactor * make forward pass for pretrained model work * add generation possibility * finish dropout and init * make style * refactor * add first version of RevNet Layers * make forward pass work and add convert file * make uploaded model forward pass work * make uploaded model forward pass work * refactor code * add namedtuples and cache buckets * correct head masks * refactor * made reformer more flexible * make style * remove set max length * add attention masks * fix up tests * fix lsh attention mask * make random seed optional for the moment * improve memory in reformer * add tests * make style * make sure masks work correctly * detach gradients * save intermediate * correct backprop through gather * make style * change back num hashes * rename to labels * fix rotation shape * fix detach * update * fix trainer * fix backward dropout * make reformer more flexible * fix conflict * fix * fix * add tests for fixed seed in reformer layer * fix trainer typo * fix typo in activations * add fp16 tests * add fp16 training * support fp16 * correct gradient bug in reformer * add fast gelu * re-add dropout for embedding dropout * better naming * better naming * renaming * finalize test branch * finalize tests * add more tests * finish tests * fix * fix typo in trainer * fix fp16 tests * fix tests * fix tests * fix tests * fix issue with dropout * fix dropout seeds * correct random seed on gpu * finalize random seed for dropout * finalize random seed for dropout * remove duplicate line * correct half precision bug * make style * refactor * refactor * docstring * remove sinusoidal position encodings for reformer * move chunking to modeling_utils * make style * clean config * make style * fix tests * fix auto tests * pretrained models * fix docstring * update conversion file * Update pretrained_models.rst * fix rst * fix rst * update copyright * fix test path * fix test path * fix small issue in test * include reformer in generation tests * add docs for axial position encoding * finish docs * Update convert_reformer_trax_checkpoint_to_pytorch.py * remove isort * include Sam's comments * remove wrong comment in utils * correct typos * fix typo * Update reformer.rst * applied Morgan's optimization * make style * make gpu compatible * remove bogus file * big test refactor * add example for chunking * fix typo * add to README
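A short sampling sketch against the checkpoint this PR uploads (hedged: class and checkpoint names as in the Reformer release of this era):

```python
from transformers import ReformerModelWithLMHead, ReformerTokenizer

tokenizer = ReformerTokenizer.from_pretrained("google/reformer-crime-and-punishment")
model = ReformerModelWithLMHead.from_pretrained("google/reformer-crime-and-punishment")

input_ids = tokenizer.encode("A few months later", return_tensors="pt")
output = model.generate(input_ids, do_sample=True, max_length=120)
print(tokenizer.decode(output[0]))
```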
-
- 01 May, 2020 1 commit
-
-
Stefan Schweter authored
-
- 28 Apr, 2020 2 commits
-
-
Patrick von Platen authored
* change encoder decoder style to bart & t5 style * make encoder decoder generation dummy work for bert * make style * clean init config in encoder decoder * add tests for encoder decoder models * refactor and add last tests * refactor and add last tests * fix attn masks for bert encoder decoder * make style * refactor prepare inputs for Bert * refactor * finish encoder decoder * correct typo * add docstring to config * finish * add tests * better naming * make style * fix flake8 * clean docstring * make style * rename
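A minimal sketch of the resulting BART/T5-style API (hedged: `bert-base-uncased` chosen for illustration; a real setup would fine-tune the tied model before generating):

```python
from transformers import BertTokenizer, EncoderDecoderModel

# Tie two pretrained BERTs into one seq2seq ("bert2bert") model.
tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = EncoderDecoderModel.from_encoder_decoder_pretrained(
    "bert-base-uncased", "bert-base-uncased"
)

input_ids = tokenizer.encode("Hello, my dog is cute", return_tensors="pt")
outputs = model(input_ids=input_ids, decoder_input_ids=input_ids)
```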
-
Patrick von Platen authored
-
- 27 Apr, 2020 1 commit
-
-
Lorenzo Ampil authored
* Fix typo in intro and add line under * Add missing blank line under * Correct types under
-
- 22 Apr, 2020 2 commits
-
-
Lorenzo Ampil authored
* Add GenerationPipeline * Fix parameter names * Correct __call__ parameters * Add model type attribute and correct function calls for prepare_input * Take out trailing commas from init attributes * Remove unnecessary tokenization line * Implement support for multiple text inputs * Apply generation support for multiple input text prompts * Take out tensor coercion * Take out batch index * Add text prompt to return sequence * Squeeze token tensor before decoding * Return only a single list of sequences if only one prompt was used * Correct results variable name * Add GenerationPipeline to SUPPORTED_TASKS with the alias , initialized with GPT2 * Registered AutoModelWithLMHead for both pt and tf * Update docstring for GenerationPipeline * Add kwargs parameter to model.generate * Take out kwargs parameter after all * Add generation pipeline example in pipeline docstring * Fix max length by squeezing tokens tensor * Apply ensure_tensor_on_device to pytorch tensor * Include generation step in torch.no_grad * Take out input from prepare_xlm_input and set 'en' as default xlm_language * Apply framework specific encoding during prepare_input * Format with make style * Move GenerationPipeline import to follow proper import sorting * Take out trailing comma from generation dict * Apply requested changes * Change name to TextGenerationPipeline * Apply TextGenerationPipeline rename to __init__ * Changing alias to * Set input mapping as input to ensure_tensor_on_device * Fix assertion placement * Add test_text_generation * Add TextGenerationPipeline to PipelineCommonTests * Take out whitespace * Format __init__ with black * Fix __init__ style * Format __init__ * Add line to end of __init__ * Correct model tokenizer set for test_text_generation * Ensure to return list of list, not list of string (to pass test) * Limit test models to only 3 to limit runtime to address circleCI timeout error * Update src/transformers/pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Update tests/test_pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Remove argument docstring, __init__, add additional __call__ arguments, and reformat results to list of dict * Fix blank result list * Add TextGenerationPipeline to pipelines.rst * Update src/transformers/pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Fix typos from adding PADDING_TEXT_TOKEN_LENGTH * Fix incorrectly moved result list * Update src/transformers/pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines.py * Update src/transformers/pipelines.py * Update src/transformers/pipelines.py * Update src/transformers/pipelines.py * Update src/transformers/pipelines.py * Update src/transformers/pipelines.py * Update src/transformers/pipelines.py * Update src/transformers/pipelines.py * Update src/transformers/pipelines.py * Update src/transformers/pipelines.py * Update src/transformers/pipelines.py * Update src/transformers/pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Add back generation line and make style * Take out blank whitespace * Apply new alias, text-generation, to test_pipelines * Fix text generation alias in test * Update src/transformers/pipelines.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Julien Chaumond <chaumond@gmail.com>
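The pipeline under its final alias (a minimal sketch; it defaults to GPT-2 as set up above):

```python
from transformers import pipeline

generator = pipeline("text-generation")
print(generator("As far as I am concerned, I will", max_length=30))
```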
-
Julien Chaumond authored
-
- 18 Apr, 2020 1 commit
-
-
Thomas Wolf authored
* First pass on utility classes and python tokenizers * finishing cleanup pass * style and quality * Fix tests * Updating following @mfuntowicz comment * style and quality * Fix Roberta * fix batch_size/seq_length in BatchEncoding * add alignment methods + tests * Fix OpenAI and Transfo-XL tokenizers * adding trim_offsets=True default for GPT2 and RoBERTa * style and quality * fix tests * add_prefix_space in roberta * bump up tokenizers to rc7 * style * unfortunately tensorflow doesn't like these - removing shape/seq_len for now * Update src/transformers/tokenization_utils.py Co-Authored-By: Stefan Schweter <stefan@schweter.it> * Adding doc and docstrings * making flake8 happy Co-authored-by: Stefan Schweter <stefan@schweter.it>
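A sketch of the alignment methods this refactor adds to BatchEncoding (hedged: offsets require a fast Rust-backed tokenizer):

```python
from transformers import BertTokenizerFast

tokenizer = BertTokenizerFast.from_pretrained("bert-base-uncased")
encoding = tokenizer.encode_plus("Hello world!", return_offsets_mapping=True)

print(encoding.tokens())           # wordpiece tokens
print(encoding["offset_mapping"])  # (start, end) character spans per token
```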
-