Commits · 3487be75ef8f0d51c51208f266644fc04a947085 · chenpangpang / transformers

10 May, 2020 1 commit

[Marian] documentation and AutoModel support (#4152) · 3487be75

Sam Shleifer authored May 10, 2020

- MarianSentencepieceTokenizer - > MarianTokenizer
- Start using unk token.
- add docs page
- add better generation params to MarianConfig
- more conversion utilities

3487be75

07 May, 2020 1 commit

Reformer (#3351) · dca34695

Patrick von Platen authored May 07, 2020

* first copy & past commit from Bert and morgans LSH code

* add easy way to compare to trax original code

* translate most of function

* make trax lsh self attention deterministic with numpy seed + copy paste code

* add same config

* add same config

* make layer init work

* implemented hash_vectors function for lsh attention

* continue reformer translation

* hf LSHSelfAttentionLayer gives same output as trax layer

* refactor code

* refactor code

* refactor code

* refactor

* refactor + add reformer config

* delete bogus file

* split reformer attention layer into two layers

* save intermediate step

* save intermediate step

* make test work

* add complete reformer block layer

* finish reformer layer

* implement causal and self mask

* clean reformer test and refactor code

* fix merge conflicts

* fix merge conflicts

* update init

* fix device for GPU

* fix chunk length init for tests

* include morgans optimization

* improve memory a bit

* improve comment

* factorize num_buckets

* better testing parameters

* make whole model work

* make lm model work

* add t5 copy paste tokenizer

* add chunking feed forward

* clean config

* add improved assert statements

* make tokenizer work

* improve test

* correct typo

* extend config

* add complexer test

* add new axial position embeddings

* add local block attention layer

* clean tests

* refactor

* better testing

* save intermediate progress

* clean test file

* make shorter input length work for model

* allow variable input length

* refactor

* make forward pass for pretrained model work

* add generation possibility

* finish dropout and init

* make style

* refactor

* add first version of RevNet Layers

* make forward pass work and add convert file

* make uploaded model forward pass work

* make uploaded model forward pass work

* refactor code

* add namedtuples and cache buckets

* correct head masks

* refactor

* made reformer more flexible

* make style

* remove set max length

* add attention masks

* fix up tests

* fix lsh attention mask

* make random seed optional for the moment

* improve memory in reformer

* add tests

* make style

* make sure masks work correctly

* detach gradients

* save intermediate

* correct backprob through gather

* make style

* change back num hashes

* rename to labels

* fix rotation shape

* fix detach

* update

* fix trainer

* fix backward dropout

* make reformer more flexible

* fix conflict

* fix

* fix

* add tests for fixed seed in reformer layer

* fix trainer typo

* fix typo in activations

* add fp16 tests

* add fp16 training

* support fp16

* correct gradient bug in reformer

* add fast gelu

* re-add dropout for embedding dropout

* better naming

* better naming

* renaming

* finalize test branch

* finalize tests

* add more tests

* finish tests

* fix

* fix type trainer

* fix fp16 tests

* fix tests

* fix tests

* fix tests

* fix issue with dropout

* fix dropout seeds

* correct random seed on gpu

* finalize random seed for dropout

* finalize random seed for dropout

* remove duplicate line

* correct half precision bug

* make style

* refactor

* refactor

* docstring

* remove sinusoidal position encodings for reformer

* move chunking to modeling_utils

* make style

* clean config

* make style

* fix tests

* fix auto tests

* pretrained models

* fix docstring

* update conversion file

* Update pretrained_models.rst

* fix rst

* fix rst

* update copyright

* fix test path

* fix test path

* fix small issue in test

* include reformer in generation tests

* add docs for axial position encoding

* finish docs

* Update convert_reformer_trax_checkpoint_to_pytorch.py

* remove isort

* include sams comments

* remove wrong comment in utils

* correct typos

* fix typo

* Update reformer.rst

* applied morgans optimization

* make style

* make gpu compatible

* remove bogus file

* big test refactor

* add example for chunking

* fix typo

* add to README

dca34695

28 Apr, 2020 1 commit

Clean Encoder-Decoder models with Bart/T5-like API and add generate possibility (#3383) · fa49b9af

Patrick von Platen authored Apr 28, 2020

* change encoder decoder style to bart & t5 style

* make encoder decoder generation dummy work for bert

* make style

* clean init config in encoder decoder

* add tests for encoder decoder models

* refactor and add last tests

* refactor and add last tests

* fix attn masks for bert encoder decoder

* make style

* refactor prepare inputs for Bert

* refactor

* finish encoder decoder

* correct typo

* add docstring to config

* finish

* add tests

* better naming

* make style

* fix flake8

* clean docstring

* make style

* rename

fa49b9af

16 Apr, 2020 1 commit

[Docs] Add DialoGPT (#3755) · d22894df

Patrick von Platen authored Apr 16, 2020



* add dialoGPT

* update README.md

* fix conflict

* update readme

* add code links to docs

* Update README.md

* Update dialo_gpt2.rst

* Update pretrained_models.rst

* Update docs/source/model_doc/dialo_gpt2.rst
Co-Authored-By: Julien Chaumond <chaumond@gmail.com>

* change filename of dialogpt
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

d22894df

03 Apr, 2020 1 commit

ELECTRA (#3257) · d5d7d886

Lysandre Debut authored Apr 03, 2020

* Electra wip

* helpers

* Electra wip

* Electra v1

* ELECTRA may be saved/loaded

* Generator & Discriminator

* Embedding size instead of halving the hidden size

* ELECTRA Tokenizer

* Revert BERT helpers

* ELECTRA Conversion script

* Archive maps

* PyTorch tests

* Start fixing tests

* Tests pass

* Same configuration for both models

* Compatible with base + large

* Simplification + weight tying

* Archives

* Auto + Renaming to standard names

* ELECTRA is uncased

* Tests

* Slight API changes

* Update tests

* wip

* ElectraForTokenClassification

* temp

* Simpler arch + tests

Removed ElectraForPreTraining which will be in a script

* Conversion script

* Auto model

* Update links to S3

* Split ElectraForPreTraining and ElectraForTokenClassification

* Actually test PreTraining model

* Remove num_labels from configuration

* wip

* wip

* From discriminator and generator to electra

* Slight API changes

* Better naming

* TensorFlow ELECTRA tests

* Accurate conversion script

* Added to conversion script

* Fast ELECTRA tokenizer

* Style

* Add ELECTRA to README

* Modeling Pytorch Doc + Real style

* TF Docs

* Docs

* Correct links

* Correct model intialized

* random fixes

* style

* Addressing Patrick's and Sam's comments

* Correct links in docs

d5d7d886

27 Mar, 2020 1 commit

Add T5 to docs (#3461) · fa9af246

Patrick von Platen authored Mar 27, 2020

* add t5 docs basis

* improve docs

* add t5 docs

* improve t5 docstring

* add t5 tokenizer docstring

* finish docstring

* make style

* add pretrained models

* correct typo

* make examples work

* finalize docs

fa9af246

02 Mar, 2020 1 commit

Pipeline doc (#3055) · d3eb7d23

Lysandre Debut authored Mar 02, 2020

* Pipeline doc initial commit

* pipeline abstraction

* Remove modelcard argument from pipeline

* Task-specific pipelines can be instantiated with no model or tokenizer

* All pipelines doc

d3eb7d23

25 Feb, 2020 1 commit

Adding usage examples for common tasks (#2850) · 65e7c90a

Lysandre Debut authored Feb 25, 2020

* Usage: Sequence Classification & Question Answering

* Pipeline example

* Language modeling

* TensorFlow code for Sequence classification

* Custom TF/PT toggler in docs

* QA + LM for TensorFlow

* Finish Usage for both PyTorch and TensorFlow

* Addressing Julien's comments

* More assertive

* cleanup

* Favicon
- added favicon option in conf.py along with the favicon image
- udpated 🤗

 logo. slightly smaller and should appear more consistent across editing programs (no more tongue on the outside of the mouth)
Co-authored-by: joshchagani <joshua@joshuachagani.com>

65e7c90a

20 Feb, 2020 1 commit

New BartModel (#2745) · 53ce3854

Sam Shleifer authored Feb 20, 2020

* Results same as fairseq
* Wrote a ton of tests
* Struggled with api signatures
* added some docs

53ce3854

30 Jan, 2020 2 commits
- Add layerdrop · b43cb09a
  Hang Le authored Jan 30, 2020
  
  b43cb09a
- FlauBERT documentation · 73306d02
  Lysandre authored Jan 29, 2020
  
  73306d02
23 Jan, 2020 2 commits
- XLM-RoBERTa · 980211a6
  Lysandre authored Jan 20, 2020
  
  980211a6
- Glossary · 9bab9b83
  Lysandre authored Jan 14, 2020
  
  9bab9b83
06 Jan, 2020 2 commits
- GPU text generation: mMoved the encoded_prompt to correct device · 81d6841b
  alberduris authored Dec 31, 2019
  
  81d6841b
- Moved the encoded_prompts to correct device · dd4df80f
  alberduris authored Dec 31, 2019
  
  dd4df80f
18 Dec, 2019 2 commits
- docs: fix numbering 😅 · f09d9996
  Stefan Schweter authored Dec 18, 2019
  
  f09d9996
- docs: add XLM-RoBERTa to index page · d35405b7
  Stefan Schweter authored Dec 18, 2019
  
  d35405b7
16 Dec, 2019 1 commit
- [doc] Model upload and sharing · 855ff0e9
  Julien Chaumond authored Dec 16, 2019
```
ping @lysandrejik @thomwolf

Is this clear enough? Anything we should add?
```
  855ff0e9
09 Dec, 2019 1 commit
- fix albert links · 5c877fe9
  Pierric Cistac authored Dec 09, 2019
  
  5c877fe9
26 Nov, 2019 1 commit
- CamemBERT & ALBERT doc · ee4647bd
  Lysandre authored Nov 26, 2019
  
  ee4647bd
18 Oct, 2019 1 commit
- Benchmark section added to the documentation · 82f6abd9
  LysandreJik authored Oct 18, 2019
  
  82f6abd9
09 Oct, 2019 1 commit
- Update CTRL documentation · 7fe98d8c
  LysandreJik authored Oct 09, 2019
  
  7fe98d8c
07 Oct, 2019 1 commit
- Multilingual · 8fcc6507
  LysandreJik authored Oct 07, 2019
  
  8fcc6507
03 Oct, 2019 1 commit
- fix links in doc index · e2ae9c0b
  VictorSanh authored Oct 03, 2019
  
  e2ae9c0b
26 Sep, 2019 5 commits
- Repository link in the documentation · 93f0c5fc
  LysandreJik authored Sep 26, 2019
  
  93f0c5fc
- Documentation · de5e4864
  LysandreJik authored Sep 26, 2019
  
  de5e4864
- GLUE processors · c4ac7a76
  LysandreJik authored Sep 25, 2019
  
  c4ac7a76
- Documentation · cf5c5c9e
  LysandreJik authored Sep 26, 2019
  
  cf5c5c9e
- [BIG] pytorch-transformers => transformers · 31c23bd5
  thomwolf authored Sep 26, 2019
  
  31c23bd5
30 Aug, 2019 4 commits
- Fix documentation index · 09363f2a
  LysandreJik authored Aug 30, 2019
  
  09363f2a
- fix link · e0caab0c
  LysandreJik authored Aug 30, 2019
  
  e0caab0c
- Fix index number in documentation · a600b30c
  LysandreJik authored Aug 30, 2019
  
  a600b30c
- Added DistilBERT to documentation index · 20c06fa3
  LysandreJik authored Aug 30, 2019
  
  20c06fa3
28 Aug, 2019 1 commit
- Documentation additions · 1dc43e56
  LysandreJik authored Aug 28, 2019
  
  1dc43e56
14 Aug, 2019 1 commit
- Doc · 572dcfd1
  LysandreJik authored Aug 14, 2019
  
  572dcfd1
05 Aug, 2019 1 commit
- update doc and tests · 13936a96
  thomwolf authored Aug 05, 2019
  
  13936a96
04 Aug, 2019 2 commits
- updating docs - adding few tests to tokenizers · 00132b7a
  thomwolf authored Aug 04, 2019
  
  00132b7a
- big doc update [WIP] · 009273db
  thomwolf authored Aug 04, 2019
  
  009273db
16 Jul, 2019 1 commit
- updates to readme and doc · 43e0e8fa
  thomwolf authored Jul 16, 2019
  
  43e0e8fa
14 Jul, 2019 1 commit
- updating examples and doc · 2397f958
  thomwolf authored Jul 14, 2019
  
  2397f958