Commits · cafa6a9e29f3e99c67a1028f8ca779d439bc0689 · chenpangpang / transformers

07 May, 2020 1 commit

Patrick von Platen authored May 07, 2020

* first copy & past commit from Bert and morgans LSH code

* add easy way to compare to trax original code

* translate most of function

* make trax lsh self attention deterministic with numpy seed + copy paste code

* add same config

* add same config

* make layer init work

* implemented hash_vectors function for lsh attention

* continue reformer translation

* hf LSHSelfAttentionLayer gives same output as trax layer

* refactor code

* refactor code

* refactor code

* refactor

* refactor + add reformer config

* delete bogus file

* split reformer attention layer into two layers

* save intermediate step

* save intermediate step

* make test work

* add complete reformer block layer

* finish reformer layer

* implement causal and self mask

* clean reformer test and refactor code

* fix merge conflicts

* fix merge conflicts

* update init

* fix device for GPU

* fix chunk length init for tests

* include morgans optimization

* improve memory a bit

* improve comment

* factorize num_buckets

* better testing parameters

* make whole model work

* make lm model work

* add t5 copy paste tokenizer

* add chunking feed forward

* clean config

* add improved assert statements

* make tokenizer work

* improve test

* correct typo

* extend config

* add complexer test

* add new axial position embeddings

* add local block attention layer

* clean tests

* refactor

* better testing

* save intermediate progress

* clean test file

* make shorter input length work for model

* allow variable input length

* refactor

* make forward pass for pretrained model work

* add generation possibility

* finish dropout and init

* make style

* refactor

* add first version of RevNet Layers

* make forward pass work and add convert file

* make uploaded model forward pass work

* make uploaded model forward pass work

* refactor code

* add namedtuples and cache buckets

* correct head masks

* refactor

* made reformer more flexible

* make style

* remove set max length

* add attention masks

* fix up tests

* fix lsh attention mask

* make random seed optional for the moment

* improve memory in reformer

* add tests

* make style

* make sure masks work correctly

* detach gradients

* save intermediate

* correct backprob through gather

* make style

* change back num hashes

* rename to labels

* fix rotation shape

* fix detach

* update

* fix trainer

* fix backward dropout

* make reformer more flexible

* fix conflict

* fix

* fix

* add tests for fixed seed in reformer layer

* fix trainer typo

* fix typo in activations

* add fp16 tests

* add fp16 training

* support fp16

* correct gradient bug in reformer

* add fast gelu

* re-add dropout for embedding dropout

* better naming

* better naming

* renaming

* finalize test branch

* finalize tests

* add more tests

* finish tests

* fix

* fix type trainer

* fix fp16 tests

* fix tests

* fix tests

* fix tests

* fix issue with dropout

* fix dropout seeds

* correct random seed on gpu

* finalize random seed for dropout

* finalize random seed for dropout

* remove duplicate line

* correct half precision bug

* make style

* refactor

* refactor

* docstring

* remove sinusoidal position encodings for reformer

* move chunking to modeling_utils

* make style

* clean config

* make style

* fix tests

* fix auto tests

* pretrained models

* fix docstring

* update conversion file

* Update pretrained_models.rst

* fix rst

* fix rst

* update copyright

* fix test path

* fix test path

* fix small issue in test

* include reformer in generation tests

* add docs for axial position encoding

* finish docs

* Update convert_reformer_trax_checkpoint_to_pytorch.py

* remove isort

* include sams comments

* remove wrong comment in utils

* correct typos

* fix typo

* Update reformer.rst

* applied morgans optimization

* make style

* make gpu compatible

* remove bogus file

* big test refactor

* add example for chunking

* fix typo

* add to README

dca34695

06 May, 2020 1 commit
- change order pytorch/tf in readme (#4167) · 877fc564
  Clement authored May 06, 2020
  
  877fc564
30 Apr, 2020 1 commit
- Fix TF input docstrings to refer to tf.Tensor rather than torch.FloatTensor. (#4051) · 64070cbb
  Jared T Nielsen authored Apr 30, 2020
  
  64070cbb
23 Apr, 2020 1 commit
- quick fix wording readme for community models (#3900) · 6ba254ee
  Clement authored Apr 23, 2020
  
  6ba254ee
22 Apr, 2020 1 commit

Trainer (#3800) · dd9d483d

Julien Chaumond authored Apr 21, 2020

* doc

* [tests] Add sample files for a regression task

* [HUGE] Trainer

* Feedback from @sshleifer

* Feedback from @thomwolf + logging tweak

* [file_utils] when downloading concurrently, get_from_cache will use the cached file for subsequent processes

* [glue] Use default max_seq_length of 128 like before

* [glue] move DataTrainingArguments around

* [ner] Change interface of InputExample, and align run_{tf,pl}

* Re-align the pl scripts a little bit

* ner

* [ner] Add integration test

* Fix language_modeling with API tweak

* [ci] Tweak loss target

* Don't break console output

* amp.initialize: model must be on right device before

* [multiple-choice] update for Trainer

* Re-align to 827d6d6e

dd9d483d

18 Apr, 2020 1 commit
- add "by" to ReadMe · a21d4fa4
  Patrick von Platen authored Apr 18, 2020
  
  a21d4fa4
16 Apr, 2020 1 commit

[Docs] Add DialoGPT (#3755) · d22894df

Patrick von Platen authored Apr 16, 2020



* add dialoGPT

* update README.md

* fix conflict

* update readme

* add code links to docs

* Update README.md

* Update dialo_gpt2.rst

* Update pretrained_models.rst

* Update docs/source/model_doc/dialo_gpt2.rst
Co-Authored-By: Julien Chaumond <chaumond@gmail.com>

* change filename of dialogpt
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

d22894df

10 Apr, 2020 1 commit
- [docs] The use of `do_lower_case` in scripts is on its way to deprecation (#3738) · cbad305c
  Julien Chaumond authored Apr 10, 2020
  
  cbad305c
08 Apr, 2020 1 commit
- Update doc for {Summarization,Translation}Pipeline and other tweaks · 83703cd0
  Julien Chaumond authored Apr 07, 2020
  
  83703cd0
03 Apr, 2020 1 commit

ELECTRA (#3257) · d5d7d886

Lysandre Debut authored Apr 03, 2020

* Electra wip

* helpers

* Electra wip

* Electra v1

* ELECTRA may be saved/loaded

* Generator & Discriminator

* Embedding size instead of halving the hidden size

* ELECTRA Tokenizer

* Revert BERT helpers

* ELECTRA Conversion script

* Archive maps

* PyTorch tests

* Start fixing tests

* Tests pass

* Same configuration for both models

* Compatible with base + large

* Simplification + weight tying

* Archives

* Auto + Renaming to standard names

* ELECTRA is uncased

* Tests

* Slight API changes

* Update tests

* wip

* ElectraForTokenClassification

* temp

* Simpler arch + tests

Removed ElectraForPreTraining which will be in a script

* Conversion script

* Auto model

* Update links to S3

* Split ElectraForPreTraining and ElectraForTokenClassification

* Actually test PreTraining model

* Remove num_labels from configuration

* wip

* wip

* From discriminator and generator to electra

* Slight API changes

* Better naming

* TensorFlow ELECTRA tests

* Accurate conversion script

* Added to conversion script

* Fast ELECTRA tokenizer

* Style

* Add ELECTRA to README

* Modeling Pytorch Doc + Real style

* TF Docs

* Docs

* Correct links

* Correct model intialized

* random fixes

* style

* Addressing Patrick's and Sam's comments

* Correct links in docs

d5d7d886

17 Mar, 2020 1 commit

CPU/GPU memory benchmarking utilities - Remove support for python 3.5 (now only 3.6+) (#3186) · 2187c49f

Thomas Wolf authored Mar 17, 2020

* memory benchmark rss

* have both forward pass and line-by-line mem tracing

* cleaned up tracing

* refactored and cleaning up API

* no f-strings yet...

* add GPU mem logging

* fix GPU memory monitoring

* style and quality

* clean up and doc

* update with comments

* Switching to python 3.6+

* fix quality

2187c49f

12 Mar, 2020 1 commit
- add BART to README (#3255) · 087465b9
  Sam Shleifer authored Mar 12, 2020
  
  087465b9
10 Mar, 2020 2 commits
- [doc] --organization tweak · d6de6423
  Julien Chaumond authored Mar 10, 2020
```
Co-Authored-By: Thomas Wolf <thomwolf@users.noreply.github.com>
```
  d6de6423
- [doc] Document the new --organization flag of CLI · 0e56dc30
  Julien Chaumond authored Mar 10, 2020
  
  0e56dc30
20 Feb, 2020 1 commit
- Add syntax highlighting to the BibTeX in README · 976e9afe
  Santiago Castro authored Feb 19, 2020
  
  976e9afe
19 Feb, 2020 1 commit
- README link + better instructions for release · 59c23ad9
  Lysandre authored Feb 19, 2020
  
  59c23ad9
07 Feb, 2020 1 commit
- distilbert-base-cased weights + Readmes + omissions · ee5a6856
  VictorSanh authored Feb 07, 2020
  
  ee5a6856
06 Feb, 2020 1 commit
- Add contributors snapshot · c069932f
  Clement authored Feb 06, 2020
```
powered by https://github.com/sourcerer-io/hall-of-fame
```
  c069932f
05 Feb, 2020 1 commit
- [doc] model sharing: mention README.md + tweaks · eae8ee03
  Julien Chaumond authored Feb 05, 2020
```
cc @lysandrejik @thomwolf
```
  eae8ee03
31 Jan, 2020 2 commits
- Typo on markdown link in README.md · 3a21d6da
  Arnaud authored Jan 31, 2020
  
  3a21d6da
- v2.4.0 documentation · 0aa40e95
  Lysandre authored Jan 31, 2020
  
  0aa40e95
30 Jan, 2020 2 commits

fill_mask helper (#2576) · 9fa836a7

Julien Chaumond authored Jan 30, 2020

* fill_mask helper

* [poc] FillMaskPipeline

* Revert "[poc] FillMaskPipeline"

This reverts commit 67eeea55b0f97b46c2b828de0f4ee97d87338335.

* Revert "fill_mask helper"

This reverts commit cacc17b884e14bb6b07989110ffe884ad9e36eaa.

* README: clarify that Pipelines can also do text-classification

cf. question at the AI&ML meetup last week, @mfuntowicz

* Fix test: test feature-extraction pipeline

* Test tweaks

* Slight refactor of existing pipeline (in preparation of new FillMaskPipeline)

* Extraneous doc

* More robust way of doing this

@mfuntowicz as we don't rely on the model name anymore (see AutoConfig)

* Also add RobertaConfig as a quickfix for wrong token_type_ids

* cs

* [BIG] FillMaskPipeline

9fa836a7

Add Flaubert · f0a4fc6c
Hang Le authored Jan 15, 2020

f0a4fc6c

23 Jan, 2020 1 commit
- Doc tweak on model sharing · 119dc50e
  Julien Chaumond authored Jan 22, 2020
  
  119dc50e
06 Jan, 2020 2 commits
- GPU text generation: mMoved the encoded_prompt to correct device · 81d6841b
  alberduris authored Dec 31, 2019
  
  81d6841b
- Moved the encoded_prompts to correct device · dd4df80f
  alberduris authored Dec 31, 2019
  
  dd4df80f
05 Jan, 2020 2 commits
- Fix syntax + link to community page · 78528742
  Julien Chaumond authored Jan 05, 2020
  
  78528742
- Proposition to include community models in readme · 12e0aa43
  Clement authored Jan 01, 2020
  
  12e0aa43
28 Dec, 2019 1 commit
- [cli] Update doc · 9b2badf3
  Julien Chaumond authored Dec 27, 2019
  
  9b2badf3
27 Dec, 2019 1 commit
- Quote square brackets in shell commands. · 3233b58a
  Aymeric Augustin authored Dec 27, 2019
```
This ensures compatibility with zsh.

Fix #2316.
```
  3233b58a
24 Dec, 2019 1 commit

Remove [--editable] in install instructions. · a8d34e53

Aymeric Augustin authored Dec 24, 2019

Use -e only in docs targeted at contributors.

If a user copy-pastes  command line with [--editable], they will hit
an error. If they don't know the --editable option, we're giving them
a choice to make before they can move forwards, but this isn't a choice
they need to make right now.

a8d34e53

23 Dec, 2019 1 commit
- Update contribution instructions. · 70373a5f
  Aymeric Augustin authored Dec 22, 2019
```
Also provide shortcuts in a Makefile.
```
  70373a5f
22 Dec, 2019 5 commits
- Remove references to Python 2 in documentation. · 45841eaf
  Aymeric Augustin authored Dec 22, 2019
  
  45841eaf
- Remove duplicate -v flag. · b6ea0f43
  Aymeric Augustin authored Dec 22, 2019
  
  b6ea0f43
- Switch test files to the standard test_*.py scheme. · ced0a942
  Aymeric Augustin authored Dec 22, 2019
  
  ced0a942
- Move tests outside of library. · 067395d5
  Aymeric Augustin authored Dec 22, 2019
  
  067395d5
- Remove trailing whitespace in README. · 698f9e3d
  Aymeric Augustin authored Dec 22, 2019
  
  698f9e3d
20 Dec, 2019 3 commits
- Release: v2.3.0 · a436574b
  Lysandre authored Dec 20, 2019
  
  a436574b
- update link in readme · 71883b6d
  thomwolf authored Dec 20, 2019
  
  71883b6d
- Added pipelines quick tour in README · b98ff885
  Morgan Funtowicz authored Dec 20, 2019
  
  b98ff885