Commits · d22894dfd40d5c858e8398e2783545103d191b47 · chenpangpang / transformers

16 Apr, 2020 1 commit

Patrick von Platen authored Apr 16, 2020



* add dialoGPT

* update README.md

* fix conflict

* update readme

* add code links to docs

* Update README.md

* Update dialo_gpt2.rst

* Update pretrained_models.rst

* Update docs/source/model_doc/dialo_gpt2.rst
Co-Authored-By: Julien Chaumond <chaumond@gmail.com>

* change filename of dialogpt
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

d22894df

10 Apr, 2020 2 commits
- [docs] The use of `do_lower_case` in scripts is on its way to deprecation (#3738) · cbad305c
  Julien Chaumond authored Apr 10, 2020
  
  cbad305c
- Multilingual BART - (#3602) · 7a7fdf71
  Sam Shleifer authored Apr 10, 2020
```
- support mbart-en-ro weights
- add MBartTokenizer
```
  7a7fdf71
06 Apr, 2020 2 commits
- Update notebooks (#3620) · 261c4ff4
  Lysandre Debut authored Apr 06, 2020
```
* Update notebooks

* From local to global link

* from local links to *actual* global links
```
  261c4ff4
- Release: v2.8.0 · 36bffc81
  LysandreJik authored Apr 06, 2020
  
  36bffc81
04 Apr, 2020 1 commit
- weigths*weights · 94eb68d7
  Julien Chaumond authored Apr 04, 2020
  
  94eb68d7
03 Apr, 2020 1 commit

ELECTRA (#3257) · d5d7d886

Lysandre Debut authored Apr 03, 2020

* Electra wip

* helpers

* Electra wip

* Electra v1

* ELECTRA may be saved/loaded

* Generator & Discriminator

* Embedding size instead of halving the hidden size

* ELECTRA Tokenizer

* Revert BERT helpers

* ELECTRA Conversion script

* Archive maps

* PyTorch tests

* Start fixing tests

* Tests pass

* Same configuration for both models

* Compatible with base + large

* Simplification + weight tying

* Archives

* Auto + Renaming to standard names

* ELECTRA is uncased

* Tests

* Slight API changes

* Update tests

* wip

* ElectraForTokenClassification

* temp

* Simpler arch + tests

Removed ElectraForPreTraining which will be in a script

* Conversion script

* Auto model

* Update links to S3

* Split ElectraForPreTraining and ElectraForTokenClassification

* Actually test PreTraining model

* Remove num_labels from configuration

* wip

* wip

* From discriminator and generator to electra

* Slight API changes

* Better naming

* TensorFlow ELECTRA tests

* Accurate conversion script

* Added to conversion script

* Fast ELECTRA tokenizer

* Style

* Add ELECTRA to README

* Modeling Pytorch Doc + Real style

* TF Docs

* Docs

* Correct links

* Correct model intialized

* random fixes

* style

* Addressing Patrick's and Sam's comments

* Correct links in docs

d5d7d886

31 Mar, 2020 3 commits
- [Docs] Add usage examples for translation and summarization (#3538) · 83d1fbcf
  Patrick von Platen authored Mar 31, 2020
  
  83d1fbcf
- Update usage doc regarding generate fn (#3504) · 42e1e3c6
  Patrick von Platen authored Mar 31, 2020
  
  42e1e3c6
- Add better explanation to check `docs` locally. (#3459) · 57b0fab6
  Patrick von Platen authored Mar 31, 2020
  
  57b0fab6
30 Mar, 2020 2 commits

Release: v2.7.0 · 6f5a12a5
LysandreJik authored Mar 30, 2020

6f5a12a5

[T5] Add training documenation (#3507) · 5b44e0a3

Patrick von Platen authored Mar 30, 2020

* Add clear description of how to train T5

* correct docstring in T5

* correct typo

* correct docstring format

* update t5 model docs

* implement collins feedback

* fix typo and add more explanation for sentinal tokens

* delete unnecessary todos

5b44e0a3

27 Mar, 2020 1 commit

Add T5 to docs (#3461) · fa9af246

Patrick von Platen authored Mar 27, 2020

* add t5 docs basis

* improve docs

* add t5 docs

* improve t5 docstring

* add t5 tokenizer docstring

* finish docstring

* make style

* add pretrained models

* correct typo

* make examples work

* finalize docs

fa9af246

24 Mar, 2020 1 commit
- Release: v2.6.0 · 471cce24
  LysandreJik authored Mar 24, 2020
  
  471cce24
17 Mar, 2020 2 commits

Add Summarization to Pipelines (#3128) · 38a555a8

Sam Shleifer authored Mar 17, 2020

* passing

* Undo stupid chg

* docs

* undo rename

* delete-cruft

* only import if you have torch

* Dont rely on dict ordering

* Fix dict ordering upstream

* docstring link

* docstring link

* remove trailing comma for 3.5 compat

* new name

* delegate kwarging

* Update kwargs

38a555a8

CPU/GPU memory benchmarking utilities - Remove support for python 3.5 (now only 3.6+) (#3186) · 2187c49f

Thomas Wolf authored Mar 17, 2020

* memory benchmark rss

* have both forward pass and line-by-line mem tracing

* cleaned up tracing

* refactored and cleaning up API

* no f-strings yet...

* add GPU mem logging

* fix GPU memory monitoring

* style and quality

* clean up and doc

* update with comments

* Switching to python 3.6+

* fix quality

2187c49f

10 Mar, 2020 2 commits
- [doc] --organization tweak · d6de6423
  Julien Chaumond authored Mar 10, 2020
```
Co-Authored-By: Thomas Wolf <thomwolf@users.noreply.github.com>
```
  d6de6423
- [doc] Document the new --organization flag of CLI · 0e56dc30
  Julien Chaumond authored Mar 10, 2020
  
  0e56dc30
05 Mar, 2020 2 commits
- Rename BartForMaskedLM -> BartForConditionalGeneration (#3114) · 857e0a0d
  Sam Shleifer authored Mar 05, 2020
```
* improved documentation
```
  857e0a0d
- Fix failing doc samples · 07a79db5
  Lysandre authored Mar 04, 2020
  
  07a79db5
02 Mar, 2020 2 commits

Pipeline doc (#3055) · d3eb7d23

Lysandre Debut authored Mar 02, 2020

* Pipeline doc initial commit

* pipeline abstraction

* Remove modelcard argument from pipeline

* Task-specific pipelines can be instantiated with no model or tokenizer

* All pipelines doc

d3eb7d23

Bart-CNN (#3059) · b54ef78d

Sam Shleifer authored Mar 02, 2020

`generate` code that produces 99% identical summarizations to fairseq on CNN test data, with caching.

b54ef78d

26 Feb, 2020 1 commit
- Delete all mentions of Model2Model (#3019) · 9df74b8b
  Sam Shleifer authored Feb 26, 2020
  
  9df74b8b
25 Feb, 2020 2 commits

Documentation (#2989) · bb7c4685

Lysandre Debut authored Feb 25, 2020

* All Tokenizers

BertTokenizer + few fixes
RobertaTokenizer
OpenAIGPTTokenizer + Fixes
GPT2Tokenizer + fixes
TransfoXLTokenizer
Correct rst for TransformerXL
XLMTokenizer + fixes
XLNet Tokenizer + Style
DistilBERT + Fix XLNet RST
CTRLTokenizer
CamemBERT Tokenizer
FlaubertTokenizer
XLMRobertaTokenizer
cleanup

* cleanup

bb7c4685

Adding usage examples for common tasks (#2850) · 65e7c90a

Lysandre Debut authored Feb 25, 2020

* Usage: Sequence Classification & Question Answering

* Pipeline example

* Language modeling

* TensorFlow code for Sequence classification

* Custom TF/PT toggler in docs

* QA + LM for TensorFlow

* Finish Usage for both PyTorch and TensorFlow

* Addressing Julien's comments

* More assertive

* cleanup

* Favicon
- added favicon option in conf.py along with the favicon image
- udpated 🤗

 logo. slightly smaller and should appear more consistent across editing programs (no more tongue on the outside of the mouth)
Co-authored-by: joshchagani <joshua@joshuachagani.com>

65e7c90a

24 Feb, 2020 1 commit
- Release: v2.5.1 · f9ec5ca9
  Lysandre authored Feb 24, 2020
  
  f9ec5ca9
20 Feb, 2020 1 commit

New BartModel (#2745) · 53ce3854

Sam Shleifer authored Feb 20, 2020

* Results same as fairseq
* Wrote a ton of tests
* Struggled with api signatures
* added some docs

53ce3854

19 Feb, 2020 1 commit
- Release: v2.5.0 · fb560dcb
  Lysandre authored Feb 19, 2020
```
Welcome Rust Tokenizers
```
  fb560dcb
10 Feb, 2020 1 commit
- Correct quickstart example when using the past · fd639e5b
  Lysandre authored Feb 10, 2020
  
  fd639e5b
07 Feb, 2020 4 commits
- Update RoBERTa tips · dd288303
  Lysandre authored Feb 07, 2020
  
  dd288303
- Update XLM-R tips · db979301
  Lysandre authored Feb 07, 2020
  
  db979301
- distilbert-base-cased weights + Readmes + omissions · ee5a6856
  VictorSanh authored Feb 07, 2020
  
  ee5a6856
- [examples] rename run_lm_finetuning to run_language_modeling · 42f08e59
  Julien Chaumond authored Feb 06, 2020
  
  42f08e59
06 Feb, 2020 2 commits
- Oopsie · 7748cbbe
  Julien Chaumond authored Feb 06, 2020
  
  7748cbbe
- [docs] Add menu w/ links to other pages on hf.co · 432c1252
  Julien Chaumond authored Feb 06, 2020
  
  432c1252
05 Feb, 2020 1 commit
- [doc] model sharing: mention README.md + tweaks · eae8ee03
  Julien Chaumond authored Feb 05, 2020
```
cc @lysandrejik @thomwolf
```
  eae8ee03
04 Feb, 2020 1 commit
- Update quickstart · 9c67196b
  Lysandre authored Feb 04, 2020
  
  9c67196b
31 Jan, 2020 2 commits
- Patch: v2.4.1 · d426b58b
  Lysandre authored Jan 31, 2020
  
  d426b58b
- Release: v2.4.0 · 6664ea94
  Lysandre authored Jan 31, 2020
  
  6664ea94
30 Jan, 2020 1 commit
- Add layerdrop · b43cb09a
  Hang Le authored Jan 30, 2020
  
  b43cb09a