- 20 Sep, 2020 1 commit
-
-
Stas Bekman authored
Found an issue where `@slow` gets silently ignored when it isn't the last decorator, so this requirement is now documented.
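A minimal sketch of the pitfall (the test names and arguments here are hypothetical):

```python
import unittest
from parameterized import parameterized
from transformers.testing_utils import slow

class FooIntegrationTest(unittest.TestCase):
    # Correct: skip decorators such as @slow are listed last (closest to the def),
    # so the generated tests produced by @parameterized.expand carry the skip.
    @parameterized.expand([(1,), (2,)])
    @slow
    def test_integration_foo(self, arg):
        ...

    # Broken: @parameterized.expand replaces the test with generated ones that
    # do not carry the outer @slow, so the skip marker is silently lost.
    @slow
    @parameterized.expand([(1,), (2,)])
    def test_integration_bar(self, arg):
        ...
```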
-
- 17 Sep, 2020 1 commit
-
-
Stas Bekman authored
* ready for PR
* cleanup
* correct FSMT_PRETRAINED_MODEL_ARCHIVE_LIST
* fix
* perfectionism
* revert change from another PR
* odd, already committed this one
* non-interactive upload workaround
* backup the failed experiment
* store langs in config
* workaround for localizing model path
* doc clean up as in https://github.com/huggingface/transformers/pull/6956
* style
* back out debug mode
* document: run_eval.py --num_beams 10
* remove unneeded constant
* typo
* re-use bart's Attention
* re-use EncoderLayer, DecoderLayer from bart
* refactor
* send to cuda and fp16
* cleanup
* revert (moved to another PR)
* better error message
* document run_eval --num_beams
* solve the problem of tokenizer finding the right files when model is local
* polish, remove hardcoded config
* add a note that the file is autogenerated to avoid losing changes
* prep for org change, remove unneeded code
* switch to model4.pt, update scores
* s/python/bash/
* missing init (but doesn't impact the finetuned model)
* cleanup
* major refactor (reuse-bart)
* new model, new expected weights
* cleanup
* cleanup
* full link
* fix model type
* merge porting notes
* style
* cleanup
* have to create a DecoderConfig object to handle vocab_size properly
* doc fix
* add note (not a public class)
* parametrize
* add bleu scores integration tests
* skip test if sacrebleu is not installed
* cache heavy models/tokenizers
* some tweaks
* remove tokens that aren't used
* more purging
* simplify code
* switch to using decoder_start_token_id
* add doc
* Revert "major refactor (reuse-bart)" (this reverts commit 226dad15ca6a9ef4e26178526e878e8fc5c85874)
* decouple from bart
* remove unused code #1
* remove unused code #2
* remove unused code #3
* update instructions
* clean up
* move bleu eval to examples
* check import only once
* move data+gen script into files
* reuse via import
* take less space
* add prepare_seq2seq_batch (auto-tested)
* cleanup
* recode test to use json instead of yaml
* ignore keys not needed
* use the new -y in transformers-cli upload -y
* [xlm tok] config dict: fix str into int to match definition (#7034)
* [s2s] --eval_max_generate_length (#7018)
* Fix CI with change of name of nlp (#7054)
* nlp -> datasets
* More nlp -> datasets
* Woopsie
* More nlp -> datasets
* One last
* extending to support allen_nlp wmt models
  - allow a specific checkpoint file to be passed
  - more arg settings
  - scripts for allen_nlp models
* sync with changes
* s/fsmt-wmt/wmt/ in model names
* s/fsmt-wmt/wmt/ in model names (p2)
* s/fsmt-wmt/wmt/ in model names (p3)
* switch to a better checkpoint
* typo
* make non-optional args such
  - adjust tests where possible or skip when there is no other choice
* consistency
* style
* adjust header
* cards moved (model rename)
* use best custom hparams
* update info
* remove old cards
* cleanup
* s/stas/facebook/
* update scores
* s/allen_nlp/allenai/
* url maps aren't needed
* typo
* move all the doc / build / eval generators to their own scripts
* cleanup
* Apply suggestions from code review (Co-authored-by: Lysandre Debut <lysandre@huggingface.co>)
* Apply suggestions from code review (Co-authored-by: Lysandre Debut <lysandre@huggingface.co>)
* fix indent
* duplicated line
* style
* use the correct add_start_docstrings
* oops
* resizing can't be done with the core approach, due to 2 dicts
* check that the arg is a list
* style
* style

Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
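For context, a minimal usage sketch of the ported FSMT models (picking the facebook/wmt19-en-ru checkpoint and the sample sentence are assumptions; the facebook org and wmt19 naming come from the items above):

```python
from transformers import FSMTForConditionalGeneration, FSMTTokenizer

mname = "facebook/wmt19-en-ru"  # one of the wmt checkpoints moved to the facebook org
tokenizer = FSMTTokenizer.from_pretrained(mname)
model = FSMTForConditionalGeneration.from_pretrained(mname)

batch = tokenizer("Machine learning is great, isn't it?", return_tensors="pt")
# run_eval.py is documented above as using --num_beams 10 for evaluation
generated = model.generate(**batch, num_beams=10)
print(tokenizer.decode(generated[0], skip_special_tokens=True))
```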
-
- 16 Sep, 2020 1 commit
-
-
Stas Bekman authored
-
- 15 Sep, 2020 1 commit
-
-
Stas Bekman authored
* [docs] add testing documentation
* Update docs/source/testing.rst (applied 14 times via review suggestions, each co-authored by Sylvain Gugger)
* tweaks as suggested
* tweaks
* more tweaks
* suggestions from @LysandreJik

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
- 14 Sep, 2020 3 commits
-
-
sgugger authored
-
Sylvain Gugger authored
-
Bartosz Telenczuk authored
-
- 11 Sep, 2020 2 commits
-
-
Sylvain Gugger authored
-
Sylvain Gugger authored
* More readable dict
* More nlp -> datasets
* Revert "More nlp -> datasets" (this reverts commit 3cd1883d226c63c4a686fc1fed35f2cd586ebe45)
* Automate the lists in auto-xxx docs
* More readable dict
* Revert "More nlp -> datasets" (this reverts commit 3cd1883d226c63c4a686fc1fed35f2cd586ebe45)
* Automate the lists in auto-xxx docs
* nlp -> datasets
* Fix new key
-
- 10 Sep, 2020 5 commits
-
-
Patrick von Platen authored
* correct docs for bert generation * upload
-
Patrick von Platen authored
-
Sylvain Gugger authored
* Add TF Funnel Transformer
* Proper dummy input
* Formatting
* Update src/transformers/modeling_tf_funnel.py (Co-authored-by: Lysandre Debut <lysandre@huggingface.co>)
* Address review comments
* One review comment forgotten

Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
-
Patrick von Platen authored
* add conversion script
* improve conversion script
* make style
* add tryout files
* fix
* update
* add causal bert
* better names
* add tokenizer file as well
* finish causal_bert
* fix small bugs
* improve generate
* change naming
* renaming
* renaming
* renaming
* remove leftover files
* clean files
* add fix tokenizer
* finalize
* correct slow test
* update docs
* small fixes
* fix link
* adapt check repo
* apply sams and sylvains recommendations
* fix import
* implement Lysandres recommendations
* fix logger warn
-
Stas Bekman authored
-
- 09 Sep, 2020 1 commit
-
-
Stas Bekman authored
* introduce TRANSFORMERS_VERBOSITY env var + test + test helpers * cleanup * remove helper function
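A minimal sketch of how the new env var is used ("error" is just one of the accepted levels: debug, info, warning, error, critical):

```python
import os

# TRANSFORMERS_VERBOSITY must be set before transformers is first imported.
os.environ["TRANSFORMERS_VERBOSITY"] = "error"

import transformers  # noqa: E402
from transformers.utils import logging  # noqa: E402

print(logging.get_verbosity())  # 40, i.e. the stdlib logging.ERROR level
```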
-
- 08 Sep, 2020 2 commits
-
-
Sam Shleifer authored
-
Sylvain Gugger authored
* Initial model
* Fix upsampling
* Add special cls token id and test
* Formatting
* Test and first FunnelTokenizerFast
* Common tests
* Fix the check_repo script and document Funnel
* Doc fixes
* Add all models
* Write doc
* Fix test
* Fix copyright
* Forgot some layers can be repeated
* Apply suggestions from code review (Co-authored-by: Lysandre Debut <lysandre@huggingface.co>, Patrick von Platen <patrick.v.platen@gmail.com>)
* Update src/transformers/modeling_funnel.py (Co-authored-by: Lysandre Debut <lysandre@huggingface.co>)
* Address review comments
* Update src/transformers/modeling_funnel.py (Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>)
* Address review comments
* Update src/transformers/modeling_funnel.py (Co-authored-by: Sam Shleifer <sshleifer@gmail.com>)
* Slow integration test
* Make small integration test
* Formatting
* Add checkpoint and separate classification head
* Formatting
* Expand list, fix link and add in pretrained models
* Styling
* Add the model in all summaries
* Typo fixes

Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
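A brief usage sketch of the new model (the funnel-transformer/small-base checkpoint name and the sample sentence are assumptions; the commit only states that checkpoints and a separate classification head were added):

```python
from transformers import FunnelForSequenceClassification, FunnelTokenizer

mname = "funnel-transformer/small-base"  # assumed checkpoint name
tokenizer = FunnelTokenizer.from_pretrained(mname)
model = FunnelForSequenceClassification.from_pretrained(mname)

inputs = tokenizer("Funnel pools the sequence to shorter lengths in deeper blocks.",
                   return_tensors="pt")
logits = model(**inputs).logits  # sequence-level classification head
```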
-
- 03 Sep, 2020 1 commit
-
-
Antonio V Mendoza authored
Adding the LXMERT pretraining model (MultiModal languageXvision) to HuggingFace's suite of models (#5793)
* added template files for LXMERT and completed the configuration_lxmert.py
* added modeling, tokenization, testing, and finishing touches for lxmert [yet to be tested]
* added model card for lxmert
* cleaning up lxmert code
* Update src/transformers/modeling_lxmert.py (Co-authored-by: Lysandre Debut <lysandre@huggingface.co>)
* Update src/transformers/modeling_tf_lxmert.py (Co-authored-by: Lysandre Debut <lysandre@huggingface.co>)
* Update src/transformers/modeling_tf_lxmert.py (Co-authored-by: Lysandre Debut <lysandre@huggingface.co>)
* Update src/transformers/modeling_lxmert.py (Co-authored-by: Lysandre Debut <lysandre@huggingface.co>)
* tested torch lxmert, changed documentation, updated outputs, and other small fixes
* Update src/transformers/convert_pytorch_checkpoint_to_tf2.py (Co-authored-by: Lysandre Debut <lysandre@huggingface.co>)
* Update src/transformers/convert_pytorch_checkpoint_to_tf2.py (Co-authored-by: Lysandre Debut <lysandre@huggingface.co>)
* Update src/transformers/convert_pytorch_checkpoint_to_tf2.py (Co-authored-by: Lysandre Debut <lysandre@huggingface.co>)
* renaming, other small issues, did not change TF code in this commit
* added lxmert question answering model in pytorch
* added capability to edit number of qa labels for lxmert
* made answer optional for lxmert question answering
* add option to return hidden_states for lxmert
* changed default qa labels for lxmert
* changed config archive path
* squashing 3 commits: merged UI + testing improvements + more UI and testing
* changed some variable names for lxmert
* TF LXMERT
* Various fixes to LXMERT
* Final touches to LXMERT
* AutoTokenizer order
* Add LXMERT to index.rst and README.md
* Merge commit test fixes + Style update
* TensorFlow 2.3.0 sequential model changes variable names; remove inherited test
* Update src/transformers/modeling_tf_pytorch_utils.py
* Update docs/source/model_doc/lxmert.rst (Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>)
* Update docs/source/model_doc/lxmert.rst (Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>)
* Update src/transformers/modeling_tf_lxmert.py (Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>)
* added suggestions
* Fixes
* Final fixes for TF model
* Fix docs

Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
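A minimal sketch of feeding LXMERT's multimodal inputs (the unc-nlp/lxmert-base-uncased checkpoint name and the question are assumptions; the random tensors stand in for real pre-extracted region features):

```python
import torch
from transformers import LxmertModel, LxmertTokenizer

mname = "unc-nlp/lxmert-base-uncased"  # assumed checkpoint name
tokenizer = LxmertTokenizer.from_pretrained(mname)
model = LxmertModel.from_pretrained(mname)

inputs = tokenizer("Who is eating the carrot?", return_tensors="pt")
# LXMERT expects pre-extracted region features (e.g. from a Faster R-CNN):
# here 36 regions with 2048-dim visual features and 4-dim normalized
# bounding-box positions, filled with dummy values.
visual_feats = torch.rand(1, 36, 2048)
visual_pos = torch.rand(1, 36, 4)

outputs = model(**inputs, visual_feats=visual_feats, visual_pos=visual_pos)
```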
-
- 02 Sep, 2020 2 commits
-
-
Suraj Patil authored
* add Text2TextGenerationPipeline
* remove max length warning
* remove comments
* remove input_length
* fix typo
* add tests
* use TFAutoModelForSeq2SeqLM
* doc
* typo
* add the doc below TextGenerationPipeline
* doc nit
* style
* delete comment
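A short usage sketch (choosing t5-small and the prompt are assumptions; any seq2seq LM works):

```python
from transformers import pipeline

# "text2text-generation" is the task name this pipeline registers.
t2t = pipeline("text2text-generation", model="t5-small")
print(t2t("translate English to German: The house is wonderful."))
# e.g. [{'generated_text': 'Das Haus ist wunderbar.'}]
```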
-
Harry Wang authored
-
- 01 Sep, 2020 6 commits
-
-
Patrick von Platen authored
* finish xlm-roberta * finish docs * expose XLMRobertaForCausalLM
-
Lysandre Debut authored
-
Lysandre authored
-
Lysandre authored
-
Patrick von Platen authored
* fix generate for GPT2 Double Head
* fix gpt2 double head model
* fix bart / t5
* also add for no beam search
* fix no beam search
* fix encoder decoder
* simplify t5
* simplify t5
* fix t5 tests
* fix BART
* fix transfo-xl
* fix conflict
* integrating sylvains and sams comments
* fix tf past_decoder_key_values
* fix enc dec test
-
Sylvain Gugger authored
* Add logging doc
* Formatting
* Update docs/source/main_classes/logging.rst
* Update src/transformers/utils/logging.py

Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
-
- 27 Aug, 2020 1 commit
-
-
Lysandre Debut authored
-
- 26 Aug, 2020 1 commit
-
-
Patrick von Platen authored
-
- 25 Aug, 2020 1 commit
-
-
Quentin Lhoest authored
* add dpr to models summary
* minor
* minor
* Update docs/source/model_summary.rst: qa -> question answering (Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>)
* Update docs/source/model_summary.rst: qa -> question answering (cont'd) (Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>)

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
- 24 Aug, 2020 2 commits
-
-
Sam Shleifer authored
-
Stas Bekman authored
As suggested here: https://github.com/huggingface/transformers/issues/6651#issuecomment-678594233, this removes the generic `generate` doc, whose examples are not relevant to bart.
-
- 21 Aug, 2020 4 commits
-
-
Suraj Patil authored
-
Patrick von Platen authored
* add pegasus to docs * Update docs/source/model_summary.rst
-
Suraj Patil authored
* added CamembertForCausalLM * add in __init__ and auto model * style * doc
-
Morgan Funtowicz authored
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>
-
- 20 Aug, 2020 2 commits
-
-
Joe Davison authored
* add intro to nlp lib + links * unique links...
-
Romain Rigaux authored
Tested in a local build of the docs, e.g. just above https://huggingface.co/transformers/task_summary.html#causal-language-modeling

Copy will copy the full code, e.g.:

    for token in top_5_tokens:
        print(sequence.replace(tokenizer.mask_token, tokenizer.decode([token])))

instead of, as currently, only:

    for token in top_5_tokens:

The doctest block in the docs reads:

    >>> for token in top_5_tokens:
    ...     print(sequence.replace(tokenizer.mask_token, tokenizer.decode([token])))
    Distilled models are smaller than the models they mimic. Using them instead of the large versions would help reduce our carbon footprint.
    Distilled models are smaller than the models they mimic. Using them instead of the large versions would help increase our carbon footprint.
    Distilled models are smaller than the models they mimic. Using them instead of the large versions would help decrease our carbon footprint.
    Distilled models are smaller than the models they mimic. Using them instead of the large versions would help offset our carbon footprint.
    Distilled models are smaller than the models they mimic. Using them instead of the large versions would help improve our carbon footprint.

Docs for the option fix: https://sphinx-copybutton.readthedocs.io/en/latest/
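A sketch of the relevant sphinx-copybutton settings (the exact values in the repo's conf.py are an assumption; the option names are from the sphinx-copybutton docs linked above):

```python
# docs/source/conf.py
extensions = ["sphinx_copybutton"]

# Treat ">>> " and "... " as doctest prompts: with prompts configured, only
# prompt lines are copied (copybutton_only_copy_prompt_lines defaults to True)
# and the prompts themselves are stripped, so output lines are excluded.
copybutton_prompt_text = r">>> |\.\.\. "
copybutton_prompt_is_regexp = True
```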
-
- 19 Aug, 2020 1 commit
-
-
Sylvain Gugger authored
-
- 18 Aug, 2020 2 commits
-
-
Suraj Patil authored
Minor typo correction @sshleifer
-
Romain Rigaux authored
-