Commits · e643a297228c8cb2c189fe4c93e11125f938d20b · chenpangpang / transformers

17 Sep, 2020 9 commits

Change to use relative imports in some files & Add python prompt symbols to example codes (#7202) · e643a297

Sohee Yang authored Sep 18, 2020



* Move 'from transformers' statements to relative imports in some files

* Add python prompt symbols in front of the example codes

* Reformat the code

* Add one missing space
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

e643a297

[model cards] ported allenai Deep Encoder, Shallow Decoder models (#7153) · 0fe6e435

Stas Bekman authored Sep 17, 2020

* [model cards] ported allenai Deep Encoder, Shallow Decoder models

* typo

* fix references

* add allenai/wmt19-de-en-6-6 model cards

* fill-in the missing info for the build script as provided by the searcher.

0fe6e435

[ported model] FSMT (FairSeq MachineTranslation) (#6940) · 1eeb206b

Stas Bekman authored Sep 17, 2020

* ready for PR

* cleanup

* correct FSMT_PRETRAINED_MODEL_ARCHIVE_LIST

* fix

* perfectionism

* revert change from another PR

* odd, already committed this one

* non-interactive upload workaround

* backup the failed experiment

* store langs in config

* workaround for localizing model path

* doc clean up as in https://github.com/huggingface/transformers/pull/6956

* style

* back out debug mode

* document: run_eval.py --num_beams 10

* remove unneeded constant

* typo

* re-use bart's Attention

* re-use EncoderLayer, DecoderLayer from bart

* refactor

* send to cuda and fp16

* cleanup

* revert (moved to another PR)

* better error message

* document run_eval --num_beams

* solve the problem of tokenizer finding the right files when model is local

* polish, remove hardcoded config

* add a note that the file is autogenerated to avoid losing changes

* prep for org change, remove u...

1eeb206b

Trainer multi label (#7191) · 492bb6aa
Sylvain Gugger authored Sep 17, 2020
```
* Trainer accep multiple labels

* Missing import

* Fix dosctrings
```
492bb6aa

Transformer-XL: Remove unused parameters (#7087) · 70974592

RafaelWO authored Sep 17, 2020

* Removed 'tgt_len' and 'ext_len' from Transfomer-XL

 * Some changes are still to be done

* Removed 'tgt_len' and 'ext_len' from Transfomer-XL (2)

 * Removed comments
 * Fixed quality

* Changed warning to info

70974592

added multilabel text classification notebook using distilbert to community notebooks (#7201) · c183d81e

Dhaval Taunk authored Sep 17, 2020

* added multilabel classification using distilbert notebook to community notebooks

* added multilabel classification using distilbert notebook to community notebooks

c183d81e

remove deprecated flag (#7171) · 79111b77

Stas Bekman authored Sep 17, 2020

```
/home/circleci/.local/lib/python3.6/site-packages/isort/main.py:915: UserWarning: W0501: The following deprecated CLI flags were used and ignored: --recursive!
  "W0501: The following deprecated CLI flags were used and ignored: "
```

79111b77

remove duplicated code (#7173) · 0cdafbf7
Stas Bekman authored Sep 17, 2020

0cdafbf7
[s2s] fix kwarg typo (#7196) · 45b0b1ff
Sam Shleifer authored Sep 16, 2020

45b0b1ff

16 Sep, 2020 11 commits
- [s2s] distributed eval cleanup (#7186) · 0203ad43
  Sam Shleifer authored Sep 16, 2020
  
  0203ad43
- Formatting · 3babef81
  sgugger authored Sep 16, 2020
  
  3babef81
- use the correct add_start_docstrings (#7174) · 42049b8e
  Stas Bekman authored Sep 16, 2020
  
  42049b8e
- [s2s run_eval] new features (#7109) · fdaf8ab3
  Stas Bekman authored Sep 16, 2020
```
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
```
  fdaf8ab3
- [model_cards] antoiloui/belgpt2 🇧🇪 (#7166) · df165065
  Antoine Louis authored Sep 16, 2020
```
* Create README.md

* Update README.md
```
  df165065
- Update README (#7133) · 108c9aef
  Sylvain Gugger authored Sep 16, 2020
```
* Rewrite and update README

* Typo and migration guide

* Apply suggestions from code review
Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>

* Address Clem's comments
Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>
```
  108c9aef
- Add condition (#7161) · 9e376e15
  Donna Choi authored Sep 16, 2020
  
  9e376e15
- [doc] improve/expand the Parametrization section (#7156) · f8590c56
  Stas Bekman authored Sep 16, 2020
  
  f8590c56
- build/eval/gen-card scripts for fsmt (#7155) · d3391c87
  Stas Bekman authored Sep 16, 2020
```
* build/eval/gen-card scripts for fsmt

* adjust for model renames
```
  d3391c87
- fix the warning message of overflowed sequence (#7151) · 08bfc171
  Xi Ye authored Sep 16, 2020
  
  08bfc171
- Refactoring the TF activations functions (#7150) · af8425b7
  Julien Plu authored Sep 16, 2020
```
* Refactoring the activations functions into a common file

* Apply style

* remove unused import

* fix tests

* Fix tests.
```
  af8425b7
15 Sep, 2020 15 commits

[docs] add testing documentation (#7101) · b00cafbd

Stas Bekman authored Sep 15, 2020



* [docs] add testing documentation

* Update docs/source/testing.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* tweaks as suggested

* Update docs/source/testing.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update docs/source/testing.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update docs/source/testing.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update docs/source/testing.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update docs/source/testing.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update docs/source/testing.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update docs/source/testing.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update docs/source/testing.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update docs/source/testing.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update docs/source/testing.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update docs/source/testing.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* tweaks

* Update docs/source/testing.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update docs/source/testing.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* more tweaks

* suggestions from @LysandreJik
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

b00cafbd

fix encoder decoder kwargs (#7131) · 85ffda96
Patrick von Platen authored Sep 15, 2020

85ffda96

fix ZeroDivisionError and epoch counting (#7125) · 4c62c602

Yih-Dar authored Sep 15, 2020

* fix ZeroDivisionError and epoch counting

* Add test for num_train_epochs calculation in trainer.py

* Remove @require_non_multigpu for test_num_train_epochs_in_training

4c62c602

Create README.md · 7af2791d
Patrick von Platen authored Sep 15, 2020

7af2791d
Funnel model cards (#7147) · 153ec2f1
Sylvain Gugger authored Sep 15, 2020

153ec2f1

Multi predictions trainer (#7126) · 7186ca62

Sylvain Gugger authored Sep 15, 2020

* Allow multiple outputs

* Formatting

* Move the unwrapping before metrics

* Fix typo

* Add test for non-supported config options

7186ca62

[model_cards] pvl/labse_bert model card · 52d250f6

Pedro Lima authored Sep 15, 2020

From **Language-Agnostic BERT Sentence Embedding**

https://ai.googleblog.com/2020/08/language-agnostic-bert-sentence.html

52d250f6

Create README.md (#7097) · 84d64805
tuner007 authored Sep 15, 2020
```
Model card for PEGASUS finetuned for paraphrasing task
```
84d64805
German electra model card v3 update (#7089) · 52bb7ccc
Philip May authored Sep 15, 2020
```
* changed eval table model order

* Update install

* update mc
```
52bb7ccc
Tiny typo fix (#7143) · 1a85299a
Siddharth Jain authored Sep 15, 2020

1a85299a
Add quotes to paths in MeCab arguments (#7142) · e29c3f1b
Paul O'Leary McCann authored Sep 15, 2020
```
Without quotes directories with spaces in them will fail to be processed
correctly.
```
e29c3f1b

Fix TF Trainer loss calculation (#6998) · cb061e78

Yih-Dar authored Sep 15, 2020

* create branch for issue #6968

* First attempt to fix incorrect tf trainer loss calculation

* Fix training loss in metric

* fix tf trainer evaluation loss

* apply count_instances_in_batch() for eval and test datasets

* prototype of using a new argument in trainer_tf.py to fix loss issue

* some renaming and fix, in particular for evaluation methods

* fix bugs to have a running version

* change to @staticmethod

* apply style

cb061e78

[logging] remove no longer needed verbosity override (#7100) · b0cbcdb0
Stas Bekman authored Sep 15, 2020

b0cbcdb0
Fix reproducible tests in Trainer (#7119) · 2bf70e21
Sylvain Gugger authored Sep 15, 2020
```
* Fix reproducible tests in Trainer

* Deal with multiple GPUs
```
2bf70e21
[QOL] add signature for prepare_seq2seq_batch (#7108) · 9e89390c
Sam Shleifer authored Sep 14, 2020

9e89390c

14 Sep, 2020 5 commits

[s2s] distributed eval in one command (#7124) · 33d479d2
Sam Shleifer authored Sep 14, 2020

33d479d2
Pin version of TF and torch · 206b78d4
sgugger authored Sep 14, 2020

206b78d4

Add Mirror Option for Downloads (#6679) · 90cde2e9

Kevin Canwen Xu authored Sep 14, 2020

* Add Tuna Mirror for Downloads from China

* format fix

* Use preset instead of hardcoding URL

* Fix

* make style

* update the mirror option doc

* update the mirror

90cde2e9

Demoing LXMERT with raw images by incorporating the FRCNN model for roi-pooled... · e0e0675a

Antonio V Mendoza authored Sep 14, 2020


Demoing LXMERT with raw images by incorporating the FRCNN model for roi-pooled extraction and bounding-box predction on the GQA answer set. (#6986)

* adding demo

* Update examples/lxmert/requirements.txt
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Update examples/lxmert/checkpoint.sh
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* added user input for .py demo

* updated model loading, data extrtaction, checkpoints, and lots of other automation

* adding normalizing for bounding boxes

* Update requirements.txt

* some optimizations for extracting data

* added data extracting file

* added data extraction file

* minor fixes to reqs and readme

* Style

* remove options
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>

e0e0675a

Extra ) · 5636cbb2
sgugger authored Sep 14, 2020

5636cbb2