Commits · 7719ecd19f63876fdc2d31699977c7ced3643417 · chenpangpang / transformers

18 Sep, 2020 15 commits
- Fix a typo (#7225) · 7719ecd1
  Yuta Hayashibe authored Sep 18, 2020
  
- Create README.md (#7205) · 4a26e8ac
  Manuel Romero authored Sep 18, 2020
  
- Add customized text to widget (#7204) · 94320c5b
  Manuel Romero authored Sep 18, 2020
  
- Create README.md (#7209) · 3aefb24b
  Manuel Romero authored Sep 18, 2020
  
- Create README.md (#7210) · a22e7a8d
  Manuel Romero authored Sep 18, 2020
  
- Create README.md (#7212) · c028b264
  Manuel Romero authored Sep 18, 2020
  
- Create README.md for indobert-lite-base-p1 (#7182) · c7cdd7b4
  Genta Indra Winata authored Sep 18, 2020
  
- Create README.md for indobert-lite-large-p1 (#7184) · bfb9150b
  Genta Indra Winata authored Sep 18, 2020
```
* Create README.md

* Update README.md
```
- Create README.md (#7183) · d1935934
  Genta Indra Winata authored Sep 18, 2020
  
- Create README.md (#7185) · e65d8466
  Genta Indra Winata authored Sep 18, 2020
  
- Create README.md for indobert-large-p2 model card (#7181) · e27d86d4
  Genta Indra Winata authored Sep 18, 2020
  
- Create README.md for indobert-large-p1 model card (#7180) · 881c0783
  Genta Indra Winata authored Sep 18, 2020
  
- Create README.md (#7179) · e0d58a5c
  Genta Indra Winata authored Sep 18, 2020
  
- Create README.md for indobert-base-p2 (#7178) · 1313a1d2
  Genta Indra Winata authored Sep 18, 2020
  
- Create README.md (#7095) · cf24f43e
  tuner007 authored Sep 18, 2020
```
Create model card for Pegasus QA
```
17 Sep, 2020 16 commits

- [s2s] remove double assert (#7223) · 67d9fc50
  Sam Shleifer authored Sep 17, 2020
- [model cards] fix metadata - 3rd attempt (#7218) · edbaad2c
  Stas Bekman authored Sep 17, 2020
- skip failing FSMT CUDA tests until investigated (#7220) · 999a1c95
  Stas Bekman authored Sep 17, 2020
- [model cards] fix dataset yaml (#7216) · 51c4adf5
  Stas Bekman authored Sep 17, 2020
- [s2s] dynamic batch size with --max_tokens_per_batch (#7030) · a5638b2b
  Sam Shleifer authored Sep 17, 2020
- [s2s] run_eval/run_eval_search tweaks (#7192) · efeab6a3
  Stas Bekman authored Sep 17, 2020
```
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
```
- [model cards] fix yaml in cards (#7207) · 9c5bcab5
  Stas Bekman authored Sep 17, 2020

- Change to use relative imports in some files & Add python prompt symbols to example codes (#7202) · e643a297
  Sohee Yang authored Sep 18, 2020

* Move 'from transformers' statements to relative imports in some files

* Add python prompt symbols in front of the example codes

* Reformat the code

* Add one missing space
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

- [model cards] ported allenai Deep Encoder, Shallow Decoder models (#7153) · 0fe6e435
  Stas Bekman authored Sep 17, 2020

* [model cards] ported allenai Deep Encoder, Shallow Decoder models

* typo

* fix references

* add allenai/wmt19-de-en-6-6 model cards

* fill-in the missing info for the build script as provided by the searcher.


- [ported model] FSMT (FairSeq MachineTranslation) (#6940) · 1eeb206b
  Stas Bekman authored Sep 17, 2020

* ready for PR

* cleanup

* correct FSMT_PRETRAINED_MODEL_ARCHIVE_LIST

* fix

* perfectionism

* revert change from another PR

* odd, already committed this one

* non-interactive upload workaround

* backup the failed experiment

* store langs in config

* workaround for localizing model path

* doc clean up as in https://github.com/huggingface/transformers/pull/6956



* style

* back out debug mode

* document: run_eval.py --num_beams 10

* remove unneeded constant

* typo

* re-use bart's Attention

* re-use EncoderLayer, DecoderLayer from bart

* refactor

* send to cuda and fp16

* cleanup

* revert (moved to another PR)

* better error message

* document run_eval --num_beams

* solve the problem of tokenizer finding the right files when model is local

* polish, remove hardcoded config

* add a note that the file is autogenerated to avoid losing changes

* prep for org change, remove unneeded code

* switch to model4.pt, update scores

* s/python/bash/

* missing init (but doesn't impact the finetuned model)

* cleanup

* major refactor (reuse-bart)

* new model, new expected weights

* cleanup

* cleanup

* full link

* fix model type

* merge porting notes

* style

* cleanup

* have to create a DecoderConfig object to handle vocab_size properly

* doc fix

* add note (not a public class)

* parametrize

* - add bleu scores integration tests

* skip test if sacrebleu is not installed

* cache heavy models/tokenizers

* some tweaks

* remove tokens that aren't used

* more purging

* simplify code

* switch to using decoder_start_token_id

* add doc

* Revert "major refactor (reuse-bart)"

This reverts commit 226dad15ca6a9ef4e26178526e878e8fc5c85874.

* decouple from bart

* remove unused code #1

* remove unused code #2

* remove unused code #3

* update instructions

* clean up

* move bleu eval to examples

* check import only once

* move data+gen script into files

* reuse via import

* take less space

* add prepare_seq2seq_batch (auto-tested)

* cleanup

* recode test to use json instead of yaml

* ignore keys not needed

* use the new -y in transformers-cli upload -y

* [xlm tok] config dict: fix str into int to match definition (#7034)

* [s2s] --eval_max_generate_length (#7018)

* Fix CI with change of name of nlp (#7054)

* nlp -> datasets

* More nlp -> datasets

* Woopsie

* More nlp -> datasets

* One last

* extending to support allen_nlp wmt models

- allow a specific checkpoint file to be passed
- more arg settings
- scripts for allen_nlp models

* sync with changes

* s/fsmt-wmt/wmt/ in model names

* s/fsmt-wmt/wmt/ in model names (p2)

* s/fsmt-wmt/wmt/ in model names (p3)

* switch to a better checkpoint

* typo

* make non-optional args such - adjust tests where possible or skip when there is no other choice

* consistency

* style

* adjust header

* cards moved (model rename)

* use best custom hparams

* update info

* remove old cards

* cleanup

* s/stas/facebook/

* update scores

* s/allen_nlp/allenai/

* url maps aren't needed

* typo

* move all the doc / build /eval generators to their own scripts

* cleanup

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* fix indent

* duplicated line

* style

* use the correct add_start_docstrings

* oops

* resizing can't be done with the core approach, due to 2 dicts

* check that the arg is a list

* style

* style
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>


- Trainer multi label (#7191) · 492bb6aa
  Sylvain Gugger authored Sep 17, 2020
```
* Trainer accepts multiple labels

* Missing import

* Fix docstrings
```

- Transformer-XL: Remove unused parameters (#7087) · 70974592
  RafaelWO authored Sep 17, 2020

* Removed 'tgt_len' and 'ext_len' from Transformer-XL

 * Some changes are still to be done

* Removed 'tgt_len' and 'ext_len' from Transformer-XL (2)

 * Removed comments
 * Fixed quality

* Changed warning to info

- added multilabel text classification notebook using distilbert to community notebooks (#7201) · c183d81e
  Dhaval Taunk authored Sep 17, 2020

* added multilabel classification using distilbert notebook to community notebooks

* added multilabel classification using distilbert notebook to community notebooks


- remove deprecated flag (#7171) · 79111b77
  Stas Bekman authored Sep 17, 2020

```
/home/circleci/.local/lib/python3.6/site-packages/isort/main.py:915: UserWarning: W0501: The following deprecated CLI flags were used and ignored: --recursive!
  "W0501: The following deprecated CLI flags were used and ignored: "
```


- remove duplicated code (#7173) · 0cdafbf7
  Stas Bekman authored Sep 17, 2020
- [s2s] fix kwarg typo (#7196) · 45b0b1ff
  Sam Shleifer authored Sep 16, 2020

16 Sep, 2020 9 commits
- [s2s] distributed eval cleanup (#7186) · 0203ad43
  Sam Shleifer authored Sep 16, 2020
  
- Formatting · 3babef81
  sgugger authored Sep 16, 2020
  
- use the correct add_start_docstrings (#7174) · 42049b8e
  Stas Bekman authored Sep 16, 2020
  
- [s2s run_eval] new features (#7109) · fdaf8ab3
  Stas Bekman authored Sep 16, 2020
```
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
```
- [model_cards] antoiloui/belgpt2 🇧🇪 (#7166) · df165065
  Antoine Louis authored Sep 16, 2020
```
* Create README.md

* Update README.md
```
- Update README (#7133) · 108c9aef
  Sylvain Gugger authored Sep 16, 2020
```
* Rewrite and update README

* Typo and migration guide

* Apply suggestions from code review
Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>

* Address Clem's comments
Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>
```
- Add condition (#7161) · 9e376e15
  Donna Choi authored Sep 16, 2020
  
- [doc] improve/expand the Parametrization section (#7156) · f8590c56
  Stas Bekman authored Sep 16, 2020
  
- build/eval/gen-card scripts for fsmt (#7155) · d3391c87
  Stas Bekman authored Sep 16, 2020
```
* build/eval/gen-card scripts for fsmt

* adjust for model renames
```