- 19 Mar, 2020 1 commit
-
-
mataney authored
* Solve a bug where, for a small number of epochs and a large gradient_accumulation_steps, we never train * Black formatting * No need to change these files
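The accumulation arithmetic behind this bug is easy to reproduce. The sketch below is an illustrative reconstruction, not the repository's exact fix; the tiny model, data, and the step-on-last-batch guard are placeholder choices.

```python
# Illustrative reconstruction of the bug: with fewer batches than
# gradient_accumulation_steps, the modulo test alone never fires, so
# optimizer.step() would never run without the extra last-batch condition.
import torch

model = torch.nn.Linear(8, 1)
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
gradient_accumulation_steps = 16
batches = [torch.randn(4, 8) for _ in range(5)]  # fewer batches than accumulation steps

for step, batch in enumerate(batches):
    loss = model(batch).pow(2).mean() / gradient_accumulation_steps
    loss.backward()
    if (step + 1) % gradient_accumulation_steps == 0 or step == len(batches) - 1:
        optimizer.step()       # without the second condition, this never executes here
        optimizer.zero_grad()
```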
-
- 17 Mar, 2020 4 commits
-
-
J.P Lee authored
* Update examples/ner/run_ner.py to use AutoModel * Fix missing code and apply `make style` command
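A minimal sketch of the AutoModel-style loading this entry moves run_ner.py towards; the checkpoint name and label count below are placeholders, not the example's defaults.

```python
from transformers import AutoConfig, AutoModelForTokenClassification, AutoTokenizer

model_name = "bert-base-cased"  # any supported checkpoint
config = AutoConfig.from_pretrained(model_name, num_labels=9)  # e.g. CoNLL-2003 label set
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForTokenClassification.from_pretrained(model_name, config=config)
```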
-
Nathan Raw authored
* ✨ Alter base pl transformer to use automodels
* 🐛 Add batch size env variable to function call
* 💄 Apply black code style from Makefile
* 🚚 Move lightning base out of ner directory
* ✨ Add lightning glue example
* 💄 self
* Move _feature_file to base class
* ✨ Move eval logging to custom callback
* 💄 Apply black code style
* 🐛 Add parent to pythonpath, remove copy command
* 🐛 Add missing max_length kwarg
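As a rough illustration of two of the points above (batch size from an environment variable, eval logging moved to a custom callback), here is a hedged sketch against the current pytorch_lightning Callback API; the variable and class names are illustrative, not the example's.

```python
import os
import pytorch_lightning as pl

# Batch size taken from an environment variable, with a fallback default.
train_batch_size = int(os.environ.get("TRAIN_BATch_SIZE".upper(), 32))

class LoggingCallback(pl.Callback):
    def on_validation_end(self, trainer, pl_module):
        # Print the metrics Lightning collected during validation.
        for key, value in sorted(trainer.callback_metrics.items()):
            print(f"{key} = {value}")
```
-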
Patrick von Platen authored
* Change do_sample back * None is a better default than a boolean * Adapt do_sample to True in test example * Make style
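For context, a small sketch of what toggling do_sample changes at the call site, using GPT-2 as a stand-in model; the prompt and length are arbitrary.

```python
from transformers import GPT2LMHeadModel, GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")

input_ids = tokenizer.encode("The weather is", return_tensors="pt")
# do_sample=True switches from greedy decoding to sampling
output = model.generate(input_ids, do_sample=True, max_length=30)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```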
-
Thomas Wolf authored
* Memory benchmark RSS
* Have both forward pass and line-by-line mem tracing
* Cleaned up tracing
* Refactored and cleaned up API
* No f-strings yet...
* Add GPU mem logging
* Fix GPU memory monitoring
* Style and quality
* Clean up and doc
* Update with comments
* Switching to Python 3.6+
* Fix quality
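The RSS-based measurement mentioned above can be illustrated with psutil directly; this is a toy sketch of the idea, not the benchmarking utility's API.

```python
import psutil
import torch

def rss_mb() -> float:
    # Resident set size of the current process, in megabytes.
    return psutil.Process().memory_info().rss / 1024 ** 2

model = torch.nn.Linear(1024, 1024)
x = torch.randn(64, 1024)

before = rss_mb()
with torch.no_grad():
    _ = model(x)
print(f"forward pass used ~{rss_mb() - before:.1f} MB RSS")
```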
-
- 16 Mar, 2020 1 commit
-
-
Sam Shleifer authored
* Remove unused kwargs * Don't call forward in tests
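A short illustration of why the tests call the module rather than its forward method: `__call__` runs the hooks and other nn.Module machinery that a bare `forward()` skips. The module below is a placeholder.

```python
import torch

model = torch.nn.Linear(4, 2)
x = torch.randn(1, 4)

out = model(x)            # preferred: goes through __call__ and registered hooks
# out = model.forward(x)  # works, but bypasses hooks; this is what the tests avoid
```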
-
- 13 Mar, 2020 3 commits
-
-
Patrick von Platen authored
-
Patrick von Platen authored
-
dependabot[bot] authored
Bumps [psutil](https://github.com/giampaolo/psutil) from 5.6.3 to 5.6.6.
- [Release notes](https://github.com/giampaolo/psutil/releases)
- [Changelog](https://github.com/giampaolo/psutil/blob/master/HISTORY.rst)
- [Commits](https://github.com/giampaolo/psutil/compare/release-5.6.3...release-5.6.6)
Signed-off-by: dependabot[bot] <support@github.com>
-
- 12 Mar, 2020 1 commit
-
-
Sam Shleifer authored
* Update bart example docs
-
- 11 Mar, 2020 1 commit
-
-
Patrick von Platen authored
-
- 10 Mar, 2020 1 commit
-
-
Shubham Agarwal authored
* 1. seqeval is required by the NER PL example; install it from examples/requirements. 2. Fix unrecognized argument: save_steps
* PL checkpoint callback FileNotFoundError: create the directory and pass it in
* #3159 PL checkpoint path difference
* 1. Updated README for PL 2. The PL script now also displays logs correctly 3. Pass GPU ids rather than the number of GPUs
* Updated results in README
* 1. Updated README 2. Removed deprecated PL methods 3. Finalized scripts
* Comment length check
* Use the deprecated validation_end for stable results
* Style-related changes
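The FileNotFoundError fix amounts to creating the checkpoint directory before handing it to Lightning. A hedged sketch against the current pytorch_lightning API follows; the path and monitored metric are placeholders.

```python
import os
from pytorch_lightning.callbacks import ModelCheckpoint

output_dir = "./pl_checkpoints"
os.makedirs(output_dir, exist_ok=True)  # avoids FileNotFoundError when the checkpoint is written

checkpoint_callback = ModelCheckpoint(dirpath=output_dir, save_top_k=1, monitor="val_loss")
# The callback is then passed to pl.Trainer(callbacks=[checkpoint_callback], ...).
```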
-
- 09 Mar, 2020 2 commits
-
-
Sam Shleifer authored
-
Lysandre authored
closes #3183
-
- 05 Mar, 2020 3 commits
-
-
Sam Shleifer authored
* improved documentation
-
sshleifer authored
-
sshleifer authored
-
- 03 Mar, 2020 2 commits
-
-
Sam Shleifer authored
* Rename and improve example
* Add test
* Slightly faster test
* Style
* This probably breaks remy
* Shorter test string
* No slow
* New dir structure
* New tree
* Style
* Shorter
* Docs
* Clean
* Attempt future import
* More import hacks
-
Davide Fiocco authored
That's the same fix applied in https://github.com/huggingface/transformers/issues/2258 , but for the GLUE example
-
- 02 Mar, 2020 1 commit
-
-
Victor SANH authored
* fix n_gpu count when no_cuda flag is activated * someone was left behind
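The device logic in question looks roughly like the sketch below (`no_cuda` stands in for the parsed argument): with --no_cuda the GPU count must be forced to 0 rather than read from torch.cuda.device_count().

```python
import torch

no_cuda = True  # stands in for args.no_cuda

device = torch.device("cpu" if no_cuda or not torch.cuda.is_available() else "cuda")
n_gpu = 0 if no_cuda else torch.cuda.device_count()  # the fix: don't count GPUs that won't be used
print(device, n_gpu)
```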
-
- 01 Mar, 2020 3 commits
-
-
Julien Chaumond authored
-
VictorSanh authored
-
VictorSanh authored
-
- 27 Feb, 2020 1 commit
-
-
srush authored
* changes to allow for tpu training * black * tpu * tpu
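For readers unfamiliar with TPU training, a rough torch_xla sketch of what it involves is shown below. It needs a TPU runtime to run and is illustrative only, not the Lightning example's actual code.

```python
import torch
import torch_xla.core.xla_model as xm

device = xm.xla_device()                    # the TPU core assigned to this process
model = torch.nn.Linear(10, 2).to(device)
optimizer = torch.optim.SGD(model.parameters(), lr=0.01)

x = torch.randn(8, 10).to(device)
loss = model(x).sum()
loss.backward()
xm.optimizer_step(optimizer, barrier=True)  # TPU-aware replacement for optimizer.step()
```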
-
- 26 Feb, 2020 3 commits
-
-
Martin Malmsten authored
-
Martin Malmsten authored
-
Andrew Walker authored
-
- 25 Feb, 2020 1 commit
-
-
Jhuo IH authored
-
- 24 Feb, 2020 1 commit
-
-
Patrick von Platen authored
Add a preprocessing step for transfo-xl tokenization to avoid tokenizing words followed by punctuation as <unk> (#2987)
* Add preprocessing to add a space before punctuation for transfo_xl
* Improve warning messages
* Make style
* Compile the regex at instantiation of the tokenizer object
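An illustrative version of the preprocessing idea: insert a space before punctuation so that e.g. "world!" is not mapped to <unk>, and compile the pattern once, as at tokenizer instantiation. The regex below is a simplification, not the tokenizer's exact pattern.

```python
import re

# Compiled once, e.g. when the tokenizer object is created.
PUNCT_BEFORE = re.compile(r"(?<=\S)([.,!?;:])")

def add_space_before_punct(text: str) -> str:
    # "Hello, world!" -> "Hello , world !"
    return PUNCT_BEFORE.sub(r" \1", text)

print(add_space_before_punct("Hello, world!"))
```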
-
- 23 Feb, 2020 3 commits
-
-
Martin Malmsten authored
-
Martin Malmsten authored
-
Martin Malmsten authored
* Added support for Albert in NER pipeline
* Added command-line options to examples/ner/run_ner.py to better control tokenization
* Added class AlbertForTokenClassification
* Changed output for NerPipeline to use .convert_ids_to_tokens(...) instead of .decode(...) to better reflect tokens
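The last point deserves a tiny illustration: decode() merges word pieces back into a string, while convert_ids_to_tokens() keeps one entry per token, which is what per-token NER labels line up with. The checkpoint and example word below are arbitrary.

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-cased")
ids = tokenizer.encode("Huggingface", add_special_tokens=False)

print(tokenizer.decode(ids))                 # merged string, e.g. "Huggingface"
print(tokenizer.convert_ids_to_tokens(ids))  # one entry per token, e.g. ['Hu', '##gging', '##face']
```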
-
- 22 Feb, 2020 1 commit
-
-
saippuakauppias authored
-
- 21 Feb, 2020 3 commits
-
-
Patrick von Platen authored
* Improving generation
* Finalized special token behaviour for no_beam_search generation
* Solved modeling_utils merge conflict
* Solve merge conflicts in modeling_utils.py
* Add run_generation improvements from PR #2749
* Adapted language generation to not use a hardcoded -1 if no padding token is available
* Remove the -1 removal as hardcoded -1s are not necessary anymore
* Add lightweight language generation testing for randomly initialized models - just checking whether no errors are thrown
* Add slow language generation tests for pretrained models using hardcoded output with a PyTorch seed
* Delete ipdb
* Check that all generated tokens are valid
* Renaming
* Renaming Generation -> Generate
* Make style
* Updated so that generate_beam_search has the same token behavior as generate_no_beam_search
* Consistent return format for run_generation.py
* Deleted pretrain lm generate tests -> will be added in another PR
* Cleaning of unused if statements and renaming
* run_generate will always return an iterable
* Make style
* Consistent renaming
* Improve naming, make sure the generate function always returns the same tensor, add docstring
* Add slow tests for all lmhead models
* Make style and improve example comments in modeling_utils
* Better naming and refactoring in modeling_utils
* Changed fast random lm generation testing design to a more general one
* Delete old testing design in gpt2
* Correct old variable name
* Temporary fix for encoder_decoder lm generation tests - has to be updated when t5 is fixed
* Adapted all fast random generate tests to the new design
* Better warning description in modeling_utils
* Better comment
* Better comment and error message
Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>
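One recurring theme above is how generation handles a missing padding token. The sketch below shows the usual fallback to the EOS id for GPT-2 through the public generate API; it is illustrative rather than the PR's code.

```python
from transformers import GPT2LMHeadModel, GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")

# GPT-2 defines no pad token, so fall back to EOS instead of a hardcoded value.
pad_id = tokenizer.pad_token_id if tokenizer.pad_token_id is not None else tokenizer.eos_token_id

input_ids = tokenizer.encode("The meaning of life is", return_tensors="pt")
output = model.generate(input_ids, max_length=20, do_sample=False, pad_token_id=pad_id)
print(tokenizer.decode(output[0]))
```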
-
maximeilluin authored
* Added CamembertForQuestionAnswering * fixed camembert tokenizer case
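A minimal usage sketch of the new head against the current transformers API; the checkpoint name is a placeholder, and without fine-tuning the randomly initialized QA head will not give meaningful answers.

```python
import torch
from transformers import CamembertForQuestionAnswering, CamembertTokenizer

tokenizer = CamembertTokenizer.from_pretrained("camembert-base")
model = CamembertForQuestionAnswering.from_pretrained("camembert-base")  # QA head is untrained here

question = "Qui a écrit Les Misérables ?"
context = "Victor Hugo a écrit Les Misérables."
inputs = tokenizer(question, context, return_tensors="pt")

outputs = model(**inputs)
start = int(torch.argmax(outputs.start_logits))
end = int(torch.argmax(outputs.end_logits))
print(tokenizer.decode(inputs["input_ids"][0][start : end + 1]))
```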
-
Martin Malmsten authored
-
- 20 Feb, 2020 3 commits
-
-
Sam Shleifer authored
* Results same as fairseq * Wrote a ton of tests * Struggled with api signatures * added some docs
-
srush authored
-
srush authored
* Initial PyTorch Lightning commit
* Tested multi-GPU
* Fix learning rate schedule
* Black formatting
* Fix flake8
* isort
* isort
* .
Co-authored-by: Check your git settings! <chris@chris-laptop>
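A bare-bones LightningModule in the spirit of this initial commit, showing where the learning-rate schedule plugs in via configure_optimizers; the tiny model, optimizer, and scheduler choices are placeholders, not the example's.

```python
import torch
import pytorch_lightning as pl

class TinyClassifier(pl.LightningModule):
    def __init__(self):
        super().__init__()
        self.layer = torch.nn.Linear(16, 2)

    def training_step(self, batch, batch_idx):
        x, y = batch
        loss = torch.nn.functional.cross_entropy(self.layer(x), y)
        self.log("train_loss", loss)
        return loss

    def configure_optimizers(self):
        optimizer = torch.optim.AdamW(self.parameters(), lr=3e-4)
        scheduler = torch.optim.lr_scheduler.LinearLR(optimizer)  # the learning-rate schedule
        return [optimizer], [scheduler]
```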
-
- 18 Feb, 2020 1 commit
-
-
VictorSanh authored
-