Commits · c7cb1aa26c518257f4c88acfabd23873500754b9 · chenpangpang / transformers

09 Nov, 2020 9 commits
- Bump tokenizers (#8419) · c7cb1aa2
  Sylvain Gugger authored Nov 09, 2020
  
  c7cb1aa2
- [fsmt tokenizer] support lowercase tokenizer (#8389) · 78d706f3
  Stas Bekman authored Nov 09, 2020
```
* support lowercase tokenizer

* fix arg pos
```
  78d706f3
- Bug fix for permutation language modelling (#8409) · 1e2acd0d
  Shashank Gupta authored Nov 09, 2020
  
  1e2acd0d
- add evaluate doc - trainer.evaluate returns 'epoch' from training (#8273) · bf8625e7
  Philip May authored Nov 09, 2020
```
* add evaluate doc

* fix style with utils/style.doc

* Update src/transformers/trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
```
  bf8625e7
- examples/docs: caveat that PL examples don't work on TPU (#8309) · ebde57ac
  Sam Shleifer authored Nov 09, 2020
  
  ebde57ac
- Fix some tooling for windows (#8359) · 76e7a44d
  Julien Plu authored Nov 09, 2020
```
* Fix some tooling for windows

* Fix conflict

* Trigger CI
```
  76e7a44d
- Update README.md (#8406) · 507dfb40
  dartrevan authored Nov 09, 2020
  
  507dfb40
- updating tag for exbert viz (#8408) · 7247d0b4
  smanjil authored Nov 09, 2020
  
  7247d0b4
- comet_ml temporary fix(#8410) · 4ab5617b
  Stas Bekman authored Nov 09, 2020
  
  4ab5617b
08 Nov, 2020 7 commits
- [s2s/distill] remove run_distiller.sh, fix xsum script (#8412) · e6d9cdaa
  Sam Shleifer authored Nov 08, 2020
  
  e6d9cdaa
- [s2s test_finetune_trainer] failing multigpu test (#8400) · 66582492
  Stas Bekman authored Nov 08, 2020
  
  66582492
- [s2s examples test] fix data path (#8398) · f62755a6
  Stas Bekman authored Nov 08, 2020
  
  f62755a6
- Fix DataCollatorForWholeWordMask again (#8397) · 4a53e8e9
  Jonathan Chang authored Nov 08, 2020
  
  4a53e8e9
- fixed default labels for QA model (#8399) · 61073099
  Manav Rathod authored Nov 08, 2020
  
  61073099
- Add gpt2-medium-chinese model card (#8402) · 0b02489b
  Chengxi Guo authored Nov 08, 2020
```
* Create README.md

* Update model_cards/mymusise/gpt2-medium-chinese/README.md
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
```
  0b02489b
- fix md table (#8395) · 18755436
  Stas Bekman authored Nov 08, 2020
  
  18755436
07 Nov, 2020 2 commits

Fix DataCollatorForWholeWordMask (#8379) · 77a257fc
Jonathan Chang authored Nov 07, 2020
```
* Fix DataCollatorForWholeWordMask

* Replace all tensorize_batch in data_collator.py
```
77a257fc

[make] rewrite modified_py_files in python to be cross-platform (#8371) · 517eaf46

Stas Bekman authored Nov 07, 2020

* rewrite modified_py_files in python to be cross-platform

* try a different way to test for variable not being ""

* improve comment

517eaf46

06 Nov, 2020 19 commits
- fix encoder outputs (#8368) · 07708793
  Patrick von Platen authored Nov 06, 2020
  
  07708793
- [All Seq2Seq model + CLM models that can be used with EncoderDecoder] Add... · bc0d26d1
  Yossi Synett authored Nov 06, 2020
```
[All Seq2Seq model + CLM models that can be used with EncoderDecoder] Add cross-attention weights to outputs (#8071)

* Output cross-attention with decoder attention output

* Update src/transformers/modeling_bert.py

* add cross-attention for t5 and bart as well

* fix tests

* correct typo in docs

* add sylvains and sams comments

* correct typo
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
```
  bc0d26d1
- Update README.md (#8360) · 30f2507a
  hassoudi authored Nov 06, 2020
```
Fix websitr address
```
  30f2507a
- Fix typo (#8351) · 5807ba3f
  Jonathan Chang authored Nov 06, 2020
  
  5807ba3f
- Update README.md (#8338) · 82146496
  hassoudi authored Nov 06, 2020
```
fixes
```
  82146496
- Create README.md (#8312) · 9e5c4d39
  ktrapeznikov authored Nov 06, 2020
```
* Create README.md

* Update model_cards/ktrapeznikov/gpt2-medium-topic-news/README.md
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
```
  9e5c4d39
- Create README.md (#8255) · 06ebc379
  hasantanvir79 authored Nov 06, 2020
```
* Create README.md

Initial commit

* Updated Read me

Updated

* Apply suggestions from code review
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
```
  06ebc379
- Create README.md (#8169) · 41cd031c
  Karthik Uppuluri authored Nov 06, 2020
  
  41cd031c
- Create README.md (#8170) · f932ddef
  Karthik Uppuluri authored Nov 06, 2020
  
  f932ddef
- Create README.md (#8168) · 08b92f78
  Karthik Uppuluri authored Nov 06, 2020
```
* Create README.md

* Update README.md
```
  08b92f78
- Create README.md (#8167) · 77d62e78
  Karthik Uppuluri authored Nov 06, 2020
```
* Create README.md

Telugu BERTU Readme file

* Update model_cards/kuppuluri/telugu_bertu/README.md
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
```
  77d62e78
- Create README.md (#8327) · dd6bfcae
  Yifan Peng authored Nov 06, 2020
  
  dd6bfcae
- german medbert model details (#8266) · ddeecf08
  smanjil authored Nov 06, 2020
```
* model details

* Apply suggestions from code review
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
```
  ddeecf08
- Create README.md (#8258) · 96baaafd
  Jiaxin Pei authored Nov 06, 2020
  
  96baaafd
- [model_cards] Update Italian BERT models and introduce new Italian XXL ELECTRA model 🎉 (#8343) · 185259c2
  Stefan Schweter authored Nov 06, 2020
  
  185259c2
- Model card: GPT-2 fine-tuned on CommonGen (#8248) · 34bbf60b
  Manuel Romero authored Nov 06, 2020
  
  34bbf60b
- Model card: CodeBERT fine-tuned for Insecure Code Detection (#8247) · 973218fd
  Manuel Romero authored Nov 06, 2020
```
* Model card: CodeBERT fine-tuned for Insecure Code Detection

* Update model_cards/mrm8488/codebert-base-finetuned-detect-insecure-code/README.md
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
```
  973218fd
- Model card: T5-base fine-tuned on QuaRel (#8334) · f833ca41
  Manuel Romero authored Nov 06, 2020
  
  f833ca41
- [s2s] test_bash_script.py - actually learn something (#8318) · 9edafaeb
  Stas Bekman authored Nov 05, 2020
```
* use decorator

* remove hardcoded paths

* make the test use more data and do real quality tests

* shave off 10 secs

* add --eval_beams 2, reformat

* reduce train size, use smaller custom dataset
```
  9edafaeb
05 Nov, 2020 3 commits

Docs bart training ref (#8330) · 17450397
Leandro von Werra authored Nov 05, 2020
```
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
```
17450397
[s2s] test_distributed_eval (#8315) · d787935a
Stas Bekman authored Nov 05, 2020
```
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
```
d787935a

Make Trainer evaluation handle dynamic seq_length (#8336) · 04e442d5

Sylvain Gugger authored Nov 05, 2020

* Make Trainer evaluation handle dynamic seq_length

* Document behavior.

* Fix test

* Better fix

* Fixes for realsies this time

* Address review comments

* Without forgetting to save...

04e442d5