Commits · 895d394669d37669e1a83e62b84ed269f94121d5 · chenpangpang / transformers

31 Aug, 2020 3 commits
- TF Flaubert w/ pre-norm (#6841) · 895d3946
  Lysandre Debut authored Aug 31, 2020
  
  895d3946
- Set default logging level to `WARNING` instead of `INFO` · 4561f05c
  Lysandre authored Aug 31, 2020
  
  4561f05c
- Patch logging issue · 05c32141
  Lysandre authored Aug 31, 2020
  
  05c32141
30 Aug, 2020 6 commits
- [s2s README] Add more dataset download instructions (#6737) · dfa10a41
  Sam Shleifer authored Aug 30, 2020
  
  dfa10a41
- clearly indicate shuffle=False (#6312) · 32fe4408
  xujiaze13 authored Aug 30, 2020
```
* Clarify shuffle

* clarify shuffle
Co-authored-by: Kevin Canwen Xu <canwenxu@126.com>
```
  32fe4408
- BR_BERTo model card (#6793) · 0eecacea
  Rodolfo De Nadai authored Aug 30, 2020
  
  0eecacea
- Add model card for singbert lite. Update widget for singbert and singbert-large. (#6827) · d176aaad
  Zane Lim authored Aug 30, 2020
  
  d176aaad
- Fixed open in colab link (#6825) · a5847619
  Thomas Ashish Cherian authored Aug 30, 2020
  
  a5847619
- [tests] fix typos in inputs (#6818) · 563485bf
  Stas Bekman authored Aug 30, 2020
  
  563485bf
29 Aug, 2020 3 commits
- [bart] rename self-attention -> attention (#6708) · 22933e66
  Sam Shleifer authored Aug 29, 2020
  
  22933e66
- Pegasus finetune script: add --adafactor (#6811) · 0f58903b
  Sam Shleifer authored Aug 29, 2020
  
  0f58903b
- [s2s] round runtime in run_eval (#6798) · ac47458a
  Sam Shleifer authored Aug 29, 2020
  
  ac47458a
28 Aug, 2020 9 commits

- [s2s] Test hub configs in self-scheduled CI (#6809) · 5ab21b07
  Sam Shleifer authored Aug 28, 2020
  
  5ab21b07
- t5 model should make decoder_attention_mask (#6800) · 3cac867f
  Sam Shleifer authored Aug 28, 2020
  
  3cac867f
- Fix style (#6803) · 20f77864
  Sam Shleifer authored Aug 28, 2020
  
  20f77864
- prepare_seq2seq_batch makes labels/ decoder_input_ids made later. (#6654) · 9336086a
  Sam Shleifer authored Aug 28, 2020
  
  * broken test
  * batch parity
  * tests pass
  * boom boom
  * boom boom
  * split out bart tokenizer tests
  * fix tests
  * boom boom
  * Fixed dataset bug
  * Fix marian
  * Undo extra
  * Get marian working
  * Fix t5 tok tests
  * Test passing
  * Cleanup
  * better assert msg
  * require torch
  * Fix mbart tests
  * undo extra decoder_attn_mask change
  * Fix import
  * pegasus tokenizer can ignore src_lang kwargs
  * unused kwarg test cov
  * boom boom
  * add todo for pegasus issue
  * cover one word translation edge case
  * Cleanup
  * doc
  
  9336086a
- Transformer-XL: Improved tokenization with sacremoses (#6322) · cb276b41
  RafaelWO authored Aug 28, 2020
  
  * Improved tokenization with sacremoses
    * The TransfoXLTokenizer is now using sacremoses for tokenization
    * Added tokenization of comma-separated and floating point numbers.
    * Removed prepare_for_tokenization() from tokenization_transfo_xl.py because punctuation is handled by sacremoses
    * Added corresponding tests
    * Removed test comparing TransfoXLTokenizer and TransfoXLTokenizerFast
    * Added deprecation warning to TransfoXLTokenizerFast
  * isort change
  
  Co-authored-by: Teven <teven.lescao@gmail.com>
  Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
  
  cb276b41
- Add ProtBert model card (#6764) · 930153e7
  Ahmed Elnaggar authored Aug 28, 2020
  
  930153e7
- [style] set the minimal required version for `black` (#6784) · 743d131d
  Stas Bekman authored Aug 27, 2020
  
  `make style` with `black` < 20.8b1 is a no go (in case some other package forced a lower version) - so make it explicit to avoid confusion
  
  743d131d
- PL: --adafactor option (#6776) · fb78a90d
  Sam Shleifer authored Aug 27, 2020
  
  fb78a90d
- [transformers-cli] fix logger getter (#6777) · 92ac2fa7
  Stas Bekman authored Aug 27, 2020
  
  92ac2fa7
27 Aug, 2020 12 commits

- Format · 42fddacd
  Lysandre authored Aug 27, 2020
  
  42fddacd
- new Makefile target: docs (#6510) · 70fccc5c
  Stas Bekman authored Aug 27, 2020
  
  * [doc] multiple corrections to "Summary of the tasks"
  * add a new "docs" target to validate docs and document it
  * fix mixup
  
  70fccc5c
- [test schedulers] adjust to test the first step's reading (#6429) · dbfe34f2
  Stas Bekman authored Aug 27, 2020
```
* [test schedulers] small improvement

* cleanup
```
  dbfe34f2
- [testing] replace hardcoded paths to allow running tests from anywhere (#6523) · e6b811f0
  Stas Bekman authored Aug 27, 2020
```
* [testing] replace hardcoded paths to allow running tests from anywhere

* fix the merge conflict
```
  e6b811f0
- add nlp install (#6767) · 9d1b4db2
  Sam Shleifer authored Aug 27, 2020
  
  9d1b4db2
- Fix it to work with BART (#6756) · c225e872
  Tom Grek authored Aug 27, 2020
  
  c225e872
- Format · 0d2c111a
  Lysandre authored Aug 27, 2020
  
  0d2c111a
- Fix the TF Trainer gradient accumulation and the TF NER example (#6713) · 6f289dc9
  Julien Plu authored Aug 27, 2020
  
  * Align TF NER example over the PT one
  * Fix Dataset call
  * Fix gradient accumulation training
  * Apply style
  * Address Sylvain's comments
  * Address Sylvain's comments
  * Apply style
  
  6f289dc9
- Adafactor docs (#6765) · 41aa2b4e
  Lysandre Debut authored Aug 27, 2020
  
  41aa2b4e
- Add AdaFactor optimizer from fairseq (#6722) · 971d1802
  Nikolai Yakovenko authored Aug 27, 2020
  
  * AdaFactor optimizer ported from fairseq. Tested for T5 finetuning and MLM -- reduced memory consumption compared to ADAM.
  * update PR fixes, add basic test
  * bug -- incorrect params in test
  * bugfix -- import Adafactor into test
  * bugfix -- removed accidental T5 include
  * resetting T5 to master
  * bugfix -- include Adafactor in __init__
  * longer loop for adafactor test
  * remove double error class declare
  * lint
  * black
  * isort
  * Update src/transformers/optimization.py
    Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
  * single docstring
  * Cleanup docstring
  
  Co-authored-by: Nikolai Y <nikolai.yakovenko@point72.com>
  Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
  
  971d1802
- s2s distillation uses AutoModelForSeqToSeqLM (#6761) · 4bd7be9a
  Sam Shleifer authored Aug 26, 2020
  
  4bd7be9a
- create ProtBert-BFD model card. (#6724) · 05e7150a
  Ahmed Elnaggar authored Aug 27, 2020
  
  05e7150a
26 Aug, 2020 7 commits
- [s2s] run_eval.py QOL improvements and cleanup (#6746) · 61518e2d
  Sam Shleifer authored Aug 26, 2020
  
  61518e2d
- Model Card for Multilingual Passage Reranking BERT (#6755) · 434936f3
  Igli Manaj authored Aug 27, 2020
  
  434936f3
- add __init__.py to utils (#6754) · 10a34501
  Joe Davison authored Aug 26, 2020
  
  10a34501
- Model card for kuisailab/albert-large-arabic (#6730) · 61b9ed80
  Ali Safaya authored Aug 27, 2020
```
* Create README.md

* Update README.md
```
  61b9ed80
- Model card for kuisailab/albert-xlarge-arabic (#6731) · 8e0d51e4
  Ali Safaya authored Aug 27, 2020
```
* Create README.md

* Update README.md
```
  8e0d51e4
- Model card for kuisailab/albert-base-arabic (#6729) · 70c96a10
  Ali Safaya authored Aug 27, 2020
```
* Create README.md

* Update README.md
```
  70c96a10
- added model card for codeswitch-spaeng-sentiment-analysis-lince (#6727) · cc4ba79f
  Sagor Sarker authored Aug 27, 2020
```
* added model card for codeswitch-spaeng-sentiment-analysis-lince model also update other model card

* fixed typo

* fixed typo

* fixed typo

* fixed typo

* fixed typo

* fixed typo

* fixed typo

* Update README.md
```
  cc4ba79f