Commits · 9c0afdaf7b091c341072b432ad6ee17ba7a5016b · chenpangpang / transformers

20 Nov, 2020 6 commits

fix flaky ci (#8694) · 9c0afdaf
Patrick von Platen authored Nov 20, 2020

9c0afdaf

Vectorize RepetitionPenaltyLogitsProcessor to improve performance (#8598) · 29bdb883

Binoy Dalal authored Nov 20, 2020

* refactored exisiting nested loops to vectorized implementation

* replaced explicit indexing with torch.where

* modifying score for previous input_ids only

29bdb883

moved temperature wrapper before topP/topK (#8686) · 2594bd8b
Roman Kalyakin authored Nov 20, 2020

2594bd8b

Fix rag finetuning + add finetuning test (#8585) · 8062fa63

Quentin Lhoest authored Nov 20, 2020

* replace init_ddp_connection for index init

* style

* add finetune test

* add test data

* move generate tensors to device

* add test on EM metric

* style

* allow multi process test

* keep gloo process group for retrieval

* add multi-gpu test

* use custom accelerator

* clean test finetune

* minor

* style

* style

* typo

* use python call instead of imported main fumction

* return_dict fix in modeling_rag

* use float32 in retrieval

* store as float32 as well in the custom knowledge dataset example

* style

* rename to finetune_rag

* style

* update readme

* rename utils and callbacks to utils_rag and callbacks_rag

* fix test

* patrick's comments

* generate dummy data in the finetue test script

* remove dummy data files

* style

8062fa63

Document adam betas TrainingArguments (#8688) · 63e91f5f
Sylvain Gugger authored Nov 20, 2020

63e91f5f
Update the bibtex with EMNLP demo (#8678) · 94caaa93
Kevin Canwen Xu authored Nov 20, 2020
```
* Update the bibtex with EMNLP demo

* Update README.md

* Update README.md
```
94caaa93

19 Nov, 2020 19 commits

Add sentencepiece to the CI and fix tests (#8672) · 6494910f
Sylvain Gugger authored Nov 19, 2020
```
* Fix the CI and tests

* Fix quality

* Remove that m form nowhere
```
6494910f
[examples/seq2seq] fix PL deprecation warning (#8577) · 0ad45e10
Stas Bekman authored Nov 19, 2020
```
* fix deprecation warning

* fix
```
0ad45e10

Update bert-base-multilingual-cased-README.md (#8668) · 0e19a4c2

Arindum Roy authored Nov 19, 2020

The heading was originally uncased, which did not reflect the contents of this README. Changed it to cased.

0e19a4c2

revert · 06518404
Stas Bekman authored Nov 19, 2020

06518404

Please fix your software not to ping master · 297a2938

Stas Bekman authored Nov 19, 2020

You may be unaware but you're running some software that meddles with every commit on https://github.com/huggingface/transformers/

Something is wrong with the software you're using. It adds a reference to almost every PR in the master tree. Which is very wrong. Please check your software and please don't do it again.

Example:
see the bottom of this PR and most other PRs:
https://github.com/huggingface/transformers/pull/8639

297a2938

[tokenizers] convert_to_tensors: don't reconvert when the type is already right (#8283) · 42111f1d
Stas Bekman authored Nov 19, 2020
```
* don't reconvert when the type is already right

* better name

* adjust logic as suggested

* merge
```
42111f1d
Fix run_ner script (#8664) · 20b65860
Sylvain Gugger authored Nov 19, 2020
```
* Fix run_ner script

* Pin datasets
```
20b65860

`disable_ngram_loss` fix for prophetnet (#8554) · ca0109bd

Zhylko Dima authored Nov 19, 2020



* `disable_ngram_loss` fix for prophetnet

* add changes documentation

* fix _compute_loss to use mean reduction and -100 to masked tokens & remove unnecessary arguments

* mean label smoothing loss

* small refactor

* fix test
Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>

ca0109bd

Merge remote-tracking branch 'origin/master' · 0603564e
Sylvain Gugger authored Nov 19, 2020

0603564e
Forgot to save... · 1e08af38
Sylvain Gugger authored Nov 19, 2020

1e08af38
Release: v4.0.0-rc-1 · d86b5ffc
LysandreJik authored Nov 19, 2020

d86b5ffc
Fix a few last paths for the new repo org (#8666) · cb3e5c33
Sylvain Gugger authored Nov 19, 2020

cb3e5c33

fix small typo (#8644) · a79a96dd

Matthias authored Nov 19, 2020

Fixed a small typo on the XLNet and permutation language modelling section

a79a96dd

Better filtering of the model outputs in Trainer (#8633) · 4208f496
Sylvain Gugger authored Nov 19, 2020
```
* Better filtering of the model outputs in Trainer

* Fix examples tests

* Add test for Lysandre
```
4208f496

Fix a bunch of slow tests (#8634) · f2e07e72

Lysandre Debut authored Nov 19, 2020



* CI should install `sentencepiece`

* Requiring TF

* Fixing some TFDPR bugs

* remove return_dict=False/True hack
Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>

f2e07e72

Tf longformer for sequence classification (#8231) · 5362bb8a

elk-cloner authored Nov 19, 2020



* working on LongformerForSequenceClassification

* add TFLongformerForMultipleChoice

* add TFLongformerForTokenClassification

* use add_start_docstrings_to_model_forward

* test TFLongformerForSequenceClassification

* test TFLongformerForMultipleChoice

* test TFLongformerForTokenClassification

* remove test from repo

* add test and doc for TFLongformerForSequenceClassification, TFLongformerForTokenClassification, TFLongformerForMultipleChoice

* add requested classes to modeling_tf_auto.py
update dummy_tf_objects
fix tests
fix bugs in requested classes

* pass all tests except test_inputs_embeds

* sync with master

* pass all tests except test_inputs_embeds

* pass all tests

* pass all tests

* work on test_inputs_embeds

* fix style and quality

* make multi choice work

* fix TFLongformerForTokenClassification signature

* fix TFLongformerForMultipleChoice, TFLongformerForSequenceClassification signature

* fix mult choice

* fix mc hint

* fix input embeds

* fix input embeds

* refactor input embeds

* fix copy issue

* apply sylvains changes and clean more
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

5362bb8a

fix missing return dict (#8653) · 62cd9ce9
Quentin Lhoest authored Nov 19, 2020

62cd9ce9
[model card] : fix bert-base-15lang-cased (#8655) · 0c2677f5
Amine Abdaoui authored Nov 19, 2020
```
the table was badly formatted because of a single line break
```
0c2677f5

Add cards for all Geotrend models (#8617) · 0a80959b

Amine Abdaoui authored Nov 19, 2020

* docs(bert-base-15lang-cased): add model card

* add cards for all Geotrend models

* [model cards] fix language tag for all Geotrend models

0a80959b

18 Nov, 2020 15 commits
- Updated the Extractive Question Answering code snippets (#8636) · dcc9c642
  cronoik authored Nov 19, 2020
```
* Updated the Extractive Question Answering code snippets

The Extractive Question Answering code snippets do not work anymore since the models return task-specific output objects. This commit fixes the pytorch and tensorflow examples but adding `.values()` to the model call.

* Update task_summary.rst
```
  dcc9c642
- Update README.md (#8635) · 28d16e7a
  Tim Isbister authored Nov 19, 2020
  
  28d16e7a
- grammar (#8639) · b290195a
  cronoik authored Nov 19, 2020
  
  b290195a
- [s2s] distillation apex breaks return_dict obj (#8631) · d86d57fa
  Stas Bekman authored Nov 18, 2020
```
* apex breaks return_dict obj

* style
```
  d86d57fa
- Created ModelCard for Hel-ach-en MT model (#8496) · bf3611b2
  Perez Ogayo authored Nov 18, 2020
```
* Updated ModelCard

* Apply suggestions from code review
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
```
  bf3611b2
- Create README.md (#8362) · c95b26a7
  Yifan Peng authored Nov 18, 2020
  
  c95b26a7
- Model card: T5-base fine-tuned on QuaRTz (#8369) · fdbbb6c1
  Manuel Romero authored Nov 18, 2020
```
* Model card: T5-base fine-tuned on QuaRTz

* Update model_cards/mrm8488/t5-base-finetuned-quartz/README.md
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
```
  fdbbb6c1
- Create README.md (#8363) · 6e6d24c5
  Yifan Peng authored Nov 18, 2020
  
  6e6d24c5
- Add model card for ai4bharat/indic-bert (#8464) · 35fd3d64
  Divyanshu Kakwani authored Nov 18, 2020
  
  35fd3d64
- Update README.md (#8405) · 38f01dfe
  dartrevan authored Nov 18, 2020
```
* Update README.md

* Update README.md
```
  38f01dfe
- Model Card for abhilash1910/financial_roberta (#8625) · 2d8fbf01
  Abhilash Majumder authored Nov 18, 2020
```
* Model Card for abhilash1910/financial_roberta

* Update model_cards/abhilash1910/financial_roberta/README.md
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
```
  2d8fbf01
- Update README.md (#8544) · 26dc6593
  Vishal Singh authored Nov 18, 2020
```
Modified Model in Action section. The class `AutoModelWithLMHead` is deprecated so changed it to `AutoModelForSeq2SeqLM` for encoder-decoder models. Removed duplicate eos token.
```
  26dc6593
- replace performance table with markdown (#8565) · 6c8fad4f
  smanjil authored Nov 18, 2020
```
* replace performance table with markdown

* Update model_cards/smanjil/German-MedBERT/README.md
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
```
  6c8fad4f
- model_cards for Chinese Couplet and Poem GPT2 models (#8620) · e7f77fc5
  hhou435 authored Nov 19, 2020
  
  e7f77fc5
- Fix training from scratch in new scripts (#8623) · a0c62d24
  Sylvain Gugger authored Nov 18, 2020
  
  a0c62d24