Commits · 04976a32dc555667afa994e8f918cbee88d84a4f · chenpangpang / transformers

20 Sep, 2021 4 commits

Fix mT5 documentation (#13639) · 04976a32
Ayaka Mikazuki authored Sep 20, 2021
```
* Fix MT5 documentation

The abstract is incomplete

* MT5 -> mT5
```
04976a32
[Fix]Make sure the args tb_writer passed to the TensorBoardCallback works (#13636) · fe379f85
Chengjiang Li authored Sep 20, 2021

fe379f85

Gunjan Chhablani authored Sep 20, 2021



* Init FNet

* Update config

* Fix config

* Update model classes

* Update tokenizers to use sentencepiece

* Fix errors in model

* Fix defaults in config

* Remove position embedding type completely

* Fix typo and take only real numbers

* Fix type vocab size in configuration

* Add projection layer to embeddings

* Fix position ids bug in embeddings

* Add minor changes

* Add conversion script and remove CausalLM vestiges

* Fix conversion script

* Fix conversion script

* Remove CausalLM Test

* Update checkpoint names to dummy checkpoints

* Add tokenizer mapping

* Fix modeling file and corresponding tests

* Add tokenization test file

* Add PreTraining model test

* Make style and quality

* Make tokenization base tests work

* Update docs

* Add FastTokenizer tests

* Fix fast tokenizer special tokens

* Fix style and quality

* Remove load_tf_weights vestiges

* Add FNet to  main README

* Fix configuration example indentation

* Comment tokenization slow test

* Fix style

* Add changes from review

* Fix style

* Remove bos and eos tokens from tokenizers

* Add tokenizer slow test, TPU transforms, NSP

* Add scipy check

* Add scipy availabilty check to test

* Fix tokenizer and use correct inputs

* Remove remaining TODOs

* Fix tests

* Fix tests

* Comment Fourier Test

* Uncomment Fourier Test

* Change to google checkpoint

* Add changes from review

* Fix activation function

* Fix model integration test

* Add more integration tests

* Add comparison steps to MLM integration test

* Fix style

* Add masked tokenization fix

* Improve mask tokenization fix

* Fix index docs

* Add changes from review

* Fix issue

* Fix failing import in test

* some more fixes

* correct fast tokenizer

* finalize

* make style

* Remove additional tokenization logic

* Set do_lower_case to False

* Allow keeping accents

* Fix tokenization test

* Fix FNet Tokenizer Fast

* fix tests

* make style

* Add tips to FNet docs
Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>

d8049331

fix typo (#13647) · 87d5057d
Suraj Patil authored Sep 20, 2021

87d5057d

17 Sep, 2021 9 commits

Fix GPT2Config parameters in GPT2ModelTester (#13630) · b518aaf1
calpt authored Sep 17, 2021

b518aaf1
Updated tiny distilbert models (#13631) · 300ee0c7
Lysandre Debut authored Sep 17, 2021

300ee0c7
fix some docstring in encoder-decoder models (#13611) · afb07a79
Yih-Dar authored Sep 17, 2021
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
afb07a79
Cloned tensors after indexing in _compute_attn_output_with_global_indices (#13613) · 19b7acdd
Alessandro Suglia authored Sep 17, 2021
```
Co-authored-by: Alessandro Suglia <asuglia@fb.com>
```
19b7acdd
Use `config_dict_or_path` for deepspeed.zero.Init (#13614) · ce32c69c
Alex Hedges authored Sep 17, 2021

ce32c69c

Removed console spam from misfiring warnings (#13625) · 0eb02871

Matt authored Sep 17, 2021

* Removed misfiring warnings

* Revert "Removed misfiring warnings"

This reverts commit cea90de325056b9c1cbcda2bd2613a785c1639ce.

* Retain the warning, but only when the user actually overrides things

* Fix accidentally breaking just about every model on the hub simultaneously

* Style pass

0eb02871

Fix special tokens not correctly tokenized (#13489) · da8beaaf

Li-Huai (Allan) Lin authored Sep 17, 2021

* Fix special tokens not correctly tokenized

* Add testing

* Fix

* Fix

* Use user workflows instead of directly assigning variables

* Enable test of fast tokenizers

* Update test of canine tokenizer

da8beaaf

[Trainer] Add nan/inf logging filter (#13619) · 1f9dcfc1

Patrick von Platen authored Sep 17, 2021

* finish

* add test

* push

* remove unnecessary code

* up

* correct test

* Update src/transformers/training_args.py

1f9dcfc1

Optimize Token Classification models for TPU (#13096) · eae7a96b

Ibraheem Moosa authored Sep 17, 2021

* Optimize Token Classification models for TPU

As per the XLA document XLA cannot handle masked indexing well. So token classification
models for BERT and others use an implementation based on `torch.where`. This implementation
works well on TPU. 

ALBERT token classification model uses the masked indexing which causes performance issues
on TPU. This PR fixes this issue by following the BERT implementation.

* Same fix for ELECTRA

* Same fix for LayoutLM

eae7a96b

16 Sep, 2021 11 commits
- XLMR tokenizer is fully picklable (#13577) · e02ed0ee
  Benjamin Davidson authored Sep 16, 2021
```
* made tokenizer fully picklable

* remove whitespace

* added testcase
```
  e02ed0ee
- Properly use test_fetcher for examples (#13604) · af5c6ae5
  Sylvain Gugger authored Sep 16, 2021
```
* Properly use test_fetcher for examples

* Fake example modification

* Fake modeling file modification

* Clean fake modifications

* Run example tests for any modification.
```
  af5c6ae5
- [deepspeed] replaced deprecated init arg (#13587) · bec2e3f5
  Stas Bekman authored Sep 16, 2021
```
* [deepspeed] replaced deprecated init arg

* Trigger CI
```
  bec2e3f5
- Feature Extractor: Wav2Vec2 & Speech2Text - Allow truncation + padding=longest (#13600) · 4d5b4c78
  Patrick von Platen authored Sep 16, 2021
```
* correct

* add tests

* Update src/transformers/feature_extraction_sequence_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
```
  4d5b4c78
- DataCollatorForTokenClassification numpy fix (#13609) · e5904168
  Matt authored Sep 16, 2021
```
* Fix issue when labels are supplied as Numpy array instead of list

* Fix issue when labels are supplied as Numpy array instead of list

* Fix same issue in the `TokenClassification` data collator

* Style pass
```
  e5904168
- Fix make fix-copies with type annotations (#13586) · 88dbbfb2
  Sylvain Gugger authored Sep 16, 2021
  
  88dbbfb2
- Fix test (#13608) · cec1c636
  Lysandre Debut authored Sep 16, 2021
  
  cec1c636
- Fix DataCollatorForSeq2Seq when labels are supplied as Numpy array instead of list (#13582) · 5c593718
  Matt authored Sep 16, 2021
```
* Fix issue when labels are supplied as Numpy array instead of list

* Fix issue when labels are supplied as Numpy array instead of list
```
  5c593718
- finish (#13593) · 421929b5
  Patrick von Platen authored Sep 16, 2021
  
  421929b5
- correct (#13585) · b5bab710
  Patrick von Platen authored Sep 16, 2021
  
  b5bab710
- [ci] nightly: add deepspeed master (#13589) · 89da1bfe
  Stas Bekman authored Sep 15, 2021
  
  89da1bfe
15 Sep, 2021 4 commits
- [Pretrained Model] Add resize_position_embeddings (#13559) · 95f933ea
  Patrick von Platen authored Sep 15, 2021
```
* finish

* delete bogus file

* correct some stuff

* finish

* finish
```
  95f933ea
- upgrade sentencepiece version (#13564) · c783e148
  elishowk authored Sep 15, 2021
  
  c783e148
- Fix GPTNeo onnx export (#13524) · e86c02ea
  Suraj Patil authored Sep 15, 2021
```
Update GPT Neo ONNX config to match the changes implied by the simplification of the local attention
Co-authored-by: Michael Benayoun <michael@huggingface.co>
```
  e86c02ea
- [Flax] Fixes typo in Bart based Flax Models (#13565) · 3fbb55c7
  Bhadresh Savani authored Sep 15, 2021
  
  3fbb55c7
14 Sep, 2021 8 commits

Fix test_fetcher when setup is updated (#13566) · 7bd16b87
Sylvain Gugger authored Sep 14, 2021
```
* Fix test_fetcher when setup is updated

* Remove example
```
7bd16b87
separate model card git push from the rest (#13514) · 054b6013
elishowk authored Sep 14, 2021
```
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
```
054b6013
Fix yml syntax error · 9f318be3
Sylvain Gugger authored Sep 14, 2021

9f318be3
Add checks to build cleaner model cards (#13542) · 801ec115
Sylvain Gugger authored Sep 14, 2021
```
* Add checks to build cleaner model cards

* Address review comments
```
801ec115

[Flax] Addition of FlaxPegasus (#13420) · c1e47bf4

Bhadresh Savani authored Sep 14, 2021



* added initial files

* fixes pipeline

* fixes style and quality

* fixes doc issue and positional encoding

* fixes layer norm and test

* fixes quality issue

* fixes code quality

* removed extra layer norm

* added layer norm back in encoder and decoder

* added more code copy quality checks

* update tests

* Apply suggestions from code review

* fix import

* fix test
Co-authored-by: patil-suraj <surajp815@gmail.com>

c1e47bf4

add flax mbart in auto seq2seq lm (#13560) · fc3551a6
Suraj Patil authored Sep 14, 2021

fc3551a6

Push to hub when saving checkpoints (#13503) · 3081d386

Sylvain Gugger authored Sep 14, 2021

* Push to hub when saving checkpoints

* Add model card

* Revert partial model card

* Small fix for checkpoint

* Add tests

* Add documentation

* Fix tests

* Bump huggingface_hub

* Fix test

3081d386

Add long overdue link to the Google TRC project (#13501) · 51e5eca6

Avital Oliver authored Sep 14, 2021



* Add long-overdue link to the Google TRC project

* Apply suggestions from code review
Co-authored-by: Suraj Patil <surajp815@gmail.com>
Co-authored-by: Stefan Schweter <stefan@schweter.it>

51e5eca6

13 Sep, 2021 4 commits
- Nightly torch ci (#13550) · 3ab0185b
  Lysandre Debut authored Sep 13, 2021
```
* Nightly CI torch

* Version

* Reformat

* Only subset
Fix

* Revert

* Better formatting

* New channel
```
  3ab0185b
- return attention mask in int32 (#13543) · 5c14fcea
  Patrick von Platen authored Sep 13, 2021
  
  5c14fcea
- Small changes in `perplexity.rst`to make the notebook executable on google collaboratory (#13541) · 149c833b
  SaulLu authored Sep 13, 2021
```
* add imports

* Update docs/source/perplexity.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
```
  149c833b
- [tokenizer] use use_auth_token for config (#13523) · f1c22dae
  Stas Bekman authored Sep 13, 2021
```
* [tokenizer] use use_auth_token for config

* args order
```
  f1c22dae