Commits · ea8ffe36d3c97376f39a462820ad241c8dc52b95 · chenpangpang / transformers

12 Aug, 2021 5 commits

Proper import for unittest.mock.patch (#13085) · ea8ffe36
Sylvain Gugger authored Aug 12, 2021

ea8ffe36

Kamal Raj authored Aug 12, 2021



* TFDeberta

moved weights to build and fixed name scope

added missing ,

bug fixes to enable graph mode execution

updated setup.py

fixing typo

fix imports

embedding mask fix

added layer names avoid autmatic incremental names

+XSoftmax

cleanup

added names to layer

disable keras_serializable
Distangled attention output shape hidden_size==None
using symbolic inputs

test for Deberta tf

make style

Update src/transformers/models/deberta/modeling_tf_deberta.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

Update src/transformers/models/deberta/modeling_tf_deberta.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

Update src/transformers/models/deberta/modeling_tf_deberta.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

Update src/transformers/models/deberta/modeling_tf_deberta.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

Update src/transformers/models/deberta/modeling_tf_deberta.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

Update src/transformers/models/deberta/modeling_tf_deberta.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

Update src/transformers/models/deberta/modeling_tf_deberta.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

removed tensorflow-probability

removed blank line

* removed tf experimental api
+torch_gather tf implementation from @Rocketknight1

* layername DeBERTa --> deberta

* copyright fix

* added docs for TFDeberta & make style

* layer_name change to fix load from pt model

* layer_name change as pt model

* SequenceClassification layername change,
to same as pt model

* switched to keras built-in LayerNormalization

* added `TFDeberta` prefix most layer classes

* updated to tf.Tensor in the docstring

d329b633

Fix VisualBert Embeddings (#13017) · c4e1586d
Gunjan Chhablani authored Aug 12, 2021

c4e1586d

Doctests job (#13088) · 53b38d62

Lysandre Debut authored Aug 12, 2021



* Doctests

* Limit to 4 decimals

* Try with separate PT/TF tests

* Remove test for TF

* Ellips the predictions

* Doctest continue on failure
Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>

53b38d62

Fix classifier dropout in AlbertForMultipleChoice (#13087) · 3f52c685

Ibraheem Moosa authored Aug 12, 2021

Classification head of AlbertForMultipleChoice uses `hidden_dropout_prob` instead of `classifier_dropout_prob`. This
is not desirable as we cannot change classifer head dropout probability without changing the dropout probabilities of
the whole model.

3f52c685

11 Aug, 2021 3 commits

Install git (#13091) · c89180a9

Lysandre Debut authored Aug 11, 2021



* Install git

* Add TF tests

* And last TF test

* Add in commented code too
Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>

c89180a9

Add VisualBERT demo notebook (#12263) · c71f73f4

Gunjan Chhablani authored Aug 11, 2021

* Initialize VisualBERT demo

* Update demo

* Add commented URL

* Update README

* Update README

c71f73f4

[Doctest] Setup, quicktour and task_summary (#13078) · 83424ade

Sylvain Gugger authored Aug 11, 2021

* Fix doctests for quicktour

* Adapt causal LM exemple

* Remove space

* Fix until summarization

* End of task summary

* Style

* With last changes in quicktour

83424ade

10 Aug, 2021 12 commits

Fix last one · bfc88509
Sylvain Gugger authored Aug 10, 2021

bfc88509

Use original key for label in DataCollatorForTokenClassification (#13057) · 29dada00

Ibraheem Moosa authored Aug 10, 2021

* Use original key for label in DataCollatorForTokenClassification

DataCollatorForTokenClassification accepts either `label` or `labels` as key for label in it's input. However after padding the label it assigns the padded labels to key `labels`. If originally `label` was used as key than the original upadded labels still remains in the batch. Then at line 192 when we try to convert the batch elements to torch tensor than these original unpadded labels cannot be converted as the labels for different samples have different lengths.

* Fixed style.

29dada00

Revert to all tests whil we debug what's wrong (#13072) · 95e2e14f
Sylvain Gugger authored Aug 10, 2021

95e2e14f
Trigger GPU tests · 477480ce
Sylvain Gugger authored Aug 10, 2021

477480ce
Fix fallback of test_fetcher (#13071) · 0dad5d82
Sylvain Gugger authored Aug 10, 2021

0dad5d82
Merge branch 'master' of github.com:huggingface/transformers · 4dd85724
Sylvain Gugger authored Aug 10, 2021

4dd85724
Try fecthing the last two commits · bd5593b6
Sylvain Gugger authored Aug 10, 2021

bd5593b6

Roll out the test fetcher on push tests (#13055) · 9e9b8f1d

Sylvain Gugger authored Aug 10, 2021

* Use test fetcher for push tests as well

* Force diff with last commit for circleCI on master

* Fix syntax error

* Style

* Schedule nightly tests

9e9b8f1d

Pin sacrebleu · 2e0d767a
Sylvain Gugger authored Aug 10, 2021

2e0d767a
Fix ModelOutput instantiation form dictionaries (#13067) · 0454e4bd
Sylvain Gugger authored Aug 10, 2021
```
* Fix ModelOutput instantiation form dictionaries

* Style
```
0454e4bd

docs: add HuggingArtists to community notebooks (#13050) · 3157fa3c

Aleksey Korshuk authored Aug 10, 2021



* Adding HuggingArtists to Community Notebooks

* Adding HuggingArtists to Community Notebooks

* Adding HuggingArtists to Community Notebooks

* docs: add HuggingArtists to community notebooks
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

3157fa3c

Add try-except for torch_scatter (#13040) · ab7551cd
Kevin Canwen Xu authored Aug 10, 2021
```
* Add try-catch for torch_scatter

* Update modeling_tapas.py
```
ab7551cd

09 Aug, 2021 6 commits

replace tgt_lang by tgt_text (#13061) · 76cadb79
SaulLu authored Aug 09, 2021

76cadb79
Documentation for patch v4.9.2 · a8bf2fa7
Lysandre authored Aug 09, 2021

a8bf2fa7

Add to ONNX docs (#13048) · 5008e088

Lysandre Debut authored Aug 09, 2021



* Add to ONNX docs

* Add MBART example

* Update docs/source/serialization.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

5008e088

Add MBART to models exportable with ONNX (#13049) · 6f5ab9da
Lysandre Debut authored Aug 09, 2021
```
* Add MBART to models exportable with ONNX

* unittest mock

* Add tests

* Misc fixes
```
6f5ab9da

[Flax] Refactor gpt2 & bert example docs (#13024) · 13a9c9a3

Patrick von Platen authored Aug 09, 2021



* fix_torch_device_generate_test

* remove @

* improve docs for clm

* speed-ups

* correct t5 example as well

* push final touches

* Update examples/flax/language-modeling/README.md

* correct docs for mlm

* Update examples/flax/language-modeling/README.md
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

13a9c9a3

tfhub.de -> tfhub.dev (#12565) · 3ff2cde5
abhishek thakur authored Aug 09, 2021

3ff2cde5

08 Aug, 2021 2 commits
- Update README.md · 24cbf6bc
  Patrick von Platen authored Aug 08, 2021
  
  24cbf6bc
- Use min version for huggingface-hub dependency (#12961) · 7390d9de
  lewtun authored Aug 08, 2021
```
* Use min version for huggingface-hub dependency

* Update dependency version table
```
  7390d9de
06 Aug, 2021 6 commits

Tpu tie weights (#13030) · 7fcee113

Sylvain Gugger authored Aug 06, 2021

* Fix tied weights on TPU

* Manually tie weights in no trainer examples

* Fix for test

* One last missing

* Gettning owned by my scripts

* Address review comments

* Fix test

* Fix tests

* Fix reformer tests

7fcee113

Put smaller ALBERT model (#13028) · 1bf38611
Lysandre Debut authored Aug 06, 2021

1bf38611

T5 with past ONNX export (#13014) · dc420b0e

Michael Benayoun authored Aug 06, 2021



T5 with past ONNX export, and more explicit past_key_values inputs and outputs names for ONNX model
Authored-by: Michael Benayoun <michael@huggingface.co>

dc420b0e

FX submodule naming fix (#13016) · ee112246

Michael Benayoun authored Aug 06, 2021



Changed the way dynamically inserted submodules are named and the method used to insert them
Authored-by: Michael Benayoun <michael@huggingface.co>

ee112246

[WIP] Disentangle auto modules from other modeling files (#13023) · 9870093f

Sylvain Gugger authored Aug 06, 2021

* Initial work

* All auto models

* All tf auto models

* All flax auto models

* Tokenizers

* Add feature extractors

* Fix typos

* Fix other typo

* Use the right config

* Remove old mapping names and update logic in AutoTokenizer

* Update check_table

* Fix copies and check_repo script

* Fix last test

* Add back name

* clean up

* Update template

* Update template

* Forgot a )

* Use alternative to fixup

* Fix TF model template

* Address review comments

* Address review comments

* Style

9870093f

[Flax T5] Speed up t5 training (#13012) · 2e408236

Patrick von Platen authored Aug 06, 2021



* fix_torch_device_generate_test

* remove @

* update

* up

* fix

* remove f-stings

* correct readme

* up
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

2e408236

05 Aug, 2021 4 commits
- [Flax] Correct pt to flax conversion if from base to head (#13006) · 60e448c8
  Patrick von Platen authored Aug 05, 2021
```
* finish PR

* add tests

* correct tests

* finish

* correct other flax tests

* better naming

* correct naming

* finish

* apply sylvains suggestions
```
  60e448c8
- Replace // operator with / operator + long() (#13013) · 33929448
  Nils Reimers authored Aug 05, 2021
  
  33929448
- GPT-Neo ONNX export (#12911) · a6d62aab
  Michael Benayoun authored Aug 05, 2021
```
GPT-Neo ONNX export and task / feature refactoring
Authored-by: Michael Benayoun <michael@huggingface.co>
```
  a6d62aab
- Create perplexity.rst (#13004) · 8aa01d2a
  Sasha Luccioni authored Aug 05, 2021
```
Updating the import for load_dataset
```
  8aa01d2a
04 Aug, 2021 2 commits

Add BEiT (#12994) · 83e5a106

NielsRogge authored Aug 04, 2021



* First pass

* Make conversion script work

* Improve conversion script

* Fix bug, conversion script working

* Improve conversion script, implement BEiTFeatureExtractor

* Make conversion script work based on URL

* Improve conversion script

* Add tests, add documentation

* Fix bug in conversion script

* Fix another bug

* Add support for converting masked image modeling model

* Add support for converting masked image modeling

* Fix bug

* Add print statement for debugging

* Fix another bug

* Make conversion script finally work for masked image modeling models

* Move id2label for datasets to JSON files on the hub

* Make sure id's are read in as integers

* Add integration tests

* Make style & quality

* Fix test, add BEiT to README

* Apply suggestions from @sgugger's review

* Apply suggestions from code review

* Make quality

* Replace nielsr by microsoft in tests, add docs

* Rename BEiT to Beit

* Minor fix

* Fix docs of BeitForMaskedImageModeling
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

83e5a106

Skip ProphetNet test (#12462) · 0dd1152c
Lysandre Debut authored Aug 04, 2021

0dd1152c