Commits · 2085f2090121c8e33c2d174d522569955db728b1 · chenpangpang / transformers

18 Jan, 2022 14 commits

Fix a sneaky reference to compute_loss in the tests · 2085f209
matt authored Jan 18, 2022

2085f209

[Fix doc example] Wrong checkpoint name (#15079) · 979ca24e

Yih-Dar authored Jan 18, 2022



* fix doc example - MarianForCausalLM example

* try to keep copies

* fix copies

* fix more similar doc examples

* fix more

* fix style
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

979ca24e

fix: #14486 do not use BertPooler in DPR (#15068) · 7b3d4df4

PaulLerner authored Jan 18, 2022



* fix: #14486 do not use BertPooler in DPR

* fix tf dpr as well

* finish
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

7b3d4df4

Add MAE (#15120) · 74bec986

NielsRogge authored Jan 18, 2022

* First draft

* More improvements

* More improvements

* More improvements

* Fix embeddings

* Add conversion script

* Finish conversion script

* More improvements

* Fix forward pass

* Remove print statements

* Add weights initialization

* Add initialization of decoder weights

* Add support for other models in the conversion script

* Fix patch_size for huge model

* Fix most of the tests

* Fix integration test

* Fix docs

* Fix archive_list

* Apply suggestions from code review

* Improve documentation

* Apply more suggestions

* Skip some tests due to non-deterministic behaviour

* Fix test_initialization

* Remove unnecessary initialization of nn.Embedding

* Improve docs

* Fix dummies

* Remove ViTMAEFeatureExtractor from docs

* Add model to README and table of contents

* Delete inference file

74bec986

[MBartTokenizer] remove dep on xlm-roberta tokenizer (#15201) · 2ae3be54
Suraj Patil authored Jan 18, 2022

2ae3be54
Ignore empty subfolders when identifying submodules (#15204) · 84c60a7b
Sylvain Gugger authored Jan 18, 2022

* Ignore empty subfolders when identifying submodules

* Update utils/check_inits.py

84c60a7b
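
As a rough illustration of the idea named in this commit title (not the actual code in `utils/check_inits.py`), the sketch below walks a package directory and skips any subfolder that contains no `.py` files. The function names `find_submodules` and `demo`, and the directory layout, are hypothetical.

```python
import tempfile
from pathlib import Path
from typing import List


def find_submodules(package_root: Path) -> List[str]:
    """Return dotted names of subfolders that contain at least one .py file."""
    submodules = []
    for folder in sorted(p for p in package_root.rglob("*") if p.is_dir()):
        relative_parts = folder.relative_to(package_root).parts
        # Skip private folders such as __pycache__.
        if any(part.startswith("_") for part in relative_parts):
            continue
        # Ignore empty subfolders: no Python files means nothing to register.
        if not any(folder.glob("*.py")):
            continue
        submodules.append(".".join(relative_parts))
    return submodules


def demo() -> List[str]:
    """Build a throwaway package tree with one empty folder and scan it."""
    with tempfile.TemporaryDirectory() as tmp:
        root = Path(tmp)
        (root / "models" / "bert").mkdir(parents=True)
        (root / "models" / "__init__.py").write_text("")
        (root / "models" / "bert" / "modeling_bert.py").write_text("")
        (root / "models" / "leftover").mkdir()  # empty folder, should be ignored
        return find_submodules(root)
```

Calling `demo()` returns only the folders that hold Python files, so the empty `leftover` folder is not reported as a submodule.
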
Remove dependency to quiet Dependabot (#15205) · 6f0a9b41
Sylvain Gugger authored Jan 18, 2022

6f0a9b41
[ASR pipeline] correct with lm pipeline (#15200) · 497346d0
Patrick von Platen authored Jan 18, 2022

* [ASR pipeline] correct with lm pipeline

* improve error

497346d0
Copies and docstring styling (#15202) · 1144d336
Sylvain Gugger authored Jan 18, 2022

* Style docstrings when making/checking copies

* Polish

1144d336

Fix deprecation warnings for int div (#15180) · 531336bb

Sylvain Gugger authored Jan 18, 2022



* Fix deprecation warnings for int div
Co-authored-by: mgoldey <matthew.goldey@gmail.com>

* Fix import

* ensure that tensor output is python scalar

* make backward compatible

* make code more readable

* adapt test functions
Co-authored-by: mgoldey <matthew.goldey@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

531336bb
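
For context, and assuming the warnings in question are the ones PyTorch emitted for `//` on tensors, the underlying issue is that integer division can round in two ways that differ for negative operands. The plain-Python sketch below shows that difference; the helper names are hypothetical and this is not the code from the commit.

```python
import math


def div_trunc(a: int, b: int) -> int:
    """Integer division that rounds toward zero."""
    return math.trunc(a / b)


def div_floor(a: int, b: int) -> int:
    """Integer division that rounds toward negative infinity (Python's `//`)."""
    return a // b


# Compare both rounding modes on a positive and a negative dividend.
pairs = [(7, 2), (-7, 2)]
results = {pair: (div_trunc(*pair), div_floor(*pair)) for pair in pairs}
```

The two modes agree for `(7, 2)` and disagree for `(-7, 2)`. In PyTorch, the rounding mode can be stated explicitly with `torch.div(a, b, rounding_mode="floor")` or `rounding_mode="trunc"`.
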

Error when code examples are improperly closed (#15186) · f6d3fee8
Sylvain Gugger authored Jan 18, 2022

f6d3fee8

Add REALM (#13292) · 22454ae4

Li-Huai (Allan) Lin authored Jan 18, 2022



* REALM initial commit

* Retriever OK (Update new_gelu).

* Encoder prediction score OK

* Encoder pretrained model OK

* Update retriever comments

* Update docs, tests, and imports

* Prune unused models

* Make embedder as a module `RealmEmbedder`

* Add RealmRetrieverOutput

* Update tokenization

* Pass all tests in test_modeling_realm.py

* Prune RealmModel

* Update docs

* Add training test.

* Remove completed TODO

* Style & Quality

* Prune `RealmModel`

* Fixup

* Changes:
1. Remove RealmTokenizerFast
2. Update docstrings
3. Add a method to RealmTokenizer to handle candidates tokenization.

* Fix up

* Style

* Add tokenization tests

* Update `from_pretrained` tests

* Apply suggestions

* Style & Quality

* Copy BERT model

* Fix comment to avoid docstring copying

* Make RealmBertModel private

* Fix bug

* Style

* Basic QA

* Save

* Complete reader logits

* Add searcher

* Complete searcher & reader

* Move block records init to constructor

* Fix training bug

* Add some outputs to RealmReader

* Add finetuned checkpoint variable names parsing

* Fix bug

* Update REALM config

* Add RealmForOpenQA

* Update convert_tfrecord logits

* Fix bugs

* Complete imports

* Update docs

* Update naming

* Add brute-force searcher

* Pass realm model tests

* Style

* Exclude RealmReader from common tests

* Fix

* Fix

* convert docs

* up

* up

* more make style

* up

* upload

* up

* Fix

* Update src/transformers/__init__.py

* adapt testing

* change modeling code

* fix test

* up

* up

* up

* correct more

* make retriever work

* update

* make style

* finish main structure

* Resolve merge conflict

* Make everything work

* Style

* Fixup

* Fixup

* Update training test

* fix retriever

* remove hardcoded path

* Fix

* Fix modeling test

* Update model links

* Initial retrieval test

* Fix modeling test

* Complete retrieval tests

* Fix

* style

* Fix tests

* Fix docstring example

* Minor fix of retrieval test

* Update license headers and docs

* Apply suggestions from code review

* Style

* Apply suggestions from code review

* Add an example to RealmEmbedder

* Fix
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

22454ae4

[Fix doc example] TFRagModel (#15187) · b25067d8

Yih-Dar authored Jan 18, 2022



* fix doc example - NameError: name 'PATH' is not defined

* fix name 'TFRagModel' is not defined

* correct TFRagRagSequenceForGeneration

* fix name 'tf' is not defined

* fix style
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

b25067d8

`is_ctc` needs to be updated to `self.type == "ctc"`. (#15194) · dea563c9
Nicolas Patry authored Jan 18, 2022

* `is_ctc` needs to be updated to `self.type == "ctc"`.

* Adding fast test for this functionality.

dea563c9

17 Jan, 2022 5 commits
- [Fix doc example] UniSpeechSatForPreTraining (#15152) · 32090c72
  Yih-Dar authored Jan 18, 2022

  * fix doc example - cannot import name 'UniSpeechSatFeatureEncoder'

  * fix ckpt name
  Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

  32090c72
- Mark bad tokenizers version (#15188) · 6f8e644f
  Sylvain Gugger authored Jan 17, 2022
  
  6f8e644f
- [doc] new MoE paper (#15184) · edd3fce2
  Stas Bekman authored Jan 17, 2022

  add new paper

  edd3fce2
- Fix dtype issue in TF BART (#15178) · 9a2dabae
  Matt authored Jan 17, 2022
  
  9a2dabae
- Added forward pass of test_inference_image_classification_head with torch.no_grad() (#14777) · 0167edc8
  MrinalTyagi authored Jan 17, 2022
  
  0167edc8
16 Jan, 2022 1 commit
- [Speech models] Disable non-existing chunking in tests (#15163) · 7a787c68
  Patrick von Platen authored Jan 16, 2022
  
  7a787c68
15 Jan, 2022 1 commit
- [doc] performance: Efficient Software Prebuilds (#15147) · 669e3c50
  Stas Bekman authored Jan 14, 2022

  * Efficient Software Prebuilds

  * improve

  669e3c50
14 Jan, 2022 11 commits

update from keras2onnx to tf2onnx (#15162) · ebc4edfe
Joao Gante authored Jan 14, 2022

ebc4edfe

Better dummies (#15148) · 1b730c3d

Sylvain Gugger authored Jan 14, 2022

* Better dummies

* See if this fixes the issue

* Fix quality

* Style

* Add doc for DummyObject

1b730c3d

Fixing flaky test (hopefully). (#15154) · b212ff9f
Nicolas Patry authored Jan 14, 2022

* Fixing flaky test (hopefully).

* tf compliant.

b212ff9f
TF Bert inference - support `np.ndarray` optional arguments (#15074) · 7d9a33fb
Joao Gante authored Jan 14, 2022

* TF Bert inference - support np.ndarray optional arguments

* apply np input tests to all TF architectures

7d9a33fb

Add "open in hf spaces" gradio button issue #73 (#15106) · 4663c609

AK391 authored Jan 14, 2022

* update XLMProphetNet link

* update DPR link

* change prophetnet link

* change link MBART

* change link GPT

* update gpt2 link

* ctrl update link

* update Transformer-XL link

* Update Reformer link

* update xlnet link

* bert update link

* update albert link

* roberta update link

* update distilbert link

* update convbert link

* update XLM link

* xlm roberta update link

* update Flaubert link

* update electra link

* update funnel transformer and longformer

* bart update link

* pegasus update link

* update marianmt link

* t5 update link

* mt5 update link

4663c609

Update test_configuration_common.py (#15160) · 735d2bb6
novice authored Jan 14, 2022

735d2bb6
fix BertTokenizerFast `tokenize_chinese_chars` arg (#15158) · 51d7ebf2
SaulLu authored Jan 14, 2022

* add new test

* fix in init

* more relevant test

51d7ebf2
fix doc example - object has no attribute 'lm_logits' (#15143) · 4aa16fce
Yih-Dar authored Jan 14, 2022

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

4aa16fce
Make sure all submodules are properly registered (#15144) · 7cbf8429
Sylvain Gugger authored Jan 14, 2022

* Make sure all submodules are properly registered

* Try to fix tests

* Fix tests

7cbf8429
add TF glu activation function (#15146) · c4f7eb12
Joao Gante authored Jan 14, 2022

c4f7eb12

Check the repo consistency in model templates test (#15141) · 5f3c57fc

Sylvain Gugger authored Jan 14, 2022

* Check the repo consistency in model templates test

* Fix doc template

* Fix docstrings

* Fix last docstring

5f3c57fc

13 Jan, 2022 8 commits

Remove assert on optional arg · 96881729
Sylvain Gugger authored Jan 13, 2022

96881729
[deepspeed tests] fix summarization (#15149) · 1eb40338
Stas Bekman authored Jan 13, 2022

1eb40338

Enable AMP for xla:gpu device in trainer class (#15022) · 6e058e84

Yanming Wang authored Jan 13, 2022

* Multiple fixes of trainer class with XLA GPU

* Make fp16 valid for xla:gpu

* Add mark_step in should_log to reduce compilation overhead

6e058e84

Update model_sharing.mdx (#15142) · 3fc221d0
Carlos Aguayo authored Jan 13, 2022

Fix typo

3fc221d0

Deprecates AdamW and adds `--optim` (#14744) · 7b83feb5

Manuel R. Ciosici authored Jan 13, 2022



* Add AdamW deprecation warning

* Add --optim to Trainer

* Update src/transformers/optimization.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/optimization.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/optimization.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/optimization.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/training_args.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/training_args.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/training_args.py

* fix style

* fix

* Regroup adamws together
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Change --adafactor to --optim adafactor

* Use Enum for optimizer values

* fixup! Change --adafactor to --optim adafactor

* fixup! Change --adafactor to --optim adafactor

* fixup! Change --adafactor to --optim adafactor

* fixup! Use Enum for optimizer values

* Improved documentation for --adafactor
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Add mention of no_deprecation_warning
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Rename OptimizerOptions to OptimizerNames

* Use choices for --optim

* Move optimizer selection code to a function and add a unit test

* Change optimizer names

* Rename method
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Rename method
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Remove TODO comment
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Rename variable
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Rename variable
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Rename function

* Rename variable

* Parameterize the tests for supported optimizers

* Refactor

* Attempt to make tests pass on CircleCI

* Add a test with apex

* rework to add apex to parameterized; add actual train test

* fix import when torch is not available

* fix optim_test_params when torch is not available

* fix optim_test_params when torch is not available

* re-org

* small re-org

* fix test_fused_adam_no_apex

* Update src/transformers/training_args.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/training_args.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/training_args.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Remove .value from OptimizerNames

* Rename optimizer strings s|--adam_|--adamw_|

* Also rename Enum options

* small fix

* Fix instantiation of OptimizerNames. Remove redundant test

* Use ExplicitEnum instead of Enum

* Add unit test with string optimizer

* Change optimizer default to string value
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Stas Bekman <stas@stason.org>

7b83feb5
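
The bullets above mention an `--optim` flag restricted to a set of choices, an `OptimizerNames` enum, and a default stored as a string. A minimal sketch of that pattern with the standard library follows. It is not the `transformers` implementation: the enum values are assumptions inferred from the `adamw_` renaming bullet, and `build_parser` and `parse_optim` are hypothetical helpers.

```python
import argparse
from enum import Enum
from typing import List


class OptimizerNames(str, Enum):
    """Illustrative optimizer identifiers; the values are assumptions."""

    ADAMW_HF = "adamw_hf"
    ADAMW_TORCH = "adamw_torch"
    ADAFACTOR = "adafactor"


def build_parser() -> argparse.ArgumentParser:
    parser = argparse.ArgumentParser()
    parser.add_argument(
        "--optim",
        default=OptimizerNames.ADAMW_HF.value,  # default kept as a plain string
        choices=[name.value for name in OptimizerNames],
        help="Which optimizer to use.",
    )
    return parser


def parse_optim(argv: List[str]) -> OptimizerNames:
    """Parse command-line arguments and return the selected enum member."""
    args = build_parser().parse_args(argv)
    # Convert the validated string into the corresponding enum member.
    return OptimizerNames(args.optim)
```

With this layout, `parse_optim(["--optim", "adafactor"])` yields `OptimizerNames.ADAFACTOR`, and an unsupported name is rejected by `argparse` before any optimizer is constructed.
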

[examples/flax/language-modeling] set loglevel (#15129) · 762416ff
Stas Bekman authored Jan 13, 2022

762416ff
fix doc example - AssertionError: has to be configured as a decoder. (#15124) · 74837171
Yih-Dar authored Jan 13, 2022

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

74837171

doc-builder -> doc-build (#15134) · 6950ccec

Lysandre Debut authored Jan 13, 2022



* Updated script

* Commit everything

* Ready for review!

* Update .github/workflows/build_documentation.yml
Co-authored-by: Julien Chaumond <julien@huggingface.co>

6950ccec