- 03 May, 2022 7 commits
-
Pavel Belevich authored
-
Sylvain Gugger authored
-
Sylvain Gugger authored
* Fix RNG reload in resume training from epoch checkpoint
* Fix test
-
Sylvain Gugger authored
-
Sylvain Gugger authored
* Make Trainer compatible with sharded checkpoints
* Add doc
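A minimal sketch of what sharded checkpoint support means in practice (model name and shard size below are illustrative, not taken from the commit):

```python
# Illustrative only: save_pretrained splits weights larger than max_shard_size
# into several files plus an index; from_pretrained (and now Trainer's
# checkpoint handling) reassembles the shards transparently.
from transformers import AutoModelForSequenceClassification

model = AutoModelForSequenceClassification.from_pretrained("bert-base-uncased")
model.save_pretrained("sharded-checkpoint", max_shard_size="200MB")

reloaded = AutoModelForSequenceClassification.from_pretrained("sharded-checkpoint")
```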
-
Yih-Dar authored
* move test model folders (TODO: fix imports and others)
* fix (potentially partially) imports (in model test modules)
* fix (potentially partially) imports (in tokenization test modules)
* fix (potentially partially) imports (in feature extraction test modules)
* fix import utils.test_modeling_tf_core
* fix path ../fixtures/
* fix imports about generation.test_generation_flax_utils
* fix more imports
* fix fixture path
* fix get_test_dir
* update module_to_test_file
* fix get_tests_dir from wrong transformers.utils
* update config.yml (CircleCI)
* fix style
* remove missing imports
* update new model script
* update check_repo
* update SPECIAL_MODULE_TO_TEST_MAP
* fix style
* add __init__
* update self-scheduled
* fix add_new_model scripts
* check one way to get location back
* python setup.py build install
* fix import in test auto
* update self-scheduled.yml
* update slack notification script
* Add comments about artifact names
* fix for yolos
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
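For context, a hedged sketch of the path helper the commit message refers to (`get_tests_dir` lives in `transformers.testing_utils`; the fixture name below is only an example):

```python
# Assumed usage: resolve fixture paths relative to the (moved) tests folder
# instead of hardcoding ../fixtures/ in each test module.
from transformers.testing_utils import get_tests_dir

FIXTURE = get_tests_dir("fixtures/test_sentencepiece.model")
print(FIXTURE)
```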
-
Sanchit Gandhi authored
* [FlaxBert] Add ForCausalLM
* make style
* fix output attentions
* Add RobertaForCausalLM
* remove comment
* fix fx-to-pt model loading
* remove comment
* add modeling tests
* add enc-dec model tests
* add big_bird
* add electra
* make style
* make repo-consistency
* add to docs
* remove roberta test
* quality
* amend cookiecutter
* fix attention_mask bug in flax bert model tester
* tighten pt-fx thresholds to 1e-5
* add 'copied from' statements
* amend 'copied from' statements
* amend 'copied from' statements
* quality
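A short, hedged usage sketch for the new Flax causal-LM heads (loading a bidirectional BERT checkpoint here is purely illustrative):

```python
from transformers import AutoTokenizer, FlaxBertForCausalLM

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = FlaxBertForCausalLM.from_pretrained("bert-base-uncased")

inputs = tokenizer("Paris is the capital of", return_tensors="np")
outputs = model(**inputs)
# Per-position vocabulary logits, usable for next-token prediction or as
# the decoder half of a Flax encoder-decoder model.
print(outputs.logits.shape)
```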
-
- 02 May, 2022 17 commits
-
Patrick von Platen authored
* [T5 Tokenizer] Model has no fixed position ids - there is no hardcoded max length
* [T5 Tokenizer] Model has no fixed position ids - there is no hardcoded max length
* correct t5 tokenizer
* correct t5 tokenizer
* fix test
* Apply suggestions from code review
* finish
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
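In practice the change means long inputs are legitimate for T5, since the model uses relative position embeddings; a hedged sketch (checkpoint name illustrative):

```python
from transformers import T5Tokenizer

tokenizer = T5Tokenizer.from_pretrained("t5-small")
long_text = "summarize: " + "some very long document " * 500

# No truncation requested: there is no architectural maximum length to
# enforce, so the tokenizer should not hard-cap the sequence.
input_ids = tokenizer(long_text).input_ids
print(len(input_ids))
```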
-
Sylvain Gugger authored
* Clean up setup.py
* Trigger CI
* Upgrade Python used
-
Lysandre Debut authored
* Make sacremoses optional
* Pickle
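The commit follows the usual optional-dependency pattern; a rough sketch (not the actual diff):

```python
# Import lazily and only fail if the feature is actually used.
try:
    import sacremoses
    _sacremoses_available = True
except ImportError:
    _sacremoses_available = False


def moses_tokenize(text, lang="en"):
    if not _sacremoses_available:
        raise ImportError("This feature requires sacremoses: pip install sacremoses")
    return sacremoses.MosesTokenizer(lang=lang).tokenize(text)
```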
-
Lysandre Debut authored
-
NielsRogge authored
* First draft
* Add YolosForObjectDetection
* Make forward pass work
* Add mid position embeddings
* Add interpolation of position encodings
* Add expected values
* Add YOLOS to tests
* Add integration test
* Support tiny model as well
* Support all models in conversion script
* Remove mid_pe_size attribute
* Make more tests pass
* Add model to README and fix config
* Add copied from statements
* Rename base_model_prefix to vit
* Add missing YOLOS_PRETRAINED_CONFIG_ARCHIVE_MAP
* Apply suggestions from code review
* Apply more suggestions from code review
* Convert remaining checkpoints
* Improve docstrings
* Add YolosFeatureExtractor
* Add feature extractor to docs
* Add corresponding tests
* Fix style
* Fix docs
* Apply suggestion from code review
* Fix bad rebase
* Fix some more bad rebase
* Fix missing character
* Improve docs and variable names
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
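A hedged inference sketch for the newly added model classes (checkpoint name and image URL are assumptions, not from the commit):

```python
import requests
from PIL import Image
from transformers import YolosFeatureExtractor, YolosForObjectDetection

url = "http://images.cocodataset.org/val2017/000000039769.jpg"
image = Image.open(requests.get(url, stream=True).raw)

feature_extractor = YolosFeatureExtractor.from_pretrained("hustvl/yolos-small")
model = YolosForObjectDetection.from_pretrained("hustvl/yolos-small")

inputs = feature_extractor(images=image, return_tensors="pt")
outputs = model(**inputs)
# One set of class logits and one (normalized) bounding box per object query.
print(outputs.logits.shape, outputs.pred_boxes.shape)
```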
-
Zachary Mueller authored
* Update all examples to properly calculate progress bar
-
Zachary Mueller authored
* Propagate and fix imports
-
calpt authored
-
NielsRogge authored
* Clean up tests
* Make fixup
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
-
Sylvain Gugger authored
-
yujun authored
* add torch.no_grad when in eval mode
* make style quality
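The underlying pattern, as a sketch rather than the exact diff: evaluation loops should not build an autograd graph.

```python
import torch


def evaluate(model, dataloader, device):
    model.eval()
    total, count = 0.0, 0
    with torch.no_grad():  # no gradients are tracked inside this block
        for batch in dataloader:
            batch = {k: v.to(device) for k, v in batch.items()}
            outputs = model(**batch)
            total += outputs.loss.item()
            count += 1
    return total / count
```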
-
Martin Pömsl authored
-
Sanchit Gandhi authored
* [FlaxSpeechEncoderDecoder] Fix bug in `decoder_module`
* [FlaxEncoderDecoder] Fix bug in `decoder_module`
-
Sylvain Gugger authored
-
Michael Benayoun authored
* Add meta proxy
* Uses meta data to trace data dependent control-flow
* Remove commented class
* Handles torch creating functions
* Added type annotation to fix tracing
* Tracing works for everything but T5 and GPT-J
* Almost all previously supported models pass
* All architectures can be traced except T5
* Intermediate commit to have a trace of the comparison operators for HFProxy
* Everything works, except loss computation
* Everything works
* Removed unused import
* Overridden methods do not use underlying ops (linear and torch.matmul), and model attributes are copied to the traced version
* Fix torch_matmul_override
* Change attributes reference to deepcopy
* Remove breakpoint and add torch_index_override
* Small fix
* Fix typo
* Replace asserts by explicit exceptions
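For orientation, a minimal sketch of the public entry point this work feeds into (`transformers.utils.fx.symbolic_trace`; the exact keyword arguments have varied across versions, so treat the call as an assumption):

```python
from transformers import BertForSequenceClassification
from transformers.utils.fx import symbolic_trace

model = BertForSequenceClassification.from_pretrained("bert-base-uncased")
# Tracing uses meta tensors to record shapes and dtypes, which lets
# data-dependent control flow be resolved during the trace.
traced = symbolic_trace(model, input_names=["input_ids", "attention_mask"])
print(traced.graph)  # a torch.fx Graph that can be inspected or rewritten
```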
-
Sanchit Gandhi authored
-
Manan Dey authored
-
- 30 Apr, 2022 2 commits
-
Omar U. Espejel authored
* Add translating guide
-
Yih-Dar authored
* Add the check
* add missing ckpts
* add a list to ignore
* call the added check script
* better regex pattern
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
- 29 Apr, 2022 9 commits
-
Sylvain Gugger authored
* Result of new doc style with fixes
* Add last two files
* Bump hf-doc-builder
-
Sylvain Gugger authored
* Replace dict/BatchEncoding instance checks by Mapping
* Typo
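The gist of the change, sketched (the function name below is illustrative): test against the abstract `Mapping` type instead of enumerating concrete dict-like classes.

```python
from collections.abc import Mapping


def move_to_device(inputs, device):
    # dict, BatchEncoding, and any other dict-like object all pass this check.
    if isinstance(inputs, Mapping):
        return {k: v.to(device) if hasattr(v, "to") else v for k, v in inputs.items()}
    return inputs
```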
-
Nicolas Patry authored
This reverts commit 4f3a14e3.
-
Nicolas Patry authored
-
tarzan authored
-
Pavel Belevich authored
-
Joao Gante authored
-
Zachary Mueller authored
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
- 28 Apr, 2022 5 commits
-
Sylvain Gugger authored
-
Zachary Mueller authored
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
amyeroberts authored
-
conan1024hao authored
* Add parameter --config_overrides for run_mlm_wwm.py
* linter
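The new flag takes a comma-separated key=value string; a hedged sketch of how such a string is applied to a config when training from scratch (values are illustrative, and `update_from_string` is the `PretrainedConfig` helper typically used for this):

```python
from transformers import AutoConfig

config = AutoConfig.from_pretrained("bert-base-uncased")
# Roughly what --config_overrides "num_hidden_layers=6,hidden_dropout_prob=0.2" does.
config.update_from_string("num_hidden_layers=6,hidden_dropout_prob=0.2")
print(config.num_hidden_layers, config.hidden_dropout_prob)
```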
-