Commits · 4c01231e67f0d699e0236c11178c956fb9753a17 · chenpangpang / transformers

24 Feb, 2023 1 commit
- Generate - update cookie cutters to not initialize cache with training and... · 440f3975
  Joao Gante authored Feb 24, 2023
```
Generate - update cookie cutters to not initialize cache with training and gradient checkpointing (#21759)
```
  440f3975
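
As a rough illustration of the rule named in this commit title, the sketch below disables the key/value cache when a model is both training and using gradient checkpointing. The helper `resolve_use_cache` and its arguments are hypothetical and are not taken from the template code; only the rule itself comes from the commit title.

```python
import warnings


def resolve_use_cache(use_cache: bool, training: bool, gradient_checkpointing: bool) -> bool:
    """Return the cache flag to actually use (hypothetical helper, not library code)."""
    if gradient_checkpointing and training and use_cache:
        # Caching is dropped rather than raising, so training can continue.
        warnings.warn("use_cache=True is incompatible with gradient checkpointing; using use_cache=False.")
        return False
    return use_cache
```

Outside training, or without gradient checkpointing, the caller's `use_cache` value is returned unchanged.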
14 Feb, 2023 1 commit
- Final cleanup of TOKENIZER_FOR_DOC (#21565) · 68b21b37
  Sylvain Gugger authored Feb 14, 2023
```
Final cleanup of TOKENIZER_FOR_DOC
```
  68b21b37
07 Feb, 2023 2 commits

Sylvain Gugger authored Feb 07, 2023

* Remove mentions of flake8/isort

* Clean up inits

* Deal with all other inits

* Last special rule for dummy files

67d07487

[CI ] Remove `past` in favor of `past_key_values` (#21443) · 12eb528b

Arthur authored Feb 07, 2023

* fix past renamed to past_key_value

* update more `past` that were skipped

* fixup

* remove changes made to rag

* refactor `_reorder_cache` to use `past_key_values`

* fix git `prepare_inputs_for_generation` to pass tests when false is needed in use_cache

12eb528b

19 Jan, 2023 1 commit
- Flax dtype-dependent numerical masking (#21197) · cbaaa2f6
  Joao Gante authored Jan 19, 2023
  
  cbaaa2f6
09 Jan, 2023 1 commit
- Patch-past-refactor (#21050) · e3ecbaa4
  Arthur authored Jan 09, 2023
```
* small patches, forgot a line

* refactor PT

* the actual fix
```
  e3ecbaa4
08 Jan, 2023 1 commit

Replace `past` with `past_key_values` (#20944) · f0577df6

Arthur authored Jan 08, 2023

* start cleanup

* more updates

* more models are affected

* more updates

* update generation utils

* style

* revert change that removed reorder cache

* update generation utils

* style

* style

* remove reorder cache

f0577df6

03 Jan, 2023 1 commit
- Generate: delete unused TF `_reorder_cache` (#20964) · 4fd89e49
  Joao Gante authored Jan 03, 2023
  
  4fd89e49
27 Dec, 2022 1 commit
- fix docs typos in "add_new_model" (#20900) · e35bc46a
  Eli Simhayev authored Dec 27, 2022
```
fix Jupyter typos
```
  e35bc46a
08 Dec, 2022 1 commit

Fix CIs for PyTorch 1.13 (#20686) · e3cc4487

Yih-Dar authored Dec 08, 2022



* fix 1

* fix 2

* fix 3

* fix 4
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

e3cc4487

05 Dec, 2022 1 commit

Cleanup some config attributes (#20554) · 9ffbed26

Yih-Dar authored Dec 05, 2022



* Remove is_encoder_decoder from some vision models

* cleanup more

* cleanup more
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

9ffbed26

30 Nov, 2022 1 commit
- Update doc examples feature extractor -> image processor (#20501) · 17a7b49b
  amyeroberts authored Nov 30, 2022
```
* Update doc example feature extractor -> image processor

* Apply suggestions from code review
```
  17a7b49b
28 Nov, 2022 1 commit

More TF int dtype fixes (#20384) · de4159a3

Matt authored Nov 28, 2022

* Add a test to ensure int dummy inputs are int64

* Move the test into the existing int64 test and update a lot of existing dummies

* Fix remaining dummies

* Fix remaining dummies

* Test for int64 serving sigs as well

* Update core tests to use tf.int64

* Add better messages to the assertions

* Update all serving sigs to int64

* More sneaky hiding tf.int32s

* Add an optional int32 signature in save_pretrained

* make fixup

* Add Amy's suggestions

* Switch all serving sigs back to tf.int32

* Switch all dummies to tf.int32

* Adjust tests to check for tf.int32 instead of tf.int64

* Fix base dummy_inputs dtype

* Start casting to tf.int32 in input_processing

* Change dtype for unpack_inputs test

* Add proper tf.int32 test

* Make the alternate serving signature int64

de4159a3

09 Nov, 2022 1 commit

Generate: move generation_*.py src files into generation/*.py (#20096) · f270b960

Joao Gante authored Nov 09, 2022

* move generation_*.py src files into generation/*.py

* populate generation.__init__ with lazy loading

* move imports and references from generation.xxx.object to generation.object

f270b960

18 Oct, 2022 1 commit
- check decoder_inputs_embeds is None before shifting labels (#19671) · 3e07196f
  Arthur authored Oct 18, 2022
  
  3e07196f
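
A minimal sketch of the check this commit title describes, using plain Python lists instead of tensors. The function names, the default token ids, and the use of `-100` as the ignored-label marker are assumptions made for illustration, not details confirmed by this log.

```python
from typing import List, Optional


def shift_tokens_right(labels: List[int], pad_token_id: int, decoder_start_token_id: int) -> List[int]:
    """Shift labels one position to the right and prepend the start token."""
    shifted = [decoder_start_token_id] + labels[:-1]
    # Assumed convention: -100 marks ignored label positions; swap in the pad token.
    return [pad_token_id if tok == -100 else tok for tok in shifted]


def build_decoder_input_ids(
    labels: Optional[List[int]],
    decoder_input_ids: Optional[List[int]],
    decoder_inputs_embeds: Optional[list],
    pad_token_id: int = 1,
    decoder_start_token_id: int = 2,
) -> Optional[List[int]]:
    # Derive decoder inputs from labels only when the caller supplied
    # neither decoder_input_ids nor decoder_inputs_embeds.
    if labels is not None and decoder_input_ids is None and decoder_inputs_embeds is None:
        return shift_tokens_right(labels, pad_token_id, decoder_start_token_id)
    return decoder_input_ids
```

When `decoder_inputs_embeds` is provided, the labels are left alone and no `decoder_input_ids` are synthesized.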
11 Oct, 2022 1 commit

🚨 TF: Remove `TFWrappedEmbeddings` (breaking: TF embedding initialization... · 462cd641

Joao Gante authored Oct 11, 2022

🚨🚨🚨  TF: Remove `TFWrappedEmbeddings` (breaking: TF embedding initialization updated for encoder-decoder models) (#19263)

* added test

* correct embedding init

* some changes in blenderbot (incomplete)

* update blenderbot (diff to be used as reference)

* update blenderbot_small

* update LED

* update marian

* update T5 and remove TFWrappedEmbeddings

* nullcontext() -> ContextManagers()

* fix embedding init

462cd641

22 Sep, 2022 1 commit
- TF: check embeddings range (#19102) · 1b5ab39c
  Joao Gante authored Sep 22, 2022
  
  1b5ab39c
15 Sep, 2022 1 commit

Update serving signatures and make sure we actually use them (#19034) · 2322eb8e

Matt authored Sep 15, 2022

* Override save() to use the serving signature as the default

* Replace int32 with int64 in all our serving signatures

* Remember one very important line so as not to break every test at once

* Dtype fix for TFLED

* dtype fix for shift_tokens_right in general

* Dtype fixes in mBART and RAG

* Fix dtypes for test_unpack_inputs

* More dtype fixes

* Yet more mBART + RAG dtype fixes

* Yet more mBART + RAG dtype fixes

* Add a check that the model actually has a serving method

2322eb8e

14 Sep, 2022 2 commits
- TF: tf.debugging assertions without tf.running_eagerly() protection (#19030) · 31be02f1
  Joao Gante authored Sep 14, 2022
  
  31be02f1
- PyTorch >= 1.7.0 and TensorFlow >= 2.4.0 (#19016) · a2a3afbc
  Sylvain Gugger authored Sep 14, 2022
  
  a2a3afbc
12 Sep, 2022 1 commit
- Fix TF start docstrings (#18991) · cf450b77
  Matt authored Sep 12, 2022
```
* Update our TF 2.0 input format tip across all models

* make style
```
  cf450b77
07 Sep, 2022 1 commit
- TF: final bias as a layer in seq2seq models (replicate TFMarian fix) (#18903) · 0eabab09
  Joao Gante authored Sep 07, 2022
  
  0eabab09
03 Aug, 2022 1 commit

Fix torch version comparisons (#18460) · 02b176c4

LSinev authored Aug 03, 2022

Comparisons like
version.parse(torch.__version__) > version.parse("1.6")
are True for torch==1.6.0+cu101 or torch==1.6.0+cpu

Comparisons using version.parse(version.parse(torch.__version__).base_version) are preferred (and available in pytorch_utils.py).

02b176c4
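
The pitfall described in this commit message can be reproduced without importing torch. The sketch below assumes the `packaging` library is installed and uses a hard-coded string as a stand-in for `torch.__version__`.

```python
from packaging import version

# Stand-in for torch.__version__ on a CUDA build; torch itself is not imported.
torch_version = "1.6.0+cu101"

# Naive comparison: the local "+cu101" segment sorts after the plain release.
naive = version.parse(torch_version) > version.parse("1.6")

# Comparison on the base version, which drops the local segment.
base = version.parse(version.parse(torch_version).base_version)
strict = base > version.parse("1.6")

print(naive)   # True
print(strict)  # False
```

Comparing on `base_version` keeps a "newer than 1.6" check from passing on a 1.6.0 build that only differs by its local build tag.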

01 Aug, 2022 1 commit
- Add a check regarding the number of occurrences of ``` (#18389) · bd6d1b43
  Yih-Dar authored Aug 01, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  bd6d1b43
11 Jul, 2022 1 commit

Fix some typos. (#17560) · 95113d13

Yulv-git authored Jul 11, 2022



* Fix some typos.
Signed-off-by: Yulv-git <yulvchi@qq.com>

* Fix typo.
Signed-off-by: Yulv-git <yulvchi@qq.com>

* make fixup.

95113d13

01 Jul, 2022 1 commit

[Flax] Add remat (gradient checkpointing) (#17843) · 485bbe79

Sanchit Gandhi authored Jul 01, 2022

* [Flax] Add remat (gradient checkpointing)

* fix variable naming in test

* flip: checkpoint using a method

* fix naming

* fix class naming

* apply PVP's suggestions from code review

* make fix-copies

* fix big-bird, electra, roberta

* cookie-cutter

* fix flax big-bird

* move test to common

485bbe79

29 Jun, 2022 1 commit
- Add missing comment quotes (#17379) · b8142753
  Leon Derczynski authored Jun 29, 2022
  
  b8142753
20 Jun, 2022 2 commits

Not use -1e4 as attn mask (#17306) · d3cb2888

Yih-Dar authored Jun 20, 2022



* Use torch.finfo(self.dtype).min

* for GPTNeoX

* for Albert

* For Splinter

* Update src/transformers/models/data2vec/modeling_data2vec_audio.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* fix -inf used in Bart-like models

* Fix a few remaining -inf

* more fix

* clean up

* For CLIP

* For FSMT

* clean up

* fix test

* Add dtype argument and use it for LayoutLMv3

* update FlaxLongT5Attention
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

d3cb2888
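
One way to see why a fixed `-1e4` or `-inf` can be a poor additive mask value, sketched in plain Python rather than torch. `FLOAT32_MIN` stands in for what `torch.finfo(torch.float32).min` would return, the arithmetic runs in Python's float64, and the scores are deliberately exaggerated. The log does not state the commit's exact motivation, so treat this as an illustration only.

```python
import math

# Most negative finite float32 value, used here as the mask fill.
FLOAT32_MIN = -(2 - 2**-23) * 2.0**127


def softmax(scores):
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def masked_softmax(scores, keep, fill):
    # Additive masking: positions with keep=False get `fill` added before softmax.
    return softmax([s if k else s + fill for s, k in zip(scores, keep)])


scores = [0.0, 2e4]   # second position has a very large raw score
keep = [True, False]  # and is supposed to be masked out

weak = masked_softmax(scores, keep, -1e4)            # masked position still dominates
strong = masked_softmax(scores, keep, FLOAT32_MIN)   # masked position gets zero weight

# With every position masked, -inf yields NaN while a finite minimum does not.
all_inf = masked_softmax([0.0, 1.0], [False, False], float("-inf"))
all_min = masked_softmax([0.0, 1.0], [False, False], FLOAT32_MIN)
```

A dtype-dependent finite minimum is large enough to suppress masked positions while still avoiding the `-inf - (-inf)` case that produces NaN.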

TF: BART compatible with XLA generation (#17479) · 132402d7
Joao Gante authored Jun 20, 2022
```
* Also propagate changes to blenderbot, blenderbot_small, marian, mbart, and pegasus
```
132402d7

13 Jun, 2022 1 commit
- Fix typo in adding_a_new_model README (#17679) · a5282ab4
  Ayush Mangal authored Jun 13, 2022
  
  a5282ab4
16 May, 2022 1 commit
- Fix obvious typos in flax decoder impl (#17279) · e86faecf
  cloudhan authored May 16, 2022
```
Change config.encoder_ffn_dim -> config.decoder_ffn_dim for decoder.
```
  e86faecf
12 May, 2022 1 commit
- update BART docs (#17212) · 9bd67ac7
  Suraj Patil authored May 12, 2022
  
  9bd67ac7
10 May, 2022 1 commit
- Fix template init (#17163) · 4ad2f68e
  Sylvain Gugger authored May 10, 2022
  
  4ad2f68e
09 May, 2022 1 commit

[WIP] Fix Pyright static type checking by replacing if-else imports with try-except (#16578) · df735d13

Dom Miketa authored May 09, 2022



* rebase and isort

* modify cookiecutter init

* fix cookiecutter auto imports

* fix clean_frameworks_in_init

* fix add_model_to_main_init

* blackify

* replace unnecessary f-strings

* update yolos imports

* fix roberta import bug

* fix yolos missing dependency

* fix add_model_like and cookiecutter bug

* fix repository consistency error

* modify cookiecutter, fix add_new_model_like

* remove stale line
Co-authored-by: Dom Miketa <dmiketa@exscientia.co.uk>

df735d13
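
A minimal sketch of the try-except import guard that this commit title refers to, as opposed to an `if backend_available: ... else: ...` block. All names here (`OptionalDependencyNotAvailable` as defined locally, `is_available`, `build_import_structure`, the `demo` module names) are illustrative stand-ins and not the repository's actual code.

```python
import importlib.util


class OptionalDependencyNotAvailable(Exception):
    """Raised when an optional backend is missing (local stand-in)."""


def is_available(package: str) -> bool:
    # True when the top-level package can be found without importing it.
    return importlib.util.find_spec(package) is not None


def build_import_structure(backend: str) -> dict:
    structure = {"configuration_demo": ["DemoConfig"]}
    try:
        if not is_available(backend):
            raise OptionalDependencyNotAvailable()
    except OptionalDependencyNotAvailable:
        pass  # backend missing: expose nothing extra
    else:
        # Backend present: register the objects that depend on it.
        structure["modeling_demo"] = ["DemoModel"]
    return structure
```

For example, `build_import_structure("json")` registers the `modeling_demo` entry, while a backend name that is not installed leaves only the configuration entry.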

03 May, 2022 3 commits

Remove device parameter from create_extended_attention_mask_for_decoder (#16894) · 39f8eafc
Pavel Belevich authored May 03, 2022

39f8eafc

Move test model folders (#17034) · 19420fd9

Yih-Dar authored May 03, 2022



* move test model folders (TODO: fix imports and others)

* fix (potentially partially) imports (in model test modules)

* fix (potentially partially) imports (in tokenization test modules)

* fix (potentially partially) imports (in feature extraction test modules)

* fix import utils.test_modeling_tf_core

* fix path ../fixtures/

* fix imports about generation.test_generation_flax_utils

* fix more imports

* fix fixture path

* fix get_test_dir

* update module_to_test_file

* fix get_tests_dir from wrong transformers.utils

* update config.yml (CircleCI)

* fix style

* remove missing imports

* update new model script

* update check_repo

* update SPECIAL_MODULE_TO_TEST_MAP

* fix style

* add __init__

* update self-scheduled

* fix add_new_model scripts

* check one way to get location back

* python setup.py build install

* fix import in test auto

* update self-scheduled.yml

* update slack notification script

* Add comments about artifact names

* fix for yolos
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

19420fd9

[FlaxBert] Add ForCausalLM (#16995) · cd9274d0

Sanchit Gandhi authored May 03, 2022

* [FlaxBert] Add ForCausalLM

* make style

* fix output attentions

* Add RobertaForCausalLM

* remove comment

* fix fx-to-pt model loading

* remove comment

* add modeling tests

* add enc-dec model tests

* add big_bird

* add electra

* make style

* make repo-consistency

* add to docs

* remove roberta test

* quality

* amend cookiecutter

* fix attention_mask bug in flax bert model tester

* tighten pt-fx thresholds to 1e-5

* add 'copied from' statements

* amend 'copied from' statements

* amend 'copied from' statements

* quality

cd9274d0

25 Apr, 2022 1 commit
- TF: XLA stable softmax (#16892) · e03966e4
  Joao Gante authored Apr 25, 2022
```
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
```
  e03966e4
19 Apr, 2022 1 commit

[Flax] improve large model init and loading (#16148) · d3bd9ac7

Suraj Patil authored Apr 19, 2022



* begin do_init

* add params_shape_tree

* raise error if params are accessed when do_init is False

* don't allow do_init=False when keys are missing

* make shape tree a property

* assign self._params at the end

* add test for do_init

* add do_init arg to all flax models

* fix param setting

* disable do_init for composite models

* update test

* add do_init in FlaxBigBirdForMultipleChoice

* better names and errors

* improve test

* style

* add a warning when do_init=False

* remove extra if

* set params after _required_params

* add test for from_pretrained

* do_init => _do_init

* change warning to info

* fix typo

* add params in init_weights

* add params to gpt neo init

* add params to init_weights

* update do_init test

* Trigger CI

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* update template

* trigger CI

* style

* style

* fix template
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

d3bd9ac7

12 Apr, 2022 1 commit

Moved functions to pytorch_utils.py (#16625) · a315988b

Anmol Joshi authored Apr 12, 2022

* Moved functions to pytorch_utils.py

* isort formatting

* Reverted tf changes

* isort, make fix-copies

* documentation fix

* Fixed Conv1D import

* Reverted research examples file

* backward compatibility for pytorch_utils

* missing import

* isort fix

a315988b