Commits · 17a7b49bda15353cc49172a0cfeb839a9719e018 · chenpangpang / transformers

30 Nov, 2022 1 commit
- Update doc examples feature extractor -> image processor (#20501) · 17a7b49b
  amyeroberts authored Nov 30, 2022
```
* Update doc example feature extractor -> image processor

* Apply suggestions from code review
```
  17a7b49b
28 Nov, 2022 1 commit

More TF int dtype fixes (#20384) · de4159a3

Matt authored Nov 28, 2022

* Add a test to ensure int dummy inputs are int64

* Move the test into the existing int64 test and update a lot of existing dummies

* Fix remaining dummies

* Fix remaining dummies

* Test for int64 serving sigs as well

* Update core tests to use tf.int64

* Add better messages to the assertions

* Update all serving sigs to int64

* More sneaky hiding tf.int32s

* Add an optional int32 signature in save_pretrained

* make fixup

* Add Amy's suggestions

* Switch all serving sigs back to tf.int32

* Switch all dummies to tf.int32

* Adjust tests to check for tf.int32 instead of tf.int64

* Fix base dummy_inputs dtype

* Start casting to tf.int32 in input_processing

* Change dtype for unpack_inputs test

* Add proper tf.int32 test

* Make the alternate serving signature int64

de4159a3

09 Nov, 2022 1 commit

Generate: move generation_*.py src files into generation/*.py (#20096) · f270b960

Joao Gante authored Nov 09, 2022

* move generation_*.py src files into generation/*.py

* populate generation.__init__ with lazy loading

* move imports and references from generation.xxx.object to generation.object

f270b960
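
The lazy-loading idea mentioned in this commit can be sketched roughly as follows: a name is imported from its defining module only when it is first accessed. `LazyNamespace` and its name-to-module mapping are hypothetical names for illustration, not the mechanism transformers actually uses, and standard-library modules stand in for the `generation` submodules.

```python
import importlib


class LazyNamespace:
    """Resolve public names to objects in other modules on first access."""

    def __init__(self, name_to_module):
        # Maps a public name to the module that defines it (hypothetical layout).
        self._name_to_module = dict(name_to_module)

    def __getattr__(self, name):
        # Only called when normal attribute lookup fails, i.e. before caching.
        try:
            module_name = self._name_to_module[name]
        except KeyError:
            raise AttributeError(name) from None
        value = getattr(importlib.import_module(module_name), name)
        setattr(self, name, value)  # cache so later lookups skip __getattr__
        return value


# Stand-in for a package namespace; stdlib modules play the submodules.
lazy = LazyNamespace({"dumps": "json", "sqrt": "math"})
```

Accessing `lazy.sqrt` triggers the import of `math` and caches the function on the instance, so subsequent lookups are ordinary attribute reads.
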

18 Oct, 2022 1 commit
- check decoder_inputs_embeds is None before shifting labels (#19671) · 3e07196f
  Arthur authored Oct 18, 2022
  
  3e07196f
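
A minimal sketch of the check this commit title describes, using plain Python lists instead of tensors. `prepare_decoder_inputs`, the default token ids, and the handling of `-100` are assumptions for illustration; the actual model code may differ.

```python
def shift_tokens_right(labels, pad_token_id, decoder_start_token_id):
    """Shift each label row one position right and prepend the start token."""
    shifted = [[decoder_start_token_id] + row[:-1] for row in labels]
    # Assumption: -100 marks ignored label positions; replace them with padding.
    return [[pad_token_id if t == -100 else t for t in row] for row in shifted]


def prepare_decoder_inputs(
    labels,
    decoder_input_ids=None,
    decoder_inputs_embeds=None,
    pad_token_id=1,
    decoder_start_token_id=2,
):
    # Derive decoder_input_ids from labels only when the caller supplied
    # neither token ids nor precomputed embeddings.
    if labels is not None and decoder_input_ids is None and decoder_inputs_embeds is None:
        decoder_input_ids = shift_tokens_right(labels, pad_token_id, decoder_start_token_id)
    return decoder_input_ids, decoder_inputs_embeds
```

With this guard, passing `decoder_inputs_embeds` together with `labels` leaves `decoder_input_ids` unset instead of silently building ids from the labels.
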
11 Oct, 2022 1 commit

TF: Remove `TFWrappedEmbeddings` (breaking: TF embedding initialization... · 462cd641

Joao Gante authored Oct 11, 2022

🚨🚨🚨  TF: Remove `TFWrappedEmbeddings` (breaking: TF embedding initialization updated for encoder-decoder models) (#19263)

* added test

* correct embedding init

* some changes in blenderbot (incomplete)

* update blenderbot (diff to be used as reference)

* update blenderbot_small

* update LED

* update marian

* update T5 and remove TFWrappedEmbeddings

* nullcontext() -> ContextManagers()

* fix embedding init

462cd641

22 Sep, 2022 1 commit
- TF: check embeddings range (#19102) · 1b5ab39c
  Joao Gante authored Sep 22, 2022
  
  1b5ab39c
15 Sep, 2022 1 commit

Update serving signatures and make sure we actually use them (#19034) · 2322eb8e

Matt authored Sep 15, 2022

* Override save() to use the serving signature as the default

* Replace int32 with int64 in all our serving signatures

* Remember one very important line so as not to break every test at once

* Dtype fix for TFLED

* dtype fix for shift_tokens_right in general

* Dtype fixes in mBART and RAG

* Fix dtypes for test_unpack_inputs

* More dtype fixes

* Yet more mBART + RAG dtype fixes

* Yet more mBART + RAG dtype fixes

* Add a check that the model actually has a serving method

2322eb8e

14 Sep, 2022 2 commits
- TF: tf.debugging assertions without tf.running_eagerly() protection (#19030) · 31be02f1
  Joao Gante authored Sep 14, 2022
  
  31be02f1
- PyTorch >= 1.7.0 and TensorFlow >= 2.4.0 (#19016) · a2a3afbc
  Sylvain Gugger authored Sep 14, 2022
  
  a2a3afbc
12 Sep, 2022 1 commit
- Fix TF start docstrings (#18991) · cf450b77
  Matt authored Sep 12, 2022
```
* Update our TF 2.0 input format tip across all models

* make style
```
  cf450b77
07 Sep, 2022 1 commit
- TF: final bias as a layer in seq2seq models (replicate TFMarian fix) (#18903) · 0eabab09
  Joao Gante authored Sep 07, 2022
  
  0eabab09
06 Aug, 2022 1 commit

`transformers-cli login` => `huggingface-cli login` (#18490) · 9129fd03

Julien Chaumond authored Aug 06, 2022

* zero chance anyone's using that constant no?

* `transformers-cli login` => `huggingface-cli login`

* `transformers-cli repo create` => `huggingface-cli repo create`

* `make style`

9129fd03

03 Aug, 2022 1 commit

Fix torch version comparisons (#18460) · 02b176c4

LSinev authored Aug 03, 2022

Comparisons like
version.parse(torch.__version__) > version.parse("1.6")
are True for torch==1.6.0+cu101 or torch==1.6.0+cpu.

Comparisons using version.parse(version.parse(torch.__version__).base_version) are preferred (and available in pytorch_utils.py).

02b176c4
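
The behaviour described in this commit message can be reproduced with the `packaging` library alone. The sketch below uses hard-coded version strings as stand-ins for `torch.__version__` so it runs without torch; `naive_gt` and `base_gt` are hypothetical helper names.

```python
from packaging import version


def naive_gt(raw, minimum):
    # A local version label such as "+cu101" sorts after the same public version,
    # so "1.6.0+cu101" compares as greater than "1.6".
    return version.parse(raw) > version.parse(minimum)


def base_gt(raw, minimum):
    # Dropping the local label via base_version gives the intended comparison.
    return version.parse(version.parse(raw).base_version) > version.parse(minimum)


for raw in ("1.6.0+cu101", "1.6.0+cpu", "1.7.0+cpu"):
    print(raw, naive_gt(raw, "1.6"), base_gt(raw, "1.6"))
```

For the two 1.6.0 builds the naive comparison reports True while the `base_version` comparison reports False; for 1.7.0 both report True.
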

01 Aug, 2022 1 commit
- Add a check regarding the number of occurrences of ``` (#18389) · bd6d1b43
  Yih-Dar authored Aug 01, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  bd6d1b43
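
One plausible form of such a check is sketched below, assuming the intent is that code-fence markers come in opening and closing pairs. The even-count rule and the name `has_balanced_code_fences` are assumptions; the commit title does not spell out the exact rule.

```python
# Build the marker programmatically so this example does not contain a raw fence.
FENCE = "`" * 3


def has_balanced_code_fences(text):
    """Return True when the text contains an even number of fence markers."""
    return text.count(FENCE) % 2 == 0


good = "Example:\n" + FENCE + "python\nx = 1\n" + FENCE + "\n"
bad = "Example:\n" + FENCE + "python\nx = 1\n"
print(has_balanced_code_fences(good), has_balanced_code_fences(bad))
```
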
11 Jul, 2022 1 commit

Fix some typos. (#17560) · 95113d13

Yulv-git authored Jul 11, 2022



* Fix some typos.
Signed-off-by: Yulv-git <yulvchi@qq.com>

* Fix typo.
Signed-off-by: Yulv-git <yulvchi@qq.com>

* make fixup.

95113d13

01 Jul, 2022 1 commit

[Flax] Add remat (gradient checkpointing) (#17843) · 485bbe79

Sanchit Gandhi authored Jul 01, 2022

* [Flax] Add remat (gradient checkpointing)

* fix variable naming in test

* flip: checkpoint using a method

* fix naming

* fix class naming

* apply PVP's suggestions from code review

* make fix-copies

* fix big-bird, electra, roberta

* cookie-cutter

* fix flax big-bird

* move test to common

485bbe79

29 Jun, 2022 1 commit
- Add missing comment quotes (#17379) · b8142753
  Leon Derczynski authored Jun 29, 2022
  
  b8142753
20 Jun, 2022 2 commits

Not use -1e4 as attn mask (#17306) · d3cb2888

Yih-Dar authored Jun 20, 2022



* Use torch.finfo(self.dtype).min

* for GPTNeoX

* for Albert

* For Splinter

* Update src/transformers/models/data2vec/modeling_data2vec_audio.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* fix -inf used in Bart-like models

* Fix a few remaining -inf

* more fix

* clean up

* For CLIP

* For FSMT

* clean up

* fix test

* Add dtype argument and use it for LayoutLMv3

* update FlaxLongT5Attention
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

d3cb2888
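
A small pure-Python sketch of one way a fixed `-1e4` mask value can fall short when raw attention scores are large. Here `-sys.float_info.max` plays the role that `torch.finfo(self.dtype).min` plays for a tensor dtype; the scores and helper names are made up for illustration.

```python
import math
import sys


def softmax(scores):
    peak = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def masked_weights(scores, keep, mask_value):
    # Additive masking: masked positions get mask_value added to their raw score.
    return softmax([s if k else s + mask_value for s, k in zip(scores, keep)])


scores = [0.0, 12000.0]  # the second position is masked but has a large raw score
keep = [True, False]

fixed = masked_weights(scores, keep, -1e4)
dtype_min = masked_weights(scores, keep, -sys.float_info.max)
print(fixed, dtype_min)
```

With `-1e4` the masked position still ends up with the larger logit and takes all of the attention weight, whereas the most negative representable value drives its weight to zero.
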

TF: BART compatible with XLA generation (#17479) · 132402d7
Joao Gante authored Jun 20, 2022
```
* Also propagate changes to blenderbot, blenderbot_small, marian, mbart, and pegasus
```
132402d7

13 Jun, 2022 1 commit
- Fix typo in adding_a_new_model README (#17679) · a5282ab4
  Ayush Mangal authored Jun 13, 2022
  
  a5282ab4
07 Jun, 2022 1 commit

Add examples telemetry (#17552) · 3cab9027

Sylvain Gugger authored Jun 07, 2022

* Add examples telemetry

* Alternative approach

* Add to all other examples

* Add to templates as well

* Put framework separately

* Same for TensorFlow

3cab9027

16 May, 2022 1 commit
- Fix obvious typos in flax decoder impl (#17279) · e86faecf
  cloudhan authored May 16, 2022
```
Change config.encoder_ffn_dim -> config.decoder_ffn_dim for decoder.
```
  e86faecf
12 May, 2022 1 commit
- update BART docs (#17212) · 9bd67ac7
  Suraj Patil authored May 12, 2022
  
  9bd67ac7
10 May, 2022 1 commit
- Fix template init (#17163) · 4ad2f68e
  Sylvain Gugger authored May 10, 2022
  
  4ad2f68e
09 May, 2022 1 commit

[WIP] Fix Pyright static type checking by replacing if-else imports with try-except (#16578) · df735d13

Dom Miketa authored May 09, 2022



* rebase and isort

* modify cookiecutter init

* fix cookiecutter auto imports

* fix clean_frameworks_in_init

* fix add_model_to_main_init

* blackify

* replace unnecessary f-strings

* update yolos imports

* fix roberta import bug

* fix yolos missing dependency

* fix add_model_like and cookiecutter bug

* fix repository consistency error

* modify cookiecutter, fix add_new_model_like

* remove stale line
Co-authored-by: Dom Miketa <dmiketa@exscientia.co.uk>

df735d13
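
The general shape of replacing an if-else import guard with try-except can be sketched as below. `OptionalDependencyNotAvailable`, `is_available`, `build_import_structure`, and the module and class names in the dictionary are illustrative assumptions, not a copy of the code changed in this commit.

```python
import importlib.util


class OptionalDependencyNotAvailable(Exception):
    """Signals that an optional backend is not installed."""


def is_available(package):
    # find_spec returns None for a top-level package that cannot be found.
    return importlib.util.find_spec(package) is not None


def build_import_structure(backend):
    import_structure = {"configuration": ["Config"]}
    try:
        if not is_available(backend):
            raise OptionalDependencyNotAvailable()
    except OptionalDependencyNotAvailable:
        pass  # backend missing: expose only the backend-independent objects
    else:
        import_structure["modeling"] = ["Model"]
    return import_structure


print(build_import_structure("json"))
print(build_import_structure("surely_missing_backend_xyz"))
```
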

03 May, 2022 3 commits

Remove device parameter from create_extended_attention_mask_for_decoder (#16894) · 39f8eafc
Pavel Belevich authored May 03, 2022

39f8eafc

Move test model folders (#17034) · 19420fd9

Yih-Dar authored May 03, 2022



* move test model folders (TODO: fix imports and others)

* fix (potentially partially) imports (in model test modules)

* fix (potentially partially) imports (in tokenization test modules)

* fix (potentially partially) imports (in feature extraction test modules)

* fix import utils.test_modeling_tf_core

* fix path ../fixtures/

* fix imports about generation.test_generation_flax_utils

* fix more imports

* fix fixture path

* fix get_test_dir

* update module_to_test_file

* fix get_tests_dir from wrong transformers.utils

* update config.yml (CircleCI)

* fix style

* remove missing imports

* update new model script

* update check_repo

* update SPECIAL_MODULE_TO_TEST_MAP

* fix style

* add __init__

* update self-scheduled

* fix add_new_model scripts

* check one way to get location back

* python setup.py build install

* fix import in test auto

* update self-scheduled.yml

* update slack notification script

* Add comments about artifact names

* fix for yolos
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

19420fd9

[FlaxBert] Add ForCausalLM (#16995) · cd9274d0

Sanchit Gandhi authored May 03, 2022

* [FlaxBert] Add ForCausalLM

* make style

* fix output attentions

* Add RobertaForCausalLM

* remove comment

* fix fx-to-pt model loading

* remove comment

* add modeling tests

* add enc-dec model tests

* add big_bird

* add electra

* make style

* make repo-consistency

* add to docs

* remove roberta test

* quality

* amend cookiecutter

* fix attention_mask bug in flax bert model tester

* tighten pt-fx thresholds to 1e-5

* add 'copied from' statements

* amend 'copied from' statements

* amend 'copied from' statements

* quality

cd9274d0

02 May, 2022 1 commit
- add torch.no_grad when in eval mode (#17020) · bdd690a7
  yujun authored May 02, 2022
```
* add torch.no_grad when in eval mode

* make style quality
```
  bdd690a7
25 Apr, 2022 1 commit
- TF: XLA stable softmax (#16892) · e03966e4
  Joao Gante authored Apr 25, 2022
```
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
```
  e03966e4
19 Apr, 2022 1 commit

[Flax] improve large model init and loading (#16148) · d3bd9ac7

Suraj Patil authored Apr 19, 2022



* begin do_init

* add params_shape_tree

* raise error if params are accessed when do_init is False

* don't allow do_init=False when keys are missing

* make shape tree a property

* assign self._params at the end

* add test for do_init

* add do_init arg to all flax models

* fix param setting

* disable do_init for composite models

* update test

* add do_init in FlaxBigBirdForMultipleChoice

* better names and errors

* improve test

* style

* add a warning when do_init=False

* remove extra if

* set params after _required_params

* add test for from_pretrained

* do_init => _do_init

* change warning to info

* fix typo

* add params in init_weights

* add params to gpt neo init

* add params to init_weights

* update do_init test

* Trigger CI

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* update template

* trigger CI

* style

* style

* fix template
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

d3bd9ac7

12 Apr, 2022 1 commit

Moved functions to pytorch_utils.py (#16625) · a315988b

Anmol Joshi authored Apr 12, 2022

* Moved functions to pytorch_utils.py

* isort formatting

* Reverted tf changes

* isort, make fix-copies

* documentation fix

* Fixed Conv1D import

* Reverted research examples file

* backward compatibility for pytorch_utils

* missing import

* isort fix

a315988b

05 Apr, 2022 2 commits

Adding new train_step logic to make things less confusing for users (#15994) · 43540052

Matt authored Apr 05, 2022



* Adding new train_step logic to make things less confusing for users

* DO NOT ASK WHY WE NEED THAT SUBCLASS

* Metrics now working, at least for single-output models with type annotations!

* Updates and TODOs for the new train_step

* Make fixup

* Temporary test workaround until T5 has types

* Temporary test workaround until T5 has types

* I think this actually works! Needs a lot of tests though

* Make style/quality

* Revert changes to T5 tests

* Deleting the aforementioned unmentionable subclass

* Deleting the aforementioned unmentionable subclass

* Adding a Keras API test

* Style fixes

* Removing unneeded TODO and comments

* Update test_step too

* Stop trying to compute metrics with the dummy_loss, patch up test

* Make style

* make fixup

* Docstring cleanup

* make fixup

* make fixup

* Stop expanding 1D input tensors when using dummy loss

* Adjust T5 test given the new compile()

* make fixup

* Skipping test for convnext

* Removing old T5-specific Keras test now that we have a common one

* make fixup

* make fixup

* Only skip convnext test on CPU

* Update src/transformers/modeling_tf_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/modeling_tf_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Avoiding TF import issues

* make fixup

* Update compile() to support TF 2.3

* Skipping model.fit() on template classes for now

* Skipping model.fit() on template class tests for now

* Replace ad-hoc solution with find_labels

* make fixup
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

43540052

add a template to add missing tokenization test (#16553) · 02214cb3

SaulLu authored Apr 05, 2022



* add a template to add missing tokenization test

* add cookiecutter setting

* improve doc

* Update templates/adding_a_missing_tokenization_test/README.md
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

02214cb3

04 Apr, 2022 1 commit
- TF: Finalize `unpack_inputs`-related changes (#16499) · dad5ca83
  Joao Gante authored Apr 04, 2022
```
* Add unpack_inputs to remaining models

* removed kwargs to `call()` in TF models

* fix TF T5 tests
```
  dad5ca83
01 Apr, 2022 1 commit

Use random_attention_mask for TF tests (#16517) · 2199382d

Yih-Dar authored Apr 01, 2022



* use random_attention_mask for TF tests

* Fix for TFCLIP test (for now).
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

2199382d

30 Mar, 2022 1 commit

TF: unpack inputs on Convbert, GPTJ, LED, and templates (#16491) · c2f8eaf6

Joao Gante authored Mar 30, 2022

* Add unpack_inputs to remaining models

* remove stray use of inputs in the templates; fix tf.debugging of attn masks

c2f8eaf6

25 Mar, 2022 1 commit
- Big file_utils cleanup (#16396) · 088c1880
  Sylvain Gugger authored Mar 25, 2022
```
* Big file_utils cleanup

* This one still needs to be treated separately
```
  088c1880
23 Mar, 2022 2 commits

Reorganize file utils (#16264) · 4975002d

Sylvain Gugger authored Mar 23, 2022

* Split file_utils in several submodules

* Fixes

* Add back more objects

* More fixes

* Who exactly decided to import that from there?

* Second suggestion to code with code review

* Revert wrong move

* Fix imports

* Adapt all imports

* Adapt all imports everywhere

* Revert this import, will fix in a separate commit

4975002d

Updates the default branch from master to main (#16326) · eca77f47

Lysandre Debut authored Mar 23, 2022



* Updates the default branch from master to main

* Links from `master` to `main`

* Typo

* Update examples/flax/README.md
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

eca77f47