Commits · a5282ab4bcb0556be5bc9c82d3e17ed978419605 · chenpangpang / transformers

13 Jun, 2022 1 commit
- Fix typo in adding_a_new_model README (#17679) · a5282ab4
  Ayush Mangal authored Jun 13, 2022
  
  a5282ab4
16 May, 2022 1 commit
- Fix obvious typos in flax decoder impl (#17279) · e86faecf
  cloudhan authored May 16, 2022
```
Change config.encoder_ffn_dim -> config.decoder_ffn_dim for decoder.
```
  e86faecf
12 May, 2022 1 commit
- update BART docs (#17212) · 9bd67ac7
  Suraj Patil authored May 12, 2022
  
  9bd67ac7
10 May, 2022 1 commit
- Fix template init (#17163) · 4ad2f68e
  Sylvain Gugger authored May 10, 2022
  
  4ad2f68e
09 May, 2022 1 commit

[WIP] Fix Pyright static type checking by replacing if-else imports with try-except (#16578) · df735d13

Dom Miketa authored May 09, 2022



* rebase and isort

* modify cookiecutter init

* fix cookiecutter auto imports

* fix clean_frameworks_in_init

* fix add_model_to_main_init

* blackify

* replace unnecessary f-strings

* update yolos imports

* fix roberta import bug

* fix yolos missing dependency

* fix add_model_like and cookiecutter bug

* fix repository consistency error

* modify cookiecutter, fix add_new_model_like

* remove stale line
Co-authored-by: Dom Miketa <dmiketa@exscientia.co.uk>

df735d13

03 May, 2022 3 commits

Remove device parameter from create_extended_attention_mask_for_decoder (#16894) · 39f8eafc
Pavel Belevich authored May 03, 2022

39f8eafc

Move test model folders (#17034) · 19420fd9

Yih-Dar authored May 03, 2022



* move test model folders (TODO: fix imports and others)

* fix (potentially partially) imports (in model test modules)

* fix (potentially partially) imports (in tokenization test modules)

* fix (potentially partially) imports (in feature extraction test modules)

* fix import utils.test_modeling_tf_core

* fix path ../fixtures/

* fix imports about generation.test_generation_flax_utils

* fix more imports

* fix fixture path

* fix get_test_dir

* update module_to_test_file

* fix get_tests_dir from wrong transformers.utils

* update config.yml (CircleCI)

* fix style

* remove missing imports

* update new model script

* update check_repo

* update SPECIAL_MODULE_TO_TEST_MAP

* fix style

* add __init__

* update self-scheduled

* fix add_new_model scripts

* check one way to get location back

* python setup.py build install

* fix import in test auto

* update self-scheduled.yml

* update slack notification script

* Add comments about artifact names

* fix for yolos
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

19420fd9

[FlaxBert] Add ForCausalLM (#16995) · cd9274d0

Sanchit Gandhi authored May 03, 2022

* [FlaxBert] Add ForCausalLM

* make style

* fix output attentions

* Add RobertaForCausalLM

* remove comment

* fix fx-to-pt model loading

* remove comment

* add modeling tests

* add enc-dec model tests

* add big_bird

* add electra

* make style

* make repo-consitency

* add to docs

* remove roberta test

* quality

* amend cookiecutter

* fix attention_mask bug in flax bert model tester

* tighten pt-fx thresholds to 1e-5

* add 'copied from' statements

* amend 'copied from' statements

* amend 'copied from' statements

* quality

cd9274d0

25 Apr, 2022 1 commit
- TF: XLA stable softmax (#16892) · e03966e4
  Joao Gante authored Apr 25, 2022
```
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
```
  e03966e4
19 Apr, 2022 1 commit

[Flax] improve large model init and loading (#16148) · d3bd9ac7

Suraj Patil authored Apr 19, 2022



* begin do_init

* add params_shape_tree

* raise error if params are accessed when do_init is False

* don't allow do_init=False when keys are missing

* make shape tree a property

* assign self._params at the end

* add test for do_init

* add do_init arg to all flax models

* fix param setting

* disbale do_init for composite models

* update test

* add do_init in FlaxBigBirdForMultipleChoice

* better names and errors

* improve test

* style

* add a warning when do_init=False

* remove extra if

* set params after _required_params

* add test for from_pretrained

* do_init => _do_init

* chage warning to info

* fix typo

* add params in init_weights

* add params to gpt neo init

* add params to init_weights

* update do_init test

* Trigger CI

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* update template

* trigger CI

* style

* style

* fix template
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

d3bd9ac7

12 Apr, 2022 1 commit

Moved functions to pytorch_utils.py (#16625) · a315988b

Anmol Joshi authored Apr 12, 2022

* Moved functions to pytorch_utils.py

* isort formatting

* Reverted tf changes

* isort, make fix-copies

* documentation fix

* Fixed Conv1D import

* Reverted research examples file

* backward compatibility for pytorch_utils

* missing import

* isort fix

a315988b

05 Apr, 2022 1 commit

Adding new train_step logic to make things less confusing for users (#15994) · 43540052

Matt authored Apr 05, 2022



* Adding new train_step logic to make things less confusing for users

* DO NOT ASK WHY WE NEED THAT SUBCLASS

* Metrics now working, at least for single-output models with type annotations!

* Updates and TODOs for the new train_step

* Make fixup

* Temporary test workaround until T5 has types

* Temporary test workaround until T5 has types

* I think this actually works! Needs a lot of tests though

* MAke style/quality

* Revert changes to T5 tests

* Deleting the aforementioned unmentionable subclass

* Deleting the aforementioned unmentionable subclass

* Adding a Keras API test

* Style fixes

* Removing unneeded TODO and comments

* Update test_step too

* Stop trying to compute metrics with the dummy_loss, patch up test

* Make style

* make fixup

* Docstring cleanup

* make fixup

* make fixup

* Stop expanding 1D input tensors when using dummy loss

* Adjust T5 test given the new compile()

* make fixup

* Skipping test for convnext

* Removing old T5-specific Keras test now that we have a common one

* make fixup

* make fixup

* Only skip convnext test on CPU

* Update src/transformers/modeling_tf_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/modeling_tf_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Avoiding TF import issues

* make fixup

* Update compile() to support TF 2.3

* Skipping model.fit() on template classes for now

* Skipping model.fit() on template class tests for now

* Replace ad-hoc solution with find_labels

* make fixup
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

43540052

04 Apr, 2022 1 commit
- TF: Finalize `unpack_inputs`-related changes (#16499) · dad5ca83
  Joao Gante authored Apr 04, 2022
```
* Add unpack_inputs to remaining models

* removed kwargs to `call()` in TF models

* fix TF T5 tests
```
  dad5ca83
01 Apr, 2022 1 commit

Use random_attention_mask for TF tests (#16517) · 2199382d

Yih-Dar authored Apr 01, 2022



* use random_attention_mask for TF tests

* Fix for TFCLIP test (for now).
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

2199382d

30 Mar, 2022 1 commit

TF: unpack inputs on Convbert, GPTJ, LED, and templates (#16491) · c2f8eaf6

Joao Gante authored Mar 30, 2022

* Add unpack_inputs to remaining models

* remove stray use of inputs in the templates; fix tf.debugging of attn masks

c2f8eaf6

25 Mar, 2022 1 commit
- Big file_utils cleanup (#16396) · 088c1880
  Sylvain Gugger authored Mar 25, 2022
```
* Big file_utils cleanup

* This one still needs to be treated separately
```
  088c1880
23 Mar, 2022 2 commits

Reorganize file utils (#16264) · 4975002d

Sylvain Gugger authored Mar 23, 2022

* Split file_utils in several submodules

* Fixes

* Add back more objects

* More fixes

* Who exactly decided to import that from there?

* Second suggestion to code with code review

* Revert wront move

* Fix imports

* Adapt all imports

* Adapt all imports everywhere

* Revert this import, will fix in a separate commit

4975002d

Updates the default branch from master to main (#16326) · eca77f47

Lysandre Debut authored Mar 23, 2022



* Updates the default branch from master to main

* Links from `master` to `main`

* Typo

* Update examples/flax/README.md
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

eca77f47

22 Mar, 2022 1 commit

Add type annotations for Rembert/Splinter and copies (#16338) · ec3aace0

Jacob Dineen authored Mar 22, 2022



* undo black autoformat

* minor fix to rembert forward with default

* make fix-copies, make quality

* Adding types to template model

* Removing List from the template types

* Remove `Optional` from a couple of types that don't accept `None`
Co-authored-by: matt <rocketknight1@gmail.com>

ec3aace0

21 Mar, 2022 1 commit

added type hints for BART model (#16270) · d50f62f2

Robot Jelly authored Mar 21, 2022



* added type hints for BART model

* make fixup, adding imports to copied files

* Adding some missing types to cookiecutter

* Adding some missing types to cookiecutter

* Adding some missing types to cookiecutter
Co-authored-by: matt <rocketknight1@gmail.com>

d50f62f2

16 Mar, 2022 1 commit
- Replace all deprecated `jax.ops` operations with jnp's `at` (#16078) · ee27b3d7
  Sanchit Gandhi authored Mar 16, 2022
```
* Replace all deprecated `jax.ops` operations with jnp's `at`

* np to jnp scores

* suggested changes
```
  ee27b3d7
08 Mar, 2022 1 commit

TF generate refactor - past without encoder outputs (#15944) · 70203b59

Joao Gante authored Mar 08, 2022

* Remove packed past from generation_tf_utils

* update models with the new past format

* update template accordingly

70203b59

04 Mar, 2022 1 commit

Do not change the output from tuple to list - to match PT's version (#15918) · f0aacc14

Yih-Dar authored Mar 04, 2022



* Do not change the output from tuple to list - to match PT's version

* Fix the same issues for 5 other models and the template
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

f0aacc14

25 Feb, 2022 1 commit

Fix tf.concatenate + test past_key_values for TF models (#15774) · 8635407b

Yih-Dar authored Feb 25, 2022



* fix wrong method name tf.concatenate

* add tests related to causal LM / decoder

* make style and quality

* clean-up

* Fix TFBertModel's extended_attention_mask when past_key_values is provided

* Fix tests

* fix copies

* More tf.int8 -> tf.int32 in TF test template

* clean-up

* Update TF test template

* revert the previous commit + update the TF test template

* Fix TF template extended_attention_mask when past_key_values is provided

* Fix some styles manually

* clean-up

* Fix ValueError: too many values to unpack in the test

* Fix more: too many values to unpack in the test

* Add a comment for extended_attention_mask when there is past_key_values

* Fix TFElectra extended_attention_mask when past_key_values is provided

* Add tests to other TF models

* Fix for TF Electra test: add prepare_config_and_inputs_for_decoder

* Fix not passing training arg to lm_head in TFRobertaForCausalLM

* Fix tests (with past) for TF Roberta

* add testing for pask_key_values for TFElectra model
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

8635407b

23 Feb, 2022 2 commits
- Fix model templates (#15806) · bb7949b3
  Lysandre Debut authored Feb 23, 2022
```
* Fix model templates

* Update paths
```
  bb7949b3
- [Test refactor 1/5] Per-folder tests reorganization (#15725) · 29c10a41
  Lysandre Debut authored Feb 23, 2022
```
* Per-folder tests reorganization
Co-authored-by: sgugger <sylvain.gugger@gmail.com>
Co-authored-by: Stas Bekman <stas@stason.org>
```
  29c10a41
15 Feb, 2022 1 commit

TF generate refactor - Greedy Search (#15562) · 2e12b907

Patrick von Platen authored Feb 15, 2022



* TF generate start refactor

* Add tf tests for sample generate

* re-organize

* boom boom

* Apply suggestions from code review

* re-add

* add all code

* make random greedy pass

* make encoder-decoder random work

* further improvements

* delete bogus file

* make gpt2 and t5 tests work

* finish logits tests

* correct logits processors

* correct past / encoder_outputs drama

* refactor some methods

* another fix

* refactor shape_list

* fix more shape list

* import shape
_list

* finish docs

* fix imports

* make style

* correct tf utils

* Fix TFRag as well

* Apply Lysandre's and Sylvais suggestions

* Update tests/test_generation_tf_logits_process.py
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

* Update src/transformers/tf_utils.py
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

* remove cpu according to gante

* correct logit processor
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

2e12b907

08 Feb, 2022 1 commit

Force use_cache to be False in PyTorch (#15385) · 6a5472a8

Yih-Dar authored Feb 08, 2022



* use_cache = False for PT models if labels is passed

* Fix for BigBirdPegasusForConditionalGeneration

* add warning if users specify use_cache=True

* Use logger.warning instead of warnings.warn
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

6a5472a8

01 Feb, 2022 2 commits

fix the `tokenizer_config.json` file for the slow tokenizer when a fast... · 7b8bdd86

SaulLu authored Feb 01, 2022

fix the `tokenizer_config.json` file for the slow tokenizer when a fast version is available (#15319)

* add new test

* update test

* remove `tokenizer_file` from `additional_files_names` in `tokenization_utils_base.py`

* add `tokenizer_file` for the fast only tokenizer

* change global variables layoutxml

* remove `"tokenizer_file"` from DPR tokenizer's Global variables

* remove `tokenizer_file` from herbert slow tokenizer init

* `"tokenizer_file"` from LED tokenizer's Global variables

* remove `tokenizer_file` from mbart slow tokenizer init

* remove `tokenizer_file` from slow tokenizer template

* adapt to versioning

* adapt the `test_tokenizer_mismatch_warning` test

* clean test

* clarify `VOCAB_FILES_NAMES` in tokenization_utils_fast.py

* Revert "remove `tokenizer_file` from mbart slow tokenizer init"

This reverts commit 0dbb723fa9c7599d4640fe30b3647a74eb4a64e1.

* Revert "`"tokenizer_file"` from LED tokenizer's Global variables"

This reverts commit 5a3f879bdd651233f3d74a3d1146c34cde82b0c2.

* Revert "remove `tokenizer_file` from herbert slow tokenizer init"

This reverts commit f5e10007b7b0ec5345e015b9de7ffec72c5407fd.

* Revert "remove `"tokenizer_file"` from DPR tokenizer's Global variables"

This reverts commit da0895330bedfafc81ae3073470a9348c669f032.

* set `tokenizer_file` in super `__init__` of mbart

7b8bdd86

Fix TF Causal LM models' returned logits (#15256) · dc05dd53

Yih-Dar authored Feb 01, 2022



* Fix TF Causal LM models' returned logits

* Fix expected shape in the tests
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

dc05dd53

31 Jan, 2022 2 commits

Fix loss calculation in TFXXXForTokenClassification models (#15294) · 554d333e

Yih-Dar authored Jan 31, 2022



* Fix loss calculation in TFFunnelForTokenClassification

* revert the change in TFFunnelForTokenClassification

* fix FunnelForTokenClassification loss

* fix other TokenClassification loss

* fix more

* fix more

* add num_labels to ElectraForTokenClassification

* revert the change to research projects
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

554d333e

Add doc for add-new-model-like command (#15433) · 7fc6f41d
Sylvain Gugger authored Jan 31, 2022

7fc6f41d

24 Jan, 2022 1 commit

[Fix doc example] fix missing import jnp (#15291) · c15bb3fe

Yih-Dar authored Jan 24, 2022



* fix missing import jnp

* Fix missing jax and k=1
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

c15bb3fe

21 Jan, 2022 1 commit

Adds missing module_specs for usages of _LazyModule (#15230) · c962c2ad

Jonas Kuball authored Jan 21, 2022

* Add missing __spec__ for transformers.models.auto

* Moves the __spec__-test to the UnitTest class

* Adds module_spec to all instances of _LazyModule

* Refactors an old test from pytest to unittest

c962c2ad

19 Jan, 2022 1 commit

Rename compute_loss in TF models (#15207) · 2708bfa1

Matt authored Jan 19, 2022

* Rename compute_loss to hf_compute_loss to avoid conflicts with the new Keras method

* make style

* Adding deprecation warning to `compute_loss`

* Fix sneaky reference to compute_loss

* Replace logger.warning with warnings.warn

* Clarifying warning and deprecation timeline

2708bfa1

14 Jan, 2022 1 commit

Check the repo consistency in model templates test (#15141) · 5f3c57fc

Sylvain Gugger authored Jan 14, 2022

* Check the repo consistency in model templates test

* Fix doc template

* Fix docstrings

* Fix last docstring

5f3c57fc

11 Jan, 2022 2 commits
- Fix typo in doc template · 1a00863e
  Sylvain Gugger authored Jan 11, 2022
  
  1a00863e
- Fix cookiecutter (#15100) · 6ea62666
  NielsRogge authored Jan 11, 2022
  
  6ea62666
10 Jan, 2022 2 commits
- [DOC] fix doc examples for bart-like models (#15093) · 3e9fdcf0
  Suraj Patil authored Jan 10, 2022
```
* fix doc examples

* remove double colons
```
  3e9fdcf0
- Happy New Year! (#15094) · 61d18ae0
  Sylvain Gugger authored Jan 10, 2022
  
  61d18ae0