  1. 05 Apr, 2022 1 commit
  2. 04 Apr, 2022 1 commit
  3. 01 Apr, 2022 1 commit
  4. 30 Mar, 2022 1 commit
  5. 25 Mar, 2022 1 commit
  6. 23 Mar, 2022 2 commits
  7. 22 Mar, 2022 1 commit
  8. 21 Mar, 2022 1 commit
  9. 16 Mar, 2022 1 commit
  10. 08 Mar, 2022 1 commit
  11. 04 Mar, 2022 1 commit
  12. 25 Feb, 2022 1 commit
    • Fix tf.concatenate + test past_key_values for TF models (#15774) · 8635407b
      Yih-Dar authored
      
      
      * fix wrong method name tf.concatenate
      
      * add tests related to causal LM / decoder
      
      * make style and quality
      
      * clean-up
      
      * Fix TFBertModel's extended_attention_mask when past_key_values is provided
      
      * Fix tests
      
      * fix copies
      
      * More tf.int8 -> tf.int32 in TF test template
      
      * clean-up
      
      * Update TF test template
      
      * revert the previous commit + update the TF test template
      
      * Fix TF template extended_attention_mask when past_key_values is provided
      
      * Fix some styles manually
      
      * clean-up
      
      * Fix ValueError: too many values to unpack in the test
      
      * Fix more: too many values to unpack in the test
      
      * Add a comment for extended_attention_mask when there is past_key_values
      
      * Fix TFElectra extended_attention_mask when past_key_values is provided
      
      * Add tests to other TF models
      
      * Fix for TF Electra test: add prepare_config_and_inputs_for_decoder
      
      * Fix not passing training arg to lm_head in TFRobertaForCausalLM
      
      * Fix tests (with past) for TF Roberta
      
      * add testing for past_key_values for TFElectra model
      Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
      8635407b
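The first bullet refers to a simple API mix-up: TensorFlow has no `tf.concatenate`; the correct call is `tf.concat`. The snippet below is only a sketch (shapes and the masking constant are assumptions, not the actual transformers code) illustrating that fix and why the attention mask must grow when `past_key_values` is provided.

```python
import tensorflow as tf

# Cached key from previous decoding steps and the key for the new token.
past_key = tf.zeros((1, 2, 3, 4))   # (batch, num_heads, past_len, head_dim)
new_key = tf.ones((1, 2, 1, 4))     # (batch, num_heads, 1, head_dim)

# tf.concatenate(...) raises AttributeError; tf.concat is the real API.
key = tf.concat([past_key, new_key], axis=2)

# With past_key_values, the mask has to cover past_len + new_len positions.
attention_mask = tf.ones((1, 4), dtype=tf.int32)            # 3 past + 1 new
extended_mask = attention_mask[:, tf.newaxis, tf.newaxis, :]
extended_mask = (1.0 - tf.cast(extended_mask, tf.float32)) * -10000.0
```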
  13. 23 Feb, 2022 2 commits
  14. 15 Feb, 2022 1 commit
    • TF generate refactor - Greedy Search (#15562) · 2e12b907
      Patrick von Platen authored
      
      
      * TF generate start refactor
      
      * Add tf tests for sample generate
      
      * re-organize
      
      * boom boom
      
      * Apply suggestions from code review
      
      * re-add
      
      * add all code
      
      * make random greedy pass
      
      * make encoder-decoder random work
      
      * further improvements
      
      * delete bogus file
      
      * make gpt2 and t5 tests work
      
      * finish logits tests
      
      * correct logits processors
      
      * correct past / encoder_outputs drama
      
      * refactor some methods
      
      * another fix
      
      * refactor shape_list
      
      * fix more shape list
      
      * import shape_list
      
      * finish docs
      
      * fix imports
      
      * make style
      
      * correct tf utils
      
      * Fix TFRag as well
      
      * Apply Lysandre's and Sylvain's suggestions
      
      * Update tests/test_generation_tf_logits_process.py
      Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
      
      * Update src/transformers/tf_utils.py
      Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
      
      * remove cpu according to gante
      
      * correct logit processor
      Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
      2e12b907
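As a rough illustration of what the greedy-search part of this refactor implements, here is a minimal, self-contained decoding loop. It is a sketch under assumptions (the `step_fn` callable and its signature are invented for the example); the real `generate` code additionally handles logits processors, `past_key_values` caching, and per-sequence finished flags.

```python
import tensorflow as tf

def greedy_search(step_fn, input_ids, max_new_tokens, eos_token_id=None):
    """Greedy decoding sketch.

    step_fn(input_ids) -> logits of shape (batch, seq_len, vocab_size).
    input_ids is an int32 tensor of shape (batch, seq_len).
    """
    for _ in range(max_new_tokens):
        logits = step_fn(input_ids)
        # Pick the highest-probability token at the last position of each sequence.
        next_tokens = tf.argmax(logits[:, -1, :], axis=-1, output_type=tf.int32)
        input_ids = tf.concat([input_ids, next_tokens[:, tf.newaxis]], axis=-1)
        # Stop early once every sequence has produced the EOS token.
        if eos_token_id is not None and bool(tf.reduce_all(next_tokens == eos_token_id)):
            break
    return input_ids
```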
  15. 08 Feb, 2022 1 commit
  16. 01 Feb, 2022 2 commits
    • fix the `tokenizer_config.json` file for the slow tokenizer when a fast version is available (#15319) · 7b8bdd86
      SaulLu authored
      fix the `tokenizer_config.json` file for the slow tokenizer when a fast version is available (#15319)
      
      * add new test
      
      * update test
      
      * remove `tokenizer_file` from `additional_files_names` in `tokenization_utils_base.py`
      
      * add `tokenizer_file` for the fast only tokenizer
      
      * change global variables layoutxlm
      
      * remove `"tokenizer_file"` from DPR tokenizer's Global variables
      
      * remove `tokenizer_file` from herbert slow tokenizer init
      
      * `"tokenizer_file"` from LED tokenizer's Global variables
      
      * remove `tokenizer_file` from mbart slow tokenizer init
      
      * remove `tokenizer_file` from slow tokenizer template
      
      * adapt to versioning
      
      * adapt the `test_tokenizer_mismatch_warning` test
      
      * clean test
      
      * clarify `VOCAB_FILES_NAMES` in tokenization_utils_fast.py
      
      * Revert "remove `tokenizer_file` from mbart slow tokenizer init"
      
      This reverts commit 0dbb723fa9c7599d4640fe30b3647a74eb4a64e1.
      
      * Revert "`"tokenizer_file"` from LED tokenizer's Global variables"
      
      This reverts commit 5a3f879bdd651233f3d74a3d1146c34cde82b0c2.
      
      * Revert "remove `tokenizer_file` from herbert slow tokenizer init"
      
      This reverts commit f5e10007b7b0ec5345e015b9de7ffec72c5407fd.
      
      * Revert "remove `"tokenizer_file"` from DPR tokenizer's Global variables"
      
      This reverts commit da0895330bedfafc81ae3073470a9348c669f032.
      
      * set `tokenizer_file` in super `__init__` of mbart
      7b8bdd86
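The gist of this fix, per the bullets above, is that the slow tokenizer's saved configuration should not list a `tokenizer_file` entry, since `tokenizer.json` only exists for the fast (Rust-backed) tokenizer. A hedged illustration (model name and output directories are arbitrary):

```python
from transformers import BertTokenizer, BertTokenizerFast

slow = BertTokenizer.from_pretrained("bert-base-uncased")       # pure-Python tokenizer
fast = BertTokenizerFast.from_pretrained("bert-base-uncased")   # Rust-backed tokenizer

# The slow tokenizer's tokenizer_config.json should not reference tokenizer.json;
# only the fast tokenizer saves and reloads that file.
slow.save_pretrained("./slow_tokenizer")
fast.save_pretrained("./fast_tokenizer")
```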
    • Fix TF Causal LM models' returned logits (#15256) · dc05dd53
      Yih-Dar authored
      
      
      * Fix TF Causal LM models' returned logits
      
      * Fix expected shape in the tests
      Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
      dc05dd53
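The fix concerns the shape of the logits a TF causal-LM head returns when labels are supplied: the loss uses shifted logits and labels internally, but the model output should still expose logits for the full sequence. The snippet below is a sketch of that idea with made-up shapes, not the actual patch.

```python
import tensorflow as tf

logits = tf.random.normal((2, 5, 100))                          # (batch, seq_len, vocab)
labels = tf.random.uniform((2, 5), maxval=100, dtype=tf.int32)

# Shift so that tokens < n predict token n; this view is used only for the loss.
loss = tf.keras.losses.sparse_categorical_crossentropy(
    labels[:, 1:], logits[:, :-1, :], from_logits=True
)

# The model output should keep the full (batch, seq_len, vocab) logits tensor.
```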
  17. 31 Jan, 2022 2 commits
  18. 24 Jan, 2022 1 commit
  19. 21 Jan, 2022 1 commit
  20. 19 Jan, 2022 1 commit
    • Rename compute_loss in TF models (#15207) · 2708bfa1
      Matt authored
      * Rename compute_loss to hf_compute_loss to avoid conflicts with the new Keras method
      
      * make style
      
      * Adding deprecation warning to `compute_loss`
      
      * Fix sneaky reference to compute_loss
      
      * Replace logger.warning with warnings.warn
      
      * Clarifying warning and deprecation timeline
      2708bfa1
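A minimal sketch of the deprecation shim this PR describes, assuming names that mirror the commit messages: `compute_loss` stays as a thin alias that warns via `warnings.warn` and forwards to the renamed `hf_compute_loss`, so existing user code keeps working while avoiding the clash with Keras' own `Model.compute_loss`. The class and warning category here are illustrative.

```python
import warnings

class ExampleTFModel:
    def hf_compute_loss(self, labels, logits):
        ...  # the actual loss computation lives here

    def compute_loss(self, *args, **kwargs):
        # Deprecated alias kept for backward compatibility.
        warnings.warn(
            "compute_loss is deprecated; use hf_compute_loss instead.",
            FutureWarning,
        )
        return self.hf_compute_loss(*args, **kwargs)
```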
  21. 14 Jan, 2022 1 commit
  22. 11 Jan, 2022 2 commits
  23. 10 Jan, 2022 2 commits
  24. 22 Dec, 2021 1 commit
  25. 21 Dec, 2021 2 commits
    • Mass conversion of documentation from rst to Markdown (#14866) · 27b3031d
      Sylvain Gugger authored
      * Convert docstrings of all configurations and tokenizers
      
      * Processors and fixes
      
      * Last modeling files and fixes to models
      
      * Pipeline modules
      
      * Utils files
      
      * Data submodule
      
      * All the other files
      
      * Style
      
      * Missing examples
      
      * Style again
      
      * Fix copies
      
      * Say bye bye to rst docstrings forever
      27b3031d
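For context, the conversion replaces Sphinx/rst roles in docstrings with plain Markdown; the fragment below is an invented before/after example of the kind of change involved, not text taken from the repo.

```python
# Before (rst / Sphinx roles):
#   :obj:`input_ids` (:obj:`torch.LongTensor` of shape :obj:`(batch_size, sequence_length)`)
#
# After (Markdown):
#   `input_ids` (`torch.LongTensor` of shape `(batch_size, sequence_length)`)
```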
    • Convert docstrings of modeling files (#14850) · 7af80f66
      Sylvain Gugger authored
      * Convert file_utils docstrings to Markdown
      
      * Test on BERT
      
      * Return block indent
      
      * Temporarily disable doc styler
      
      * Remove from quality checks as well
      
      * Remove doc styler mess
      
      * Remove check from circleCI
      
      * Fix typo
      
      * Convert file_utils docstrings to Markdown
      
      * Test on BERT
      
      * Return block indent
      
      * Temporarily disable doc styler
      
      * Remove from quality checks as well
      
      * Remove doc styler mess
      
      * Remove check from circleCI
      
      * Fix typo
      
      * Let's go on all other model files
      
      * Add templates too
      
      * Styling and quality
      7af80f66
  26. 17 Dec, 2021 1 commit
    • Implement head_mask for Flax BERT and other models copied from BERT (#14620) · ff066119
      Daniel Stancl authored
      * Implement head_mask for Flax BERT and other models copied from BERT
      
      * Remove `from jax._src.nn.functions import sigmoid`
      
      Remove `from jax._src.nn.functions import sigmoid`, which was unintentionally added by the IDE
      
      * Remove no more valid copy statement
      
      * Apply patil-suraj's suggestions from code review
      
      * Apply suggestions from the code review
      
      * Update Flax template
      
      * Fix a typo
      
      * Also update template for CausalLM modules
      ff066119
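To make the head_mask bullets concrete, here is a hedged sketch of what masking attention heads means in practice (the function name and shapes are chosen for the example, not taken from the Flax BERT code): attention probabilities are scaled per head by a 0/1 mask so individual heads can be switched off.

```python
import jax.numpy as jnp

def apply_head_mask(attn_probs, head_mask):
    """attn_probs: (batch, num_heads, q_len, k_len); head_mask: (num_heads,) of 0./1. values."""
    return attn_probs * head_mask[None, :, None, None]

probs = jnp.full((1, 12, 4, 4), 1.0 / 4.0)     # uniform attention over 4 keys, 12 heads
mask = jnp.array([1.0] * 11 + [0.0])           # disable the last head
masked = apply_head_mask(probs, mask)          # last head's attention is zeroed out
```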
  27. 16 Dec, 2021 1 commit
  28. 13 Dec, 2021 2 commits
  29. 10 Dec, 2021 1 commit
  30. 30 Nov, 2021 1 commit
  31. 18 Nov, 2021 1 commit
  32. 11 Nov, 2021 1 commit
    • Fix Flax params dtype (#13098) · e92190c0
      Suraj Patil authored
      
      
      * fix inits
      
      * fix embed dtype
      
      * fix embed dtype
      
      * add test to check default dtype
      
      * quality
      
      * add type conversion methods for flax models
      
      * more robust casting
      
      * cast sinusoidal positions
      
      * update pegasus
      
      * update albert
      
      * update test
      
      * make sure dtype is passed to every module
      
      * style
      
      * fix electra dense
      
      * fix t5
      
      * quality
      
      * add more tests
      
      * better name
      
      * use the dtype for lm head computation
      
      * fix albert
      
      * style
      
      * fix albert embed dtype
      
      * more tests
      
      * fix vision enc-dec
      
      * cleanup
      
      * fix embed dtype pegasus
      
      * fix default param test
      
      * doc
      
      * update template
      
      * fix final_logits_bias dtype
      
      * Apply suggestions from code review
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      
      * fix doc
      
      * fix doc
      
      * add detailed docstring for dtype parameter
      
      * remove un-necessary import
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      e92190c0
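The point of this fix, per the bullets above, is that `dtype` should govern the computation precision while parameter dtypes are converted explicitly via dedicated methods ("add type conversion methods for flax models"). A sketch under that assumption (the model name is arbitrary; treat exact call signatures as illustrative):

```python
import jax.numpy as jnp
from transformers import FlaxBertModel

# dtype controls the computation dtype; parameters are still stored in float32.
model = FlaxBertModel.from_pretrained("bert-base-uncased", dtype=jnp.bfloat16)

# Converting the parameters themselves is a separate, explicit step.
model.params = model.to_bf16(model.params)
```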