Commits · 473709fc761fcf0a1e3d321d12ed2b1e9548e86d · chenpangpang / transformers

28 Mar, 2022 8 commits

Use doc builder styler (#16412) · 473709fc

Sylvain Gugger authored Mar 28, 2022

* Config update

* Use doc-builder styler

* Cleanup

* Adapt import

* We need it there too!

473709fc

Update run_t5_mlm_flax.py (#16421) · 8049dfa4
Yongrae Jo authored Mar 28, 2022
```
Fix typo in comment: proprocessed -> preprocessed
```
8049dfa4
[Flax] Improve Robustness of Back-Prop Tests (#16418) · 925fc57b
Sanchit Gandhi authored Mar 28, 2022
```
* [Flax] Improve Robustness of Back-Prop Tests

* check equality of logits/outputs

* make fixup
```
925fc57b
QDQBert example update (#16395) · 7ecbb9c5
Shang Zhang authored Mar 28, 2022
```
* update Dockerfile and utils_qa

* Update README.md
```
7ecbb9c5
`cached_download ∘ hf_hub_url` is `hf_hub_download` (#16375) · f6f6866e
Julien Chaumond authored Mar 28, 2022

f6f6866e

Fix broken links (#16113) · c88ff66c

Kurian Benoy authored Mar 28, 2022



* Update marian.mdx

* Update marian.mdx

* Update docs/source/model_doc/marian.mdx
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Update marian.mdx
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

c88ff66c

Update comments in class BatchEncoding (#15932) · 342ff6eb
Jia authored Mar 28, 2022

342ff6eb

remove references to PDF reading via PIL (#15293) · e02f95b2

Nathan Glenn authored Mar 28, 2022

* fix confusing PIL instructions

As stated in the documentation
[here](https://pillow.readthedocs.io/en/stable/handbook/image-file-formats.html?highlight=pdf#write-only-formats

),
PIL can only write PDF's, not read them. Remove references to reading
PDF's via PIL from this page to avoid confusion.

* mention PDF in doc examples using PIL
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Be explicit: PDFs must be converted to images

* fix formatting
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

e02f95b2

27 Mar, 2022 1 commit
- TF: removed inputs_processing and replaced with decorator in lxmert (#16414) · 3dc82427
  Shamima authored Mar 27, 2022
  
  3dc82427
25 Mar, 2022 11 commits

Create concept guide section (#16369) · b320d87e

Steven Liu authored Mar 25, 2022

* ✨ create concept guide section

* 🖍 make fixup

* 🖍

 apply feedback
Co-authored-by: Steven <stevhliu@gmail.com>

b320d87e

Add TF implementation of GPT-J (#15623) · ed2ee373

Daniel Stancl authored Mar 25, 2022

* Initial commit

* Add TFGPTJModel

* Fix a forward pass

* Add TFGPTJCausalLM

* Add TFGPTJForSequenceClassification

* Add TFGPTJForQuestionAnswering

* Fix docs

* Deal with TF dynamic shapes

* Add Loss parents to models

* Adjust split and merge heads to handle 4 and 5-dim tensors

* Update outputs for @tooslow tests

ed2ee373

Fix Typo in Argument of FlaxWav2Vec2ForPreTrainingModule (#16084) · aa4c0a86
Sanchit Gandhi authored Mar 25, 2022

aa4c0a86
[FlaxSpeechEncoderDecoder] Fix feature extractor gradient test (#16407) · e231c729
Sanchit Gandhi authored Mar 25, 2022

e231c729

Add ONNX support for Blenderbot and BlenderbotSmall (#15875) · a97f3150

lewtun authored Mar 25, 2022

* Add ONNX support for Blenderbot

* Add BlenderbotSmall ONNX configuration

* Update serialization table

a97f3150

Checkpoint sharding (#16343) · b473617d

Sylvain Gugger authored Mar 25, 2022



* Sharded checkpoint support

* Handle distant sharded checkpoints

* Add tests

* TODO is done

* Apply suggestions from code review
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Fix docstring

* Add example and format

* Address review comments

* More review comments

* End of merge

* Revert unintentional change

* VsCode what did you do?

* Style

* Changes

* Address final comments

* Quality

* Moar tests

* Move import beneath is_pt_available
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

b473617d

Terminate previous pushes when we get to the final push (#16409) · 7fa7408b
Matt authored Mar 25, 2022

7fa7408b
Rename master to main for notebooks links and leftovers (#16397) · 867f3950
Sylvain Gugger authored Mar 25, 2022

867f3950
fixed typo from enable to disable in disable_progress_bar function (#16406) · 7e749047
Atharva Ingle authored Mar 25, 2022

7e749047
Big file_utils cleanup (#16396) · 088c1880
Sylvain Gugger authored Mar 25, 2022
```
* Big file_utils cleanup

* This one still needs to be treated separately
```
088c1880
Make FeaturesManager.get_model_from_feature a static method (#16357) · 2b23e080
Michael Benayoun authored Mar 25, 2022

2b23e080

24 Mar, 2022 13 commits

Rename to SemanticSegmenterOutput (#15849) · aa6cfe9c
NielsRogge authored Mar 24, 2022
```
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
```
aa6cfe9c

Added type hints (#16389) · 70a9bc69

Yi Heng Lim authored Mar 25, 2022

* Added type hints for PyTorch T5 model

* removed a type hint

* ran make style

* added type hints for ibert pytorch

* added type hints for lxmert pytorch

* removed kwargs type hint and fixed arguments order

70a9bc69

Adapt import to new structure · cae394c8
Sylvain Gugger authored Mar 24, 2022

cae394c8

TF - variable naming for Distilbert model (unpack_inputs decorator) (#16384) · 4e0f583e

Robot Jelly authored Mar 24, 2022



* variable naming for Distilbert model

* adding unpack inputs at top

* make style/quality
Co-authored-by: matt <rocketknight1@gmail.com>

4e0f583e

Fix readme links and add CI check (#16392) · 3a0f1684

Sylvain Gugger authored Mar 24, 2022

* Fix doc links in README

* Fix name

* Fix links in READMEs and doc index

* Error if there is something wrong so the CI knows

3a0f1684

Fix style (#16391) · 8cbd9b8f
Lysandre Debut authored Mar 24, 2022

8cbd9b8f
bump cookiecutter version (#16387) · 9d88be57
Yih-Dar authored Mar 24, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
9d88be57

Update PT Flax equivalence tests in PT test file (#16280) · f571dc20

Yih-Dar authored Mar 24, 2022



* update PT/Flax equivalence tests on PT side

* overwrite check_outputs in BigBirdModelTest
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

f571dc20

Add type hints for ConvBert model (#16377) · 41bfc1e2

Zehua Li authored Mar 24, 2022



* Add missing type hints for ConvBERT flavored models.

* Update src/transformers/models/convbert/modeling_convbert.py
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

41bfc1e2

Type hints and decorator for TF T5 (#16376) · 23a75a53

Dahlbomii authored Mar 24, 2022



* Type hints and TF decorator added

* Re-add XLA generation method

* Re-add lines that were deleted by conflicting updates

* Re-add lines that were deleted by conflicting updates

* Re-add lines that were deleted by conflicting updates
Co-authored-by: matt <rocketknight1@gmail.com>

23a75a53

Fix BigBirdModelTester (#16310) · 2a27c800

Yih-Dar authored Mar 24, 2022



* fix

* update the expected value in test_fast_integration
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

2a27c800

Update readme with how to train offline and fix BPE command (#15897) · f5e8c9bd

Nathan Cooper authored Mar 24, 2022



* Update readme with how to train offline and fix BPE command

* Update examples/research_projects/codeparrot/README.md
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* Update examples/research_projects/codeparrot/README.md
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* Update examples/research_projects/codeparrot/README.md
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* Update examples/research_projects/codeparrot/README.md
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

f5e8c9bd

[Doctests] Make TFRoberta-like meaningfull (#16370) · 9badcecf

Yih-Dar authored Mar 24, 2022



* update doc examples for TFRoberta

* fix style

* fix style

* use TF ckpt

* apply suggestion

* add the code file to test here

* fix style
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

9badcecf

23 Mar, 2022 7 commits

[Doctests] Make roberta-like meaningfull (#16363) · 77c5a805

Patrick von Platen authored Mar 24, 2022

* [Doctests] Make roberta-like meaningfull

* correct

* final correct

* Trigger test

* make style

* apply suggestion from sylvain

77c5a805

Make BigBird model compatiable to fp16 dtype. (#16034) · 5f0d07b3

Xu Zhao authored Mar 23, 2022

* Make BigBird model compatiable to fp16 dtype.

* Use tree_map instead of map

* Reformat the code

* Fix import order

* Convert masks to the correct dtype

* Fix format issue

* Address comments.

5f0d07b3

Update docs/README.md (#16333) · 1cf28da6

Yih-Dar authored Mar 23, 2022



* Update docs/README.md
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

1cf28da6

add GPT-J ONNX config to Transformers (#16274) · 029b0d95

Thomas Chaigneau authored Mar 23, 2022



* add GPT-J ONNX config to Transformers

* remove token-classification features mapping
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>

* add question-answering features mapping
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>

* add GPT2 config init to GPT2 config + copie shebang for fix-copies
Co-authored-by: ChainYo <t.chaigneau.tc@gmail.com>
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>

029b0d95

Decision transformer gym (#15845) · aff9bc40

Edward Beeching authored Mar 23, 2022



* Created the Decision Transformer Modle

* updating tests, copy to other machine

* Added last hidden size to Decision Transformer modelling outputs

* Removed copy of original DT file

* made a temporary change to gpt2 to have it conform with the Decision Transformer version

* Updated tests

* Ignoring a file used to test the DT model

* added comments to config file

* added comments and argument descriptions to decision transformer file

* Updated doc

* Ran "make style"

* Remove old model imports

* Removed unused imports, cleaned up init file

* Update docs/source/model_doc/decision_transformer.mdx

added my username
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Reverted changes made to gpt2

* Removed datasets submodule

* Update the modeling outputs to include gpt2 attentions, hidden states and last hidden states

* Added support for return of hidden states, attentions and return dict of gpt2 model.

* Updated tests to include many of the ModelTesterMixin tests. 

The following tests are skipped: test_generate_without_input_ids, test_pruning, test_resize_embeddings, test_head_masking, test_attention_outputs, test_hidden_states_output, test_inputs_embeds, test_model_common_attributes

* Added missing line to the end of gpt2 file

* Added an integration test for the Decision Transformer

Test performs and autoregressive evaluation for two time steps

* Set done and info to _ to fix failing test

* Updated integration test to be deterministic and check expected outputs

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Removed unnecessary config options

* Cleaned up commented code and old comments.

* Cleaned up commented code.

* Changed DecisionTransformer to Decision Transformer

* Added Decision Transformer to the main README file

* Added copy of GTP2 called DecisionTranformerGPT2Model

* isorted imports

* isorted imports

* Added model to non-English README files

* Ran make fix-copies and corrected some cases.

* Updated index file to include Decision Transformer

* Added gpt2 model as copy inside the Decision Transformer model file

* Added the unit test file to the list of TEST_FILES_WITH_NO_COMMON_TESTS

* Deleted redundant checkpoint files (I don't know how these got committed)

* Removed testing files. (These should have never been committed)

* Removed accidentally committed files

* Moved the Decision Transformer test to its own directory

* Add type hints for Pegasus (#16324)

* Funnel type hints (#16323)

* add pt funnel type hints

* add tf funnel type hints

* Add type hints for ProphetNet PyTorch (#16272)

* [GLPN] Improve docs (#16331)

* Add link to notebook

* Add link

* Fix bug
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>

* Added type hints for Pytorch Marian calls (#16200)

* Added type hinting for forward functions in pytorch marian

* typo correction

* Removed type hints on functions from BART per Suraj Patil request

* fix import pb

* fix typo

* corrected tuple call

* ran black

* after fix-copies
Some optional tags on primitives were removed, past_key_values in MarianForCausalLM changed from Tuple of Tuple to List

* Fixing copies to roformer and pegasus
Co-authored-by: Clementine Fourrier <cfourrie@inria.fr>
Co-authored-by: matt <rocketknight1@gmail.com>

* Moved DecisionTransformOutput to modeling_decision_transformer

* Moved the example usage to research project and cleaned comments

* Made tests ignore the copy of gpt2 in Decision Transformer

* Added module output to modelling decision transformer

* removed copied gpt2 model from list of transformers models

* Updated tests and created __init__ file for new test location

* Update README.md
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/decision_transformer/configuration_decision_transformer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Removed unneeded summary type from config file

* Fixed copies

* Updated pretrained config map to refer to hopper-medium checkpoint

* done (#16340)

* Added Decision transformer to model docs

* Update src/transformers/models/decision_transformer/modeling_decision_transformer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/decision_transformer/modeling_decision_transformer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/decision_transformer/configuration_decision_transformer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Add type annotations for Rembert/Splinter and copies (#16338)

* undo black autoformat

* minor fix to rembert forward with default

* make fix-copies, make quality

* Adding types to template model

* Removing List from the template types

* Remove `Optional` from a couple of types that don't accept `None`
Co-authored-by: matt <rocketknight1@gmail.com>

* [Bug template] Shift responsibilities for long-range (#16344)

* Fix code repetition in serialization guide (#16346)

* Adopt framework-specific blocks for content (#16342)

* ✨ refactor code samples with framework-specific blocks

* ✨ update training.mdx

* 🖍

 apply feedback

* Updates the default branch from master to main (#16326)

* Updates the default branch from master to main

* Links from `master` to `main`

* Typo

* Update examples/flax/README.md
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Updated model with custom docstring example

* Created the Decision Transformer Modle

* updating tests, copy to other machine

* Added last hidden size to Decision Transformer modelling outputs

* Removed copy of original DT file

* made a temporary change to gpt2 to have it conform with the Decision Transformer version

* Updated tests

* Ignoring a file used to test the DT model

* added comments to config file

* added comments and argument descriptions to decision transformer file

* Updated doc

* Ran "make style"

* Remove old model imports

* Removed unused imports, cleaned up init file

* Update docs/source/model_doc/decision_transformer.mdx

added my username
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Reverted changes made to gpt2

* Removed datasets submodule

* Update the modeling outputs to include gpt2 attentions, hidden states and last hidden states

* Added support for return of hidden states, attentions and return dict of gpt2 model.

* Updated tests to include many of the ModelTesterMixin tests. 

The following tests are skipped: test_generate_without_input_ids, test_pruning, test_resize_embeddings, test_head_masking, test_attention_outputs, test_hidden_states_output, test_inputs_embeds, test_model_common_attributes

* Added missing line to the end of gpt2 file

* Added an integration test for the Decision Transformer

Test performs and autoregressive evaluation for two time steps

* Set done and info to _ to fix failing test

* Updated integration test to be deterministic and check expected outputs

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Removed unnecessary config options

* Cleaned up commented code and old comments.

* Cleaned up commented code.

* Changed DecisionTransformer to Decision Transformer

* Added Decision Transformer to the main README file

* Added copy of GTP2 called DecisionTranformerGPT2Model

* isorted imports

* isorted imports

* Added model to non-English README files

* Ran make fix-copies and corrected some cases.

* Updated index file to include Decision Transformer

* Added gpt2 model as copy inside the Decision Transformer model file

* Added the unit test file to the list of TEST_FILES_WITH_NO_COMMON_TESTS

* Deleted redundant checkpoint files (I don't know how these got committed)

* Removed testing files. (These should have never been committed)

* Removed accidentally committed files

* Moved the Decision Transformer test to its own directory

* Moved DecisionTransformOutput to modeling_decision_transformer

* Moved the example usage to research project and cleaned comments

* Made tests ignore the copy of gpt2 in Decision Transformer

* Added module output to modelling decision transformer

* removed copied gpt2 model from list of transformers models

* Updated tests and created __init__ file for new test location

* Update README.md
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/decision_transformer/configuration_decision_transformer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Removed unneeded summary type from config file

* Fixed copies

* Updated pretrained config map to refer to hopper-medium checkpoint

* Added Decision transformer to model docs

* Update src/transformers/models/decision_transformer/modeling_decision_transformer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/decision_transformer/modeling_decision_transformer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/decision_transformer/configuration_decision_transformer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Updated model with custom docstring example

* Updated copies, config auto, and readme files.
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Dan Tegzes <48134725+Tegzes@users.noreply.github.com>
Co-authored-by: Adam Montgomerie <adam@avanssion.com>
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
Co-authored-by: Clémentine Fourrier <22726840+clefourrier@users.noreply.github.com>
Co-authored-by: Clementine Fourrier <cfourrie@inria.fr>
Co-authored-by: matt <rocketknight1@gmail.com>
Co-authored-by: Francesco Saverio Zuppichini <francesco.zuppichini@gmail.com>
Co-authored-by: Jacob Dineen <54680234+jacobdineen@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Omar Sanseviero <osanseviero@gmail.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>

aff9bc40

Make Transformers use cache files when hf.co is down (#16362) · c595b6e6

Sylvain Gugger authored Mar 23, 2022

* Make Transformers use cache files when hf.co is down

* Fix tests

* Was there a random circleCI failure?

* Isolate patches

* Style

* Comment out the failure since it doesn't fail anymore

* Better comment

c595b6e6

Swap inequalities (#16368) · 8a69e023

OllieBroadhurst authored Mar 23, 2022



* Swap inequalities

* Update src/transformers/trainer_callback.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/trainer_callback.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

8a69e023