Commits · cdf19c501de82aefb922e3718d3def753a3ba4bb · chenpangpang / transformers

"vscode:/vscode.git/clone" did not exist on "01b8cd59324565a713a736fe77bc2bd9d60494cb"

15 Feb, 2022 6 commits
- Re-export `KeyDataset`. (#15645) · cdf19c50
  Nicolas Patry authored Feb 15, 2022
```
* Re-export `KeyDataset`.

* Update the docs locations.
```
  cdf19c50
- add a network debug script and document it (#15652) · 28e6155d
  Stas Bekman authored Feb 15, 2022
```
* add a network debug script and document it

* doc
```
  28e6155d
- Add section about doc testing (#15659) · f45ac11f
  Patrick von Platen authored Feb 15, 2022
```
* Add doctesting section

* Improve

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
```
  f45ac11f
- Fix typo in speech2text2 doc (#15617) · 86a7845c
  jonrbates authored Feb 15, 2022
```
Forward looks for inputs, not input_ids
```
  86a7845c
- Revert "logger doc" · 05a85809
  fra authored Feb 15, 2022
```
This reverts commit 41168a49.
```
  05a85809
- logger doc · 41168a49
  fra authored Feb 15, 2022
  
  41168a49
14 Feb, 2022 1 commit

Make Swin work with VisionEncoderDecoderModel (#15527) · b090b790

NielsRogge authored Feb 14, 2022



* Add attribute_map

* Add mention in docs

* Set hidden_size attribute correctly

* Add note about Transformer-based models only
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MBP.localdomain>

b090b790

11 Feb, 2022 4 commits
- Fix grammar in tokenizer_summary (#15614) · 4f403ea8
  Daniel Erenrich authored Feb 11, 2022
```
"to make ensure" is redundant.
```
  4f403ea8
- [deepspeed docs] misc additions (#15585) · f15c99fa
  Stas Bekman authored Feb 11, 2022
```
* [deepspeed docs] round_robin_gradients

* training and/or eval/predict loss is

* Update docs/source/main_classes/deepspeed.mdx
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
```
  f15c99fa
- 🖍 remove broken link (#15615) · 85aee09e
  Steven Liu authored Feb 11, 2022
  
  85aee09e
- Mark "code in the Hub" API as experimental (#15624) · 6cf06d19
  Sylvain Gugger authored Feb 11, 2022
  
  6cf06d19
10 Feb, 2022 3 commits

Correct JSON format (#15600) · c0864d98
Ngo Quang Huy authored Feb 11, 2022

c0864d98
Add local and TensorFlow ONNX export examples to docs (#15604) · 2e8b85f7
lewtun authored Feb 10, 2022
```
* Add local and TensorFlow ONNX export examples to docs

* Use PyTorch - TensorFlow split
```
2e8b85f7

Add Tensorflow handling of ONNX conversion (#13831) · cb7ed6e0

Alberto Bégué authored Feb 10, 2022



* Add TensorFlow support for ONNX export

* Change documentation to mention conversion with Tensorflow

* Refactor export into export_pytorch and export_tensorflow

* Check model's type instead of framework installation to choose between TF and Pytorch
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Alberto Bégué <alberto.begue@della.ai>
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>

cb7ed6e0

09 Feb, 2022 6 commits

Expand tutorial for custom models (#15587) · c722753a

Sylvain Gugger authored Feb 09, 2022



* Expand tutorial for custom models

* Style

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>

c722753a

Add link (#15588) · a86ee226

NielsRogge authored Feb 09, 2022


Co-authored-by: Niels Rogge <nielsrogge@Nielss-MBP.localdomain>

a86ee226

[trainer docs] document how to select specific gpus (#15551) · dee17d56
Stas Bekman authored Feb 09, 2022
```
* [trainer docs] document how to select specific gpus

* expand

* add urls

* add accelerate launcher
```
dee17d56

Constrained Beam Search [without disjunctive decoding] (#15416) · 2b5603f6

Chan Woo Kim authored Feb 10, 2022



* added classes to get started with constrained beam search

* in progress, think i can directly force tokens now but not yet with the round robin

* think now i have total control, now need to code the bank selection

* technically works as desired, need to optimize and fix design choices leading to undersirable outputs

* complete PR #1 without disjunctive decoding

* removed incorrect tests

* Delete k.txt

* Delete test.py

* Delete test.sh

* revert changes to test scripts

* genutils

* full implementation with testing, no disjunctive yet

* shifted docs

* passing all tests realistically ran locally

* removing accidentally included print statements

* fixed source of error in initial PR test

* fixing the get_device() vs device trap

* fixed documentation docstrings about constrained_beam_search

* fixed tests having failing for Speech2TextModel's floating point inputs

* fix cuda long tensor

* added examples and testing for them and founx & fixed a bug in beam_search and constrained_beam_search

* deleted accidentally added test halting code with assert False

* code reformat

* Update tests/test_generation_utils.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update tests/test_generation_utils.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update tests/test_generation_utils.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update tests/test_generation_utils.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update tests/test_generation_utils.py

* fixing based on comments on PR

* took out the testing code that should but work fails without the beam search moditification ; style changes

* fixing comments issues

* docstrings for ConstraintListState

* typo in PhrsalConstraint docstring

* docstrings improvements
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

2b5603f6

add model scaling section (#15119) · d923f762

Leandro von Werra authored Feb 09, 2022



* add model scaling section

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* integrate reviewer feedback

* initialize GPU properly

* add note about BnB optimizer

* move doc from `scaling.mdx` to `performance.mdx`

* integrate reviewer feedback

* revert section levels
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

d923f762

PoC for a ProcessorMixin class (#15549) · b5c6fdec

Sylvain Gugger authored Feb 09, 2022



* PoC for a ProcessorMixin class

* Documentation

* Apply suggestions from code review
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Suraj Patil <surajp815@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Roll out to other processors

* Add base feature extractor class in init

* Use args and kwargs
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Suraj Patil <surajp815@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

b5c6fdec

08 Feb, 2022 3 commits

📝 Add codecarbon callback to docs (#15563) · fcb4f11c
Nathan Raw authored Feb 08, 2022

fcb4f11c

Add TFSpeech2Text (#15113) · 8406fa6d

Joao Gante authored Feb 08, 2022

* Add wrapper classes

* convert inner layers to tf

* Add TF Encoder and Decoder layers

* TFSpeech2Text models

* Loadable model

* TF model with same outputs as PT model

* test skeleton

* correct tests and run the fixup

* correct attention expansion

* TFSpeech2Text pask_key_values with TF format

8406fa6d

electra is added to onnx supported model (#15084) · 87d08afb

aaron authored Feb 08, 2022



* electra is added to onnx supported model

* add google/electra-base-generator for test onnx module
Co-authored-by: Lewis Tunstall <lewis.c.tunstall@gmail.com>

87d08afb

07 Feb, 2022 3 commits

Create a custom model guide (#15489) · 552f8d30

Steven Liu authored Feb 07, 2022

* 📝 add config section

* 📝 finish first draft

* 📝 add feature extractor and processor

* 🖍 apply feedback from review

* 📝 minor edits

* last review

552f8d30

Remove Longformers from ONNX-supported models (#15273) · 6775b211
lewtun authored Feb 07, 2022

6775b211

Add ConvNeXT (#15277) · 84eec9e6

NielsRogge authored Feb 07, 2022



* First draft

* Add conversion script

* Improve conversion script

* Improve docs and implement tests

* Define model output class

* Fix tests

* Fix more tests

* Add model to README

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Apply more suggestions from code review

* Apply suggestions from code review

* Rename dims to hidden_sizes

* Fix equivalence test

* Rename gamma to gamma_parameter

* Clean up conversion script

* Add ConvNextFeatureExtractor

* Add corresponding tests

* Implement feature extractor correctly

* Make implementation cleaner

* Add ConvNextStem class

* Improve design

* Update design to also include encoder

* Fix gamma parameter

* Use sample docstrings

* Finish conversion, add center cropping

* Replace nielsr by facebook, make feature extractor tests smaller

* Fix integration test
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

84eec9e6

04 Feb, 2022 3 commits

[deepspeed docs] DeepSpeed ZeRO Inference (#15486) · 8ce13306

Stas Bekman authored Feb 04, 2022



* [deepspeed docs] DeepSpeed ZeRO Inference

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* tweak

* deal with black

* extra cleanup, better comments
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

8ce13306

Standardize semantic segmentation models outputs (#15469) · ac6aa10f

Sylvain Gugger authored Feb 04, 2022



* Standardize instance segmentation models outputs

* Rename output

* Update src/transformers/modeling_outputs.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Add legacy argument to the config and model forward

* Update src/transformers/models/beit/modeling_beit.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Copy fix in Segformer
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

ac6aa10f

[deepspeed docs] Megatron-Deepspeed info (#15488) · 31be2f45
Stas Bekman authored Feb 04, 2022

31be2f45

03 Feb, 2022 1 commit
- [deepspeed docs] memory requirements (#15506) · 21dcaec5
  Stas Bekman authored Feb 03, 2022
  
  21dcaec5
02 Feb, 2022 3 commits

Save code of registered custom models (#15379) · 44b21f11

Sylvain Gugger authored Feb 02, 2022



* Allow dynamic modules to use relative imports

* Work for configs

* Fix last merge conflict

* Save code of registered custom objects

* Map strings to strings

* Fix test

* Add tokenizer

* Rework tests

* Tests

* Ignore fixtures py files for tests

* Tokenizer test + fix collection

* With full path

* Rework integration

* Fix typo

* Remove changes in conftest

* Test for tokenizers

* Add documentation

* Update docs/source/custom_models.mdx
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Add file structure and file content

* Add more doc

* Style

* Update docs/source/custom_models.mdx
Co-authored-by: Suraj Patil <surajp815@gmail.com>

* Address review comments
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Suraj Patil <surajp815@gmail.com>

44b21f11

Update tutorial docs (#15165) · b9418a1d

Steven Liu authored Feb 01, 2022

* first draft of pipeline, autoclass, preprocess tutorials

* apply review feedback

* 🖍 apply feedback from patrick/niels

* 📝add output image to preprocessed image

* 🖍 apply feedback from patrick

b9418a1d

Update fine-tune docs (#15259) · c157c7e3

Steven Liu authored Feb 01, 2022

* add fine-tune tutorial

* make edits, fix style

* 📝 make edits

* 🖍 fix code format links to external libraries

* 🔄revert code formatting

* 🖍 use DefaultDataCollator instead of DataCollatorWithPadding

c157c7e3

31 Jan, 2022 4 commits
- [deepspeed doc] fix import, extra notes (#15400) · 44c7857b
  Stas Bekman authored Jan 31, 2022
```
* [deepspeed doc] fix import, extra notes

* typo
```
  44c7857b
- Add header (#15434) · 47df0f22
  NielsRogge authored Jan 31, 2022
  
  47df0f22
- add t5 ner finetuning (#15432) · 282ae123
  Ogundepo Odunayo authored Jan 31, 2022
  
  282ae123
- Update README.md (#15430) · 3254080d
  Kamal Raj authored Jan 31, 2022
```
fix typo
```
  3254080d
29 Jan, 2022 3 commits

Add support for XLM-R XL and XXL models by modeling_xlm_roberta_xl.py (#13727) · e09473a8

Soonhwan-Kwon authored Jan 29, 2022



* add xlm roberta xl

* add convert xlm xl fairseq checkpoint to pytorch

* fix init and documents for xlm-roberta-xl

* fix indention

* add test for XLM-R xl,xxl

* fix model hub name

* fix some stuff

* up

* correct init

* fix more

* fix as suggestions

* add torch_device

* fix default values of doc strings

* fix leftovers

* merge to master

* up

* correct hub names

* fix docs

* fix model

* up

* finalize

* last fix

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* add copied from

* make style
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

e09473a8

Get started docs (#15098) · 16d4acbf

Steven Liu authored Jan 28, 2022

* clean commit of changes

* apply review feedback, make edits

* fix backticks, minor formatting

* 🖍 make fixup and minor edits

* 🖍 fix # in header

* 📝 update code sample without from_pt

* 📝 final review

16d4acbf

Update model share tutorial (#15288) · cabd6d26

Steven Liu authored Jan 28, 2022

* add model sharing tutorial

* 🖍 apply feedback from review

* 📝 make edits

* 🖍 fix formatting

* 📝 convert from pt checkpoint to flax

* 📝 final review

cabd6d26