Commits · 8f46ac98498dd47701971064617e00d7e723a98e · chenpangpang / transformers

25 May, 2022 12 commits

Spanish translation of the files sagemaker.mdx and image_classification.mdx (#17262) · 8f46ac98

Juanjo do Olmo authored May 26, 2022



* Duplication of the source eng file

* Spanish translation of the file multilingual.mdx

* Update docs/source_es/multilingual.mdx
Co-authored-by: Omar U. Espejel <espejelomar@gmail.com>

* Update docs/source_es/multilingual.mdx
Co-authored-by: Omar U. Espejel <espejelomar@gmail.com>

* Update docs/source_es/multilingual.mdx
Co-authored-by: Omar U. Espejel <espejelomar@gmail.com>

* Update docs/source_es/multilingual.mdx
Co-authored-by: Omar U. Espejel <espejelomar@gmail.com>

* Update docs/source_es/multilingual.mdx
Co-authored-by: Omar U. Espejel <espejelomar@gmail.com>

* Update docs/source_es/multilingual.mdx
Co-authored-by: Omar U. Espejel <espejelomar@gmail.com>

* Update docs/source_es/multilingual.mdx
Co-authored-by: Omar U. Espejel <espejelomar@gmail.com>

* Fix nits and finish translation

* Spanish translation of sagemaker.mdx

* Was deleted in main

* Security saving

* Complete translation of image_classification.mdx

* Nits

* nits

* Update docs/source/es/image_classification.mdx

* Add files to _toctree.yml

* Fix toctree and add tasks folder
Co-authored-by: Omar U. Espejel <espejelomar@gmail.com>

8f46ac98

Added es version of bertology.mdx doc (#17255) · 5e7f085f

Joaq authored May 25, 2022



* added bertology es doc

* toctree fix

* Update docs/source/es/bertology.mdx
Co-authored-by: Omar U. Espejel <espejelomar@gmail.com>

* Update docs/source/es/bertology.mdx
Co-authored-by: Omar U. Espejel <espejelomar@gmail.com>

* Update docs/source/es/bertology.mdx
Co-authored-by: Omar U. Espejel <espejelomar@gmail.com>

* change position of bertology in _toctree.yml
Co-authored-by: Omar U. Espejel <espejelomar@gmail.com>

5e7f085f

Adding the Portuguese version of the tasks/sequence_classification.mdx documentation (#17352) · 70484a8d
Jonatas Grosman authored May 25, 2022
```
* add sequence_classification pt doc structure

* add Portuguese tasks/sequence_classification.mdx
```
70484a8d

Wav2vec2 finetuning shared file system (#17423) · a9eca743

Patrick von Platen authored May 25, 2022



* fix_torch_device_generate_test

* remove @

* [Fix shared file system]
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

a9eca743

fix link in performance docs (#17419) · 740a1574
Leandro von Werra authored May 25, 2022

740a1574
Add link to Hub PR docs in model cards (#17421) · 284fc6c0
lewtun authored May 25, 2022

284fc6c0
Upd AutoTokenizer.from_pretrained doc examples (#17416) · 35e2d13f
Cookie_thief authored May 25, 2022

35e2d13f

Support compilation via Torchdynamo, AOT Autograd, NVFuser (#17308) · 897a8dd8

Animesh Jain authored May 25, 2022



* Support compilation via Torchdynamo, AOT Autograd, NVFuser

* Address comments

* Lint

* Stas comments - missing quality test

* Lintere

* Quality test

* Doc lint

* Reset CUDA peak mem

* Add CustomTrainer

* require a single gpu
Co-authored-by: Stas Bekman <stas@stason.org>

897a8dd8

Add test for new model parallelism features (#17401) · 31484afb
Sylvain Gugger authored May 25, 2022

31484afb
Make check_init script more robust and clean inits (#17408) · 56b35ce3
Sylvain Gugger authored May 25, 2022

56b35ce3
Fix README localizer script (#17407) · bd908e9b
Sylvain Gugger authored May 25, 2022

bd908e9b
Fix expected value for OPT test `test_inference_no_head` (#17395) · 4d727bd2
Yih-Dar authored May 25, 2022
```
* Fix expected value

* 5e-5
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
4d727bd2

24 May, 2022 5 commits

Bump tensorflow in /examples/research_projects/decision_transformer (#17400) · 1ef9a1ed

dependabot[bot] authored May 24, 2022

Bumps [tensorflow](https://github.com/tensorflow/tensorflow) from 2.8.0 to 2.8.1.
- [Release notes](https://github.com/tensorflow/tensorflow/releases)
- [Changelog](https://github.com/tensorflow/tensorflow/blob/master/RELEASE.md)
- [Commits](https://github.com/tensorflow/tensorflow/compare/v2.8.0...v2.8.1

)

---
updated-dependencies:
- dependency-name: tensorflow
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

1ef9a1ed

[WIP] Adding GPT-NeoX-20B (#16659) · 71e60272

Jason Phang authored May 24, 2022



* initial

* first try

* working 20B

* 20B tokenizers

* Docs

* Import fixes for missing classes

* Update docs, fixup

* black formatting

* isort

* flake

* dummy objects

* documentation

* Documentation yml

* more docs

* tweaks for tests

* tokenization auto

* fix neox tests

* test

* test

* einsum

* address PR feedback

* Documentation

* Update README.md
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/gpt_neox/__init__.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/gpt_neox/configuration_gpt_neox.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Remove undefined LaTeX syntax

* Update to full url to avoid confusion about if that's supposed to refer to the Hub

* fix auto

* move tests

* documentation fix

* more doc fixes

* test refactor

* fix import

* fix import

* fix import

* fix import

* fix import

* style fixes

* More modeling fixes
Co-authored-by: Jason Phang <zp489@gr057.hpc.nyu.edu>
Co-authored-by: Stella Biderman <stellabiderman@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

71e60272

Clean up CLIP tests (#17380) · 374a2f69
NielsRogge authored May 24, 2022
```
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
```
374a2f69

Enabling `imageGPT` auto feature extractor. (#16871) · d9809298

Nicolas Patry authored May 24, 2022



* Enablign `imageGPT` auto feature extractor.
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

* Small updates.

* Update after rebase to use `input_ids` instead of `pixel_values`.
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

d9809298

Add LayoutLMv3 (#17060) · 31ee80d5

NielsRogge authored May 24, 2022



* Make forward pass work

* More improvements

* Remove unused imports

* Remove timm dependency

* Improve loss calculation of token classifier

* Fix most tests

* Add docs

* Add model integration test

* Make all tests pass

* Add LayoutLMv3FeatureExtractor

* Improve integration test + make fixup

* Add example script

* Fix style

* Add LayoutLMv3Processor

* Fix style

* Add option to add visual labels

* Make more tokenizer tests pass

* Fix more tests

* Make more tests pass

* Fix bug and improve docs

* Fix import of processors

* Improve docstrings

* Fix toctree and improve docs

* Fix auto tokenizer

* Move tests to model folder

* Move tests to model folder

* change default behavior add_prefix_space

* add prefix space for fast

* add_prefix_spcae set to True for Fast

* no space before `unique_no_split` token

* add test to hightligh special treatment of added tokens

* fix `test_batch_encode_dynamic_overflowing` by building a long enough example

* fix `test_full_tokenizer` with add_prefix_token

* Fix tokenizer integration test

* Make the code more readable

* Add tests for LayoutLMv3Processor

* Fix style

* Add model to README and update init

* Apply suggestions from code review

* Replace asserts by value errors

* Add suggestion by @ducviet00

* Add model to doc tests

* Simplify script

* Improve README

* a step ahead to fix

* Update pair_input_test

* Make all tokenizer tests pass - phew

* Make style

* Add LayoutLMv3 to CI job

* Fix auto mapping

* Fix CI job name

* Make all processor tests pass

* Make tests of LayoutLMv2 and LayoutXLM consistent

* Add copied from statements to fast tokenizer

* Add copied from statements to slow tokenizer

* Remove add_visual_labels attribute

* Fix tests

* Add link to notebooks

* Improve docs of LayoutLMv3Processor

* Fix reference to section
Co-authored-by: SaulLu <lucilesaul.com@gmail.com>
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>

31ee80d5

23 May, 2022 8 commits

Add support for `device_map="auto"` to OPT (#17382) · 13541b4a
Sylvain Gugger authored May 23, 2022

13541b4a
OPTForCausalLM lm_head input size should be config.word_embed_proj_dim (#17225) · 71cced8a
vfbd authored May 23, 2022

71cced8a

Use Accelerate in `from_pretrained` for big model inference (#17341) · 56f50590

Sylvain Gugger authored May 23, 2022



* Initial work

* More or less finished with first draft

* Update src/transformers/modeling_utils.py
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Update src/transformers/modeling_utils.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Fix randomly initialized weights

* Update src/transformers/modeling_utils.py
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>

* Address review comments

* Rename DeepSpeed folder to temporarily fix the test issue?

* Revert to try if Accelerate fix works

* Use latest Accelerate release

* Quality and fixes

* Style

* Quality

* Add doc

* Test + fix

* More blocks
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>

56f50590

Traced models serialization and torchscripting fix (#17206) · 2e7e4280

Michael Benayoun authored May 23, 2022

* Fix torch.jit.script and pickling issues

* Fix get_attr issues

* Fix import in function

* Fix GPT-J and T5 tracing for torch=1.11

* Gate graph surgery on torch version

* Modeling minor changes to enable TorchScripting

* Model serialization / deserialization test

* Remove _assert_is_none users

2e7e4280

Fix Comet ML integration (#17381) · 1cd01b0a

Maximilian Schmidt authored May 23, 2022

Callback function `on_train_end` crashed if Comet ML integration was
used but `COMET_MODE` set to `DISABLE`

1cd01b0a

Fix cvt docstrings (#17367) · c86aad61
Anugunj Naman authored May 23, 2022

c86aad61

Correct & Improve Doctests for LayoutLMv2 (#17168) · 7b8cb269

ghlai9665 authored May 23, 2022



* add inference example to LayoutLMv2ForQuestionAnswering, passing doctest

* add loss example to LayoutLMv2ForQuestionAnswering, passing doctest

* Add correct doctest for LayoutLMv2ForTokenClassification, passing doctest

* add correct doctest for LayoutLMv2ForSequenceClassification, passing test

* add correct doctest for LayoutLMv2Model, passing test

* make fixup

* fix to address review comments

* make style

* fix doctest line break issue, add to documentaiton_tests.txt, address review comments

* move comment about layoutlmv2 dependencies to the doc page

* format doc page as suggested
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* delete extraneous backtick
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

7b8cb269

Fix CodeParrot training script (#17291) · b48ac1a0

Loubna Ben Allal authored May 23, 2022



* average loss over batches and accumulated steps for tracking

* fix layernorm weight decay

* use AdamW from Pytorch instead of Transformers

* add shuffling of sequences inside the batches

* add shuffling of sequences inside the batches

* add logging dir and reformat code

* fix lr tracking

* remove Mistral scaling

* keep Mistral scaling

* reformat code

* fix error

* fix error

* use shuffling function from Pytorch

* remove argument for shuffling batch sequences as it isn't optional

* update package versions and install accelerate from source

* remove unused package

* Update loss average over accumulated steps
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* Update loss average over accumulated steps
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* use one shuffle buffer argument

* compute avg_loss in one line
Co-authored-by: Loubna ben allal <loubnabenallal@gmail.com>
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

b48ac1a0

20 May, 2022 2 commits
- Fix a typo relative_postion_if_large -> relative_position_if_large (#17366) · b9bb4173
  Daniel Stancl authored May 20, 2022
  
  b9bb4173
- Pin dill to fix examples (#17368) · 3fd7de49
  Sylvain Gugger authored May 20, 2022
```
* Pin dill for now

* Try this version?

* force install

* Actually use dep in testing

* Try a larger pin
```
  3fd7de49
19 May, 2022 7 commits

[Test OPT] Add batch generation test opt (#17359) · 54192058
Patrick von Platen authored May 19, 2022
```
* up

* up
```
54192058
Fix bug in Wav2Vec2 pretrain example (#17326) · 48c22691
ddobokki authored May 20, 2022

48c22691
fix for 17292 (#17293) · 5d6feecf
Nathan Dahlberg authored May 19, 2022

5d6feecf
[Generation] Fix Transition probs (#17311) · 518bd02c
Patrick von Platen authored May 19, 2022
```
* [Draft] fix transition probs

* up

* up

* up

* make it work

* fix

* finish

* update
```
518bd02c

[OPT] Run test in lower precision on GPU (#17353) · e8714c03

Patrick von Platen authored May 19, 2022

* [OPT] Run test only in half precision

* up

* up

* up

* up

* finish

* fix on GPU

* Update tests/models/opt/test_modeling_opt.py

e8714c03

Adding `batch_size` test to QA pipeline. (#17330) · 2b282296
Nicolas Patry authored May 19, 2022

2b282296

[BC] Fixing usage of text pairs (#17324) · a4386d7e

Nicolas Patry authored May 19, 2022



* [BC] Fixing usage of text pairs

The BC is actually preventing users from misusing the pipeline since
users could have been willing to send text pairs and the pipeline would
instead understand the thing as a batch returning bogus results.

The correct usage of text pairs is preserved in this PR even when that
makes the code clunky.

Adds support for {"text":..,, "text_pair": ...} inputs for both dataset
iteration and more explicit usage to pairs.

* Updating the doc.

* Update src/transformers/pipelines/text_classification.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/pipelines/text_classification.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update tests/pipelines/test_pipelines_text_classification.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* quality.
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

a4386d7e

18 May, 2022 6 commits

[tests] fix copy-n-paste error (#17312) · 3601aa8f
Stas Bekman authored May 18, 2022
```
* [tests] fix copy-n-paste error

* fix
```
3601aa8f

Fix ci_url might be None (#17332) · 1b20c970

Yih-Dar authored May 18, 2022



* fix

* Update utils/notification_service.py
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>

1b20c970

fix (#17337) · 6aad3872

Yih-Dar authored May 18, 2022


Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

6aad3872

Fix metric calculation in examples and setup tests to run on multi-gpu for... · 1762ded3

Zachary Mueller authored May 18, 2022

Fix metric calculation in examples and setup tests to run on multi-gpu for no_trainer scripts (#17331)

* Fix length in no_trainer examples

* Add setup and teardown

* Use new accelerator config generator to automatically make tests able to run based on environment

1762ded3

docs for typical decoding (#17186) · 6e195eb9
Jader Martins authored May 18, 2022
```
Co-authored-by: Jader Martins <jadermcs94@gmail.com>
```
6e195eb9

Not send successful report (#17329) · 060fe61d

Yih-Dar authored May 18, 2022



* send report only if there is any failure
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

060fe61d