Commits · b0d539ccad090c8949c1740a9758b4152fad5f72 · chenpangpang / transformers

10 Feb, 2023 7 commits

Add X-MOD (#20939) · b0d539cc

Jannis Vamvas authored Feb 10, 2023



* Add X-MOD to Readme

* Add documentation for X-MOD

* Implement X-MOD

* Fix formatting of X-MOD docs

* Change signature of X-MOD forward methods to use lang_ids

* Minor changes

* Rebase with main and run make fix-copies

* Make suggested changes to docstrings

* Improve code readability
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

* Fix code style

* Conversion script: Remove asserts and type annotations

* Remove _TOKENIZER_FOR_DOC

* XMOD -> Xmod

* Update copyright note

* Fix doctests

* Fix docstring

* Add integration test for FillMaskPipeline

* Revert "Add integration test for FillMaskPipeline"

This reverts commit 4381eb3b1d0f5d85785f89caba83928e6efa6d1f.

* Add end-to-end integration test for mask fill

* make style

* Rebase with main and make fix-copies

---------
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

b0d539cc

GeneZC authored Feb 10, 2023

* Fix stuff related to the causal_mask in CodeGen.

1. Line 613, `_keys_to_ignore_on_load_missing  =  [r"h\.\d+\.attn\.masked_bias", r"h\.\d+\.attn\.bias"]` => `_keys_to_ignore_on_load_missing  =  [r"h\.\d+\.attn\.causal_mask"]` to load correctly from CodeGen checkpoint without `causal_mask`.
2. Line 152, `causal_mask = self.causal_mask[:, :, key_length - query_length : key_length, :key_length]
` => `causal_mask = self.causal_mask[:, :, key_length - query_length : key_length, :key_length].bool()
` to alleviate potential user warning saying like `UserWarning: where received a uint8 condition tensor. This behavior is deprecated and will be removed in a future version of PyTorch. Use a boolean condition instead.`.

* Revert the .bool()

Revert the .bool() and leave it to the future PR.

adb2503e

Remove CLI spams with Whisper FeatureExtractor (#21267) · 5b72b341

Quentin Meeus authored Feb 10, 2023

* Remove CLI spams with Whisper FeatureExtractor

Whisper feature extractor representation includes the MEL filters, a list of list that is represented as ~16,000 lines. This needlessly spams the command line. I added a `__repr__` method that replaces this list with a string "<array of shape (80, 201)>"

* Remove mel_filters from to_dict output  

Credits to @ArthurZucker

* remove unused import

* update feature extraction tests for the changes in to_dict

5b72b341

adding a tip for deepspeed integration in multi-node environment (#21459) · 129011c2

Eugene Zapolsky authored Feb 10, 2023



* adding note concerning use_node_local_storage

* overriding checkpoint.use_node_local_storage if save_on_each_node == True

* add more content

* add more content

* improve

* style

---------
Co-authored-by: Stas Bekman <stas@stason.org>

129011c2

Added with torch.no_grad() to Camembert integration test (#21544) · 21a2d900
Katie Le authored Feb 10, 2023
```
add with torch.no_grad() to Camembert integration test
Co-authored-by: Bibi <Bibi@katies-mac.local>
```
21a2d900

[`pipeline`] A simple fix for half-precision & 8bit models (#21479) · f8394268

Younes Belkada authored Feb 10, 2023



* v1 fix

* adapt from suggestions

* make style

* fix tests

* add gpu tests

* update docs

* fix other tests

* Apply suggestions from code review
Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>

* better fix

* make fixup

* better example

* revert changes

* proposal

* more elegant solution

* Update src/transformers/pipelines/automatic_speech_recognition.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

---------
Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

f8394268

Skip failing test for now · 97d3390f
Sylvain Gugger authored Feb 09, 2023

97d3390f

09 Feb, 2023 12 commits

Added with torch.no_grad() to XLM-Roberta integration test (#21547) · 23c146c3

Katie Le authored Feb 09, 2023



* added with torch.no_grad() to the integration tests and applied make style

* added with torch.no_grad() to xlm roberta forward pass

---------
Co-authored-by: Bibi <Bibi@katies-mac.local>

23c146c3

🚨

Enforce single model initialization (#21431) · 04b2f13c

Sylvain Gugger authored Feb 09, 2023

* Enforce single model initialization

* Add OneFormer example for problem 3

* Do it the Stas way

* Actually rename the uses...

* Rewrite test

* Try to change the test this way

* Fix all init slow/fast tests

* Break connection

* Fix more tests

* Fix test for initialization

* Remove custom test

* Quality

* Fix last failing tests

* The end?

04b2f13c

Fix from_pretrained API with config and state_dict (#21542) · 2020ac4b
Sylvain Gugger authored Feb 09, 2023

2020ac4b
Fix inclusion of non py files in package (#21546) · 1efe9c0b
Sylvain Gugger authored Feb 09, 2023
```
* Fix inclusion of non py files in package

* No need for the **
```
1efe9c0b
Align BLIP-2 winit with others · 7927732f
Sylvain Gugger authored Feb 09, 2023

7927732f

Add BLIP-2 (#21441) · d7f1e7c0

NielsRogge authored Feb 09, 2023



* First draft

* More improvements

* More improvements

* Improve conversion script

* Convert all weights

* Make forward pass work

* Make logits match

* More improvements

* More improvements

* More improvements

* Use get_input_embeddings

* Improve some more

* Improve model tests

* Improve model tests

* More improvements

* Fix processor

* Update files

* Update prepare_inputs_for_generation

* More improvements

* Fix copies

* More fixes

* Make fixup

* More improvements

* Add support for seq2seq language model

* More improvements

* Fix test

* More improvements

* Improve conversion script

* Remove some todo's

* Fix README's

* Improve conversion script

* Fix generation

* Fix style and remove Blip2Model

* Fix model outputs

* More improvements

* Set eos_token_id in config

* Fix quality

* Small improvements

* Add processor tests

* More improvements

* Apply suggestions

* Apply suggestions

* Add integration test

* Update image URL

* Add integration test

* Fix model_type

* Update style

* Improve docs

* Add doc tests

* Fix copies

* Remove tests which are passing

* Improve some more

* Add tests for seq2seq language models

* Minor fix

* Convert more checkpoints

* finalize CI

* Fix blip and blip2 processors

* add `accelerate` support for `blip2`

* clean up

* make style

* Update conversion script

* Update conversion script some more

* Update organization

* revert toc file

* add blip-2 to toc file

* Some more improvements

* Fix docstring

* Improve docs

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: younesbelkada <younesbelkada@gmail.com>

d7f1e7c0

fix typo in run_speech_recognition_ctc.py (#21528) · b31cee67

lee1jun authored Feb 09, 2023

Update run_speech_recognition_ctc.py

There should be `# limitations under the License` line at the end of the documentation section.

b31cee67

Tag tests as slow ⌛ (#21537) · 0d33381f
Joao Gante authored Feb 09, 2023
```
begone slow tests
```
0d33381f
Fix ClearML Integration to run in ClearML pipelines and external Tasks. (#21531) · 3a726777
Victor Sonck authored Feb 09, 2023
```
* Added clearml pipeline fix for when task is already initialized

* Correctly initialize
```
3a726777
Fix missing unfinished_sequences (#21529) · 17109ecf
Motoki Wu authored Feb 09, 2023
```
fix missing unfinished_sequences
```
17109ecf
Generate: TF `.generate()` can now be exported with dynamic length (#21474) · 2edf9a85
Joao Gante authored Feb 09, 2023

2edf9a85
Generate: make TF `.generate()` signature == PT `.generate()` signature (#21525) · e69f9715
Joao Gante authored Feb 09, 2023

e69f9715

08 Feb, 2023 11 commits

Add `__len__` method to `_LazyAutoMapping` (#21522) · c35bb6de

Yih-Dar authored Feb 08, 2023



Add `__len__` method to `_LazyAutoMapping`
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

c35bb6de

Fix multiple `eos_token_id`s in model.generate(...) (#21461) · 9960506c

Motoki Wu authored Feb 08, 2023

* add tests with multiple eos_token_ids

* make math.prod instead of sum

* make fixup

* fix long and also use np.prod since math.prod does not exist <python 3.8

* make fixup

* add prod util

* use prod util instead of np.prod

* make fixup

* previous .long location

* use tensor ops

* remove prod

* remove prod

* update device

* make fixup

* fix none

9960506c

Fixing backward compatiblity `image_processor` in pipeline. (#21513) · 06d940ef
Nicolas Patry authored Feb 08, 2023

06d940ef
[tests] add missing `report_to none` (#21505) · 8ea994d3
Stas Bekman authored Feb 08, 2023
```
[tests] report_to none
```
8ea994d3
Update OPT conversion script to work for OPT-IML (#21519) · 98d5b727
Thomas Wang authored Feb 08, 2023

98d5b727
no more dummies for speech processors (#21517) · fe616f35
Matthijs Hollemans authored Feb 08, 2023

fe616f35
Generate: TF `compute_transition_scores` (#21341) · 1d9c26a4
Joao Gante authored Feb 08, 2023

1d9c26a4
[Doc] Minor URL fixes in PyTorch Text Classification Readme (#21511) · d3046dad
Stefan Schweter authored Feb 08, 2023
```
docs: fix some references in PyTorch text classification readme
```
d3046dad

Bump cryptography from 36.0.2 to 39.0.1 in... · e024cd71

dependabot[bot] authored Feb 08, 2023

Bump cryptography from 36.0.2 to 39.0.1 in /examples/research_projects/decision_transformer (#21507)

Bump cryptography in /examples/research_projects/decision_transformer

Bumps [cryptography](https://github.com/pyca/cryptography) from 36.0.2 to 39.0.1.
- [Release notes](https://github.com/pyca/cryptography/releases)
- [Changelog](https://github.com/pyca/cryptography/blob/main/CHANGELOG.rst)
- [Commits](https://github.com/pyca/cryptography/compare/36.0.2...39.0.1

)

---
updated-dependencies:
- dependency-name: cryptography
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

e024cd71

Exclude the madeup words from M2M100Tokenizer.vocab_size (#20976) · ca905ba2
Guillaume Klein authored Feb 08, 2023

ca905ba2

Wrap RemBert integration test forward passes with torch.no_grad() (#21503) · cc1d0685

Katie Le authored Feb 08, 2023



added with torch.no_grad() to the integration tests and applied make style
Co-authored-by: Bibi <Bibi@katies-mac.local>

cc1d0685

07 Feb, 2023 10 commits

Fix import in Accelerate for find_exec_bs (#21501) · 5b67ab99
Sylvain Gugger authored Feb 07, 2023

5b67ab99
Check for mapping/dict in distributed_concat function (#21500) · eb1771ef
Prajwal Kailas authored Feb 07, 2023
```
check for mapping/dict in distributed_concat function

Co-authored-by: prajwal967 <user.email>
```
eb1771ef

Add XLM-V to Model Doc (#21498) · 7e51a441

Stefan Schweter authored Feb 07, 2023

* doc: introduce new section for XLM-V model

* doc: mention more details for XLM-V integration

* docs: paper abstract in italics, model identifier for base model added

* doc: mention new XLM-V support

* auto: add XLM-V mapping

* doc: run make fix-copies ;)

7e51a441

Add inverse sqrt learning rate scheduler (#21495) · a3034c70

Adrian Sager La Ganga authored Feb 07, 2023

* added inverse sqrt lr scheduler

* Updated get_scheduler in src/transformers/optimization.py

* Updated src/transformers/__init__.py

* Added inverse sqrt lr scheduler test

* Updated docs/source/en/main_classes/optimizer_schedules.mdx

* Ran style and quality scripts

* Fix get_inverse_sqrt_schedule docstring

* Comment implementation URL

a3034c70

[tokenizer] sanitize saved config (#21483) · b9af152e
Stas Bekman authored Feb 07, 2023
```
* [tokenizer] sanitize saved config

* rm config["name_or_path"] test
```
b9af152e

Cleanup quality (#21493) · 67d07487

Sylvain Gugger authored Feb 07, 2023

* Remove mentions of flake8/isort

* Clean up inits

* Deall with all other inits

* Last special rule for dummy files

67d07487

Add limit_all_gathers option to fsdp_config and fix forward_prefetch bug (#21489) · 571fa585

raghavanone authored Feb 07, 2023

* Add limit_all_gathers option to fsdp_config and fix forward_prefetch bug

* Fix black issue

* Fix ruff failure

* Incorporate PR feedbacks

* Incorporate PR feedbacks

* Incorporate PR feedbacks

571fa585

A new test to check config attributes being used (#21453) · 479322bf

Yih-Dar authored Feb 07, 2023



* Add a new test to check config attributes being used

* Add a new test to check config attributes being used

* Add a new test to check config attributes being used

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Apply suggestions

* Update allowed cases - part 1

* Update allowed cases - part 2

* final

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

479322bf

[OPT] Adds `GPT2TokenizerFast` to the list of tokenizer to use for OPT. (#20823) · 9e7f84a5

Arthur authored Feb 07, 2023

* Add ("opt", ("GPT2Tokenizer", "GPT2TokenizerFast" if is_tokenizers_available() else None)),

* skip failing test

* Add ("opt", ("GPT2Tokenizer", "GPT2TokenizerFast" if is_tokenizers_available() else None)),

* skip failing test

9e7f84a5

Sanity check the type of id2label and label2id arguments of from_pretrained... · 8a303f52

raghavanone authored Feb 07, 2023

Sanity check the type of id2label and label2id arguments of from_pretrained for TokenClassification models (#21490)

* Sanity check the type of id2label and label2id arguments of from_pretrained for TokenClassification models

* Incorporate PR feedbacks

* Incorporate PR feedbacks

8a303f52