Commits · 05bdef16b611df0946a6a602503f1ace604b6c80 · chenpangpang / transformers

17 Apr, 2024 13 commits

Re-enable SDPA's FA2 path (#30070) · 05bdef16

fxmarty authored Apr 17, 2024



* tentatively re-enable FA2 + SDPA

* better comment

* _ignore_causal_mask_sdpa as staticmethod

* type hints

* use past_seen_tokens instead

* enable copied from for sdpa

* ruff

* llama simplifications on review

* remove unnecessary self.is_causal check

* fix copies

* cleaning

* precise message

* better doc

* add test

* simplify

* Update src/transformers/models/llama/modeling_llama.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/models/llama/modeling_llama.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/models/llama/modeling_llama.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* style

---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

05bdef16

Add OLMo model family (#29890) · e4ea19b9

Shane A authored Apr 17, 2024

* Add OLMo using add-new-model-like with Llama

* Fix incorrect tokenizer for OLMo

* Copy-paste relevant OLMo methods and their imports

* Add OLMo config

* Modify OLMo config to follow HF conventions

* Remove unneeded Llama code from OLMo model

* Add ability for OLMo model to output attentions

* Add OLMoPreTrainedModel and OLMoModel

* Add OLMoForCausalLM

* Minor fixes to OLMo model for style and missing functions

* Implement OLMo tokenizer

* Implement OLMo to HF conversion script

* Add tests for OLMo model

* Add tests for OLMo fast tokenizer

* Add auto-generated dummy objects

* Remove unimplemented OLMo classes from auto and init classes and re-format

* Add README and associated auto-generated files

* Use OLMo names for common properties

* Run make fixup

* Remove `|` from OLMo typing

* Remove unneeded tokenization_olmo.py

* Revert model, config and converter to add-new-model-like Llama

* Move logic for adding bos/eos token into GPTNeoxTokenizerFast

* Change OLMoConfig defaults to match OLMo-7B

* Use GPTNeoXToknizerFast in OLMo tokenizer tests

* Modify auto-generated OLMoModelTests to work for OLMo

* Add non-parametric layer norm OLMoLayerNorm

* Update weight conversion script for OLMo

* Fix __init__ and auto structure for OLMo

* Fix errors from make fixup

* Remove OLMoTokenizerFast from documentation

* Add missing 'Copied from' for OLMoModel._update_causal_mask

* Run make fix-copies

* Rearrange string replacements in OLMoForCausalLM Copied from

* Move OLMo and Llama CausalLM.forward example into global constants

* Fix OLMO_GENERATION_EXAMPLE doc string typo

* Add option for qkv clipping to OLMo

* Rearrange OLMoConfig kwargs in convert_olmo_weights_to_hf

* Add clip_qkv to OLMoConfig in convert_olmo_weights_to_hf

* Fix OLMo tokenization bug using conversion script

* Keep model in full precision after conversion

* Do not add eos token automatically

* Update references to OLMo model in HF Hub

* Do not add eos token during encoding by default

* Fix Llama generation example

* Run make fixup

* OLMo 7B integration test fix

* Remove unneeded special case for OLMoConfig

* OLMo 7B Twin 2T integration test fix

* Fix test_model_7b_greedy_generation

* Remove test_compile_static_cache

* Fix OLMo and Llama generation example

* Run make fixup

* Revert "OLMo 7B integration test fix"

This reverts commit 4df56a4b150681bfa559846f40e9b7b7f97d7908.

* Revert "OLMo 7B Twin 2T integration test fix"

This reverts commit 9ff65a4a294ace89ab047b793ca55e623a9ceefc.

* Ungate 7B integration tests and fix greedy generation test

* Add retries for flaky test_eager_matches_sdpa_generate

* Fix output of doc example for OLMoForCausalLM.forward

* Downsize OLMo doc test for OLMoForCausalLM.forward to 1B model

* Try fix incorrect characters in OLMoForCausalLM.forward doct test

* Try fix incorrect characters in OLMoForCausalLM.forward doc test using end quotes

* Remove pretraining_tp from OLMo config and model

* Add missing 'Copied from' instances

* Remove unneeded causal_mask from OLMoModel

* Revert Llama changes

* Ignore copy for OLMoForCausalLM.forward

* Change 'OLMo' to 'Olmo' in classes

* Move minimal OLMo tokenization tests to model tests

* Add missed 'Copied from' for repeat_kv

e4ea19b9

Upgrading to tokenizers 0.19.0 (#30289) · 8e5f76f5

Nicolas Patry authored Apr 17, 2024

* [DO NOT MERGE] Testing tokenizers 0.19.0rc0

* Accounting for the breaking change.

* Ruff.

* Upgrading to tokenizers `0.19` (new release with preprend_scheme fixed
and new surface for BPE tiktoken bug).

8e5f76f5

Add strategy to store results in evaluation loop (#30267) · c15aad09

Pavel Iakubovskii authored Apr 17, 2024

* Add evaluation loop container for interm. results

* Add tests for EvalLoopContainer

* Formatting

* Fix padding_index in test and typo

* Move EvalLoopContainer to pr_utils to avoid additional imports

* Fix `eval_do_concat_batches` arg description

* Fix EvalLoopContainer import

c15aad09

Add token type ids to CodeGenTokenizer (#29265) · 8d6b5096

st81 authored Apr 17, 2024

* Add create token type ids to CodeGenTokenizer

* Fix inconsistent length of token type ids

* Format source codes

* Fix inconsistent order of methods

* Update docstring

* add test_tokenizer_integration test

* Format source codes

* Add `copied from` comment to CodeGenTokenizerFast

* Add doc of create_token_type_ids_from_sequences

* Make return_token_type_ids False by default

* Make test_tokenizer_integration as slow test

* Add return_token_type_ids to tokenizer init arg

* Add test for tokenizer's init return_token_type_ids

* Format source codes

8d6b5096

FIX: Fix push important models CI (#30291) · 812a5de2
Younes Belkada authored Apr 17, 2024
```
Update push-important-models.yml
```
812a5de2
Fix `Fatal Python error: Bus error` in `ZeroShotAudioClassificationPipelineTests` (#30283) · eb75516e
Yih-Dar authored Apr 17, 2024
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
eb75516e
Fix test `ExamplesTests::test_run_translation` (#30281) · 05dab4e5
Yih-Dar authored Apr 17, 2024
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
05dab4e5
Enable fx tracing for Mistral (#30209) · 304c6a1e
Raushan Turganbay authored Apr 17, 2024
```
* tracing for mistral

* typo

* fix copies
```
304c6a1e

Configuring Translation Pipelines documents update #27753 (#29986) · 98717cb3

Utkarsha Gupte authored Apr 17, 2024

* Configuring Translation Pipelines documents update #27753

Configuring Translation Pipelines documents update

* Language Format Addition

* adding supported list of languages list

98717cb3

FIX / AWQ: Fix failing exllama test (#30288) · 080b7008
Younes Belkada authored Apr 17, 2024
```
fix filing exllama test
```
080b7008
Fix SpeechT5 forward docstrings (#30287) · 41145247
Yoach Lacombe authored Apr 17, 2024

41145247

Fix SDPA sliding window compatibility (#30127) · 40eb6d6c

fxmarty authored Apr 17, 2024



* fix sdpa + sliding window

* give credit
Co-authored-by: ehuaa <ehuamail@163.com>

* remove unnecessary warning

* fix typog

* add test

---------
Co-authored-by: ehuaa <ehuamail@163.com>

40eb6d6c

16 Apr, 2024 10 commits

Fix test fetcher (doctest) + `Idefics2`'s doc example (#30274) · 5fabebdb
Yih-Dar authored Apr 16, 2024
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
5fabebdb
fix: Fixed a `raise` statement (#30275) · 37b5946a
Sai-Suraj-27 authored Apr 16, 2024
```
* Fixed a raise statement.

* Minor changes.
```
37b5946a

BLIP - fix pt-tf equivalence test (#30258) · c63f1589

amyeroberts authored Apr 16, 2024

* BLIP - fix pt-tf equivalence test

* Update tests/models/blip/test_modeling_blip.py

* Update more model tests

c63f1589

Raise relevent err when wrong type is passed in as the accelerator_config (#29997) · e27d9308
Zach Mueller authored Apr 16, 2024
```
* Raise relevent err

* Use type instead
```
e27d9308

add `push_to_hub` to pipeline (#29172) · 0eaef0c7

Hafedh authored Apr 16, 2024



* add `push_to_hub` to pipeline

* fix docs

* format with ruff

* update save_pretrained

* update save_pretrained

* remove unnecessary comment

* switch to push_to_hub method in DynamicPipelineTester

* remove unused imports

* update docs for add_new_pipeline

* fix docs for add_new_pipeline

* add comment

* fix italien docs

* changes to token retrieval for pipelines

* Update src/transformers/pipelines/base.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

0eaef0c7

Workflow: Update tailscale to release version (#30268) · 60dea593
Younes Belkada authored Apr 16, 2024
```
Update tailscale to release version
```
60dea593

Allow for str versions of dicts based on typing (#30227) · 487505ff

Zach Mueller authored Apr 16, 2024

* Bookmark, initial impelemtation. Need to test

* Clean

* Working fully, woop woop

* I think working version now, testing

* Fin!

* rm cast, could keep None

* Fix typing issue

* rm typehint

* Add test

* Add tests and make more rigid

487505ff

FIX: Fix 8-bit serialization tests (#30051) · b86d0f4e

Younes Belkada authored Apr 16, 2024



* fix 8-bit serialization tests

* add more clarification

* Update src/transformers/quantizers/quantizer_bnb_8bit.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

b86d0f4e

FIX: Fix corner-case issue with the important models workflow (#30212) · ddf5f258

Younes Belkada authored Apr 16, 2024

* Update push-important-models.yml

* dummy commit

* Update modeling_bark.py

* test

* test

* test

* another test

* another test

* test

* final test

* final test

* test

* another test

* test

* test

* another test

* test llama

* revert everything

* remove echo

ddf5f258

More fixes for doctest (#30265) · cbc2cc18

Yih-Dar authored Apr 16, 2024



* fix

* update

* update

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

cbc2cc18

15 Apr, 2024 15 commits

Update `ko/_toctree.yml` (#30062) · 51bcadc1

Jungnerd authored Apr 16, 2024



* fix: update `ko/_toctree.yml`

* fix: update ko/_toctree.yml

* Update docs/source/ko/_toctree.yml
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* fix: delete `perf_infer_gpu_many`

* fix: Replace untranslated docs with `in_translation`
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* fix: Replace untraslated docs with `in_translation`

---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

51bcadc1

Remove incorrect arg in codellama doctest (#30257) · 5be21302
Matt authored Apr 15, 2024
```
Remove incorrect arg in codellama docstring
```
5be21302
[Docs] Update recurrent_gemma.md for some minor nits (#30238) · 8127f396
Sayak Paul authored Apr 15, 2024
```
Update recurrent_gemma.md
```
8127f396

Add Idefics2 (#30253) · 6b78360e

amyeroberts authored Apr 15, 2024



* Initial add model additions

* Test

* All weights loading

* Can perform full forward pass

* Local and remote the same

* Matching local and remote

* Fixup

* Idefics2Model importable; fixup docstrings

* Don't skip by default

* Remove deprecated use_resampler arg

* Remove self.config

* DecoupledLinear takes config

* Tidy up

* Enable eager attention and tidy up

* Most tests passing

* Update for batch of processed images

* Add image processor

* Update doc pages

* Update conversion script

* Remove erroneous breakpoint

* Remove accidendtal spelling change

* Update to reflect changes on hub - make generate work

* Fix up

* Image processor tests

* Update tests

* Add a processor

* Add a processor

* Update convert script

* Update modeling file - remove fixmes

* Bug fix

* Add processing test

* Use processor

* Fix up

* Update src/transformers/models/idefics2/modeling_idefics2.py
Co-authored-by: Victor SANH <victorsanh@gmail.com>

* Update src/transformers/models/idefics2/modeling_idefics2.py
Co-authored-by: Victor SANH <victorsanh@gmail.com>

* Fix test

* Update config - PR comments and defaults align with checkpoint

* Reviewer comments

* Add copied froms for flahs attention

* Update src/transformers/models/idefics2/modeling_idefics2.py
Co-authored-by: Victor SANH <victorsanh@gmail.com>

* Apply suggestions from code review
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Remove qk_layer_norm and freeze_layers functionality

* Fix

* Remove freeze_layer options from config

* Sync with upstream main

* Fix attention shapes siglip

* Remove Llava-next refs - TO REBASE

* Use AutoModel for text model

* Add comment to explain vision embeddings

* Fix issue with tie_word_embeddings

* Address review comments

* Fix and fix up

* Chat templates for idefics

* Fix copies

* Fix

* Add layer norms to FA2

* Fix tests

* Apply suggestions from code review
Co-authored-by: Victor SANH <victorsanh@gmail.com>

* Fix

* Review comments

* Update src/transformers/models/idefics2/modeling_idefics2.py
Co-authored-by: Victor SANH <victorsanh@gmail.com>

* Update inputs merger

* Merge weights in correct order

* Update convert script

* Update src/transformers/models/idefics2/processing_idefics2.py
Co-authored-by: Victor SANH <victorsanh@gmail.com>

* Update template

* Model code examples (fix idefics too)

* More review comments

* Tidy up

* Update processing

* Fix attention mask preparation

* Update inputs_merger inputs

* Vectorize inputs_merger

* Update src/transformers/models/idefics2/__init__.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/models/idefics2/modeling_idefics2.py

* Review comments

* saying bye to the `qk_layer_norms`

* Simplify

* Update latents

* Remove erroneuous readme changes

* Return images when applying chat template

* Fix bug - prompt images are for a single sample

* Update src/transformers/models/idefics2/modeling_idefics2.py

* image splitting

* fix test

* some more comment

* some comment

* Apply suggestions from code review
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update src/transformers/models/idefics2/image_processing_idefics2.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update processor

* Update model tests

* Update src/transformers/models/idefics2/processing_idefics2.py
Co-authored-by: Victor SANH <victorsanh@gmail.com>

* Update src/transformers/models/idefics2/processing_idefics2.py
Co-authored-by: Victor SANH <victorsanh@gmail.com>

* Don't add BOS in template

* Update src/transformers/models/idefics2/processing_idefics2.py
Co-authored-by: Victor SANH <victorsanh@gmail.com>

* Remove index in examples

* Update tests to reflect #13

* Update src/transformers/models/idefics2/processing_idefics2.py
Co-authored-by: Victor SANH <victorsanh@gmail.com>

* PR comment - consistent typing

* Update readme and model doc

* Update docs

* Update checkpoint references

* Update examples

* Fix and update tests

* Small addition

* Update tests - remove copied from as no ignore placement copy could be found

* Update example

* small fixes

* Update docs/source/en/model_doc/idefics2.md
Co-authored-by: Victor SANH <victorsanh@gmail.com>

* Update docs/source/en/model_doc/idefics2.md
Co-authored-by: Victor SANH <victorsanh@gmail.com>

* Update README.md
Co-authored-by: Victor SANH <victorsanh@gmail.com>

* Connector model as bridge

* Fix up

* Fix up

* Don't pass model inputs for generation kwargs update

* IDEFICS-2 -> Idefics2

* Remove config archive name

* IDEFICS-2 -> Idefics2

* Add back llava-next

* Update readmes

* Add requirements for processor tester

* Use custom convert_to_rgb to avoid possible BC

* Fix doc example

* Fix doc example

* Skip model doc tests - as model to large

* More doc example - account for image splitting

* Update src/transformers/image_transforms.py

* Fix config doctest

---------
Co-authored-by: Pablo Montalvo <39954772+molbap@users.noreply.github.com>
Co-authored-by: ArthurZucker <arthur.zucker@gmail.com>
Co-authored-by: Victor SANH <victorsanh@gmail.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

6b78360e

[tests] add the missing `require_torch_multi_gpu` flag (#30250) · 667939a2
Fanli Lin authored Apr 15, 2024
```
add gpu flag
```
667939a2
update github actions packages' version to suppress warnings (#30249) · 440bd3c3
Yih-Dar authored Apr 15, 2024
```
update
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
440bd3c3
round epoch only in console (#30237) · 76681015
LZR authored Apr 15, 2024

76681015
Fix doctest more (for `docs/source/en`) (#30247) · fe2d20d2
Yih-Dar authored Apr 15, 2024
```
* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
fe2d20d2
Separate out kwargs in processor (#30193) · ec344b56
amyeroberts authored Apr 15, 2024
```
* Separate out kwargs in processor

* Fix up
```
ec344b56
fix: Fixed `type annotation` for compatability with python 3.8 (#30243) · fc8eda36
Sai-Suraj-27 authored Apr 15, 2024
```
* Fixed type annotation for compatability with python 3.8

* Fixed unsorted imports.
```
fc8eda36

Refactor doctest (#30210) · b6b6daf2

Yih-Dar authored Apr 15, 2024



* fix

* update

* fix

* update

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

b6b6daf2

fix: Replaced deprecated `typing.Text` with `str` (#30230) · b3595cf0
Sai-Suraj-27 authored Apr 15, 2024
```
typing.Text is deprecated. Use str instead
```
b3595cf0
Set pad_token in run_glue_no_trainer.py #28534 (#30234) · f0107862
JINO ROHIT authored Apr 15, 2024

f0107862
fix: Replace deprecated `assertEquals` with `assertEqual` (#30241) · 06b11927
Sai-Suraj-27 authored Apr 15, 2024
```
Replace deprecated assertEquals with assertEqual.
```
06b11927

Add test for parse_json_file and change typing to os.PathLike (#30183) · 8fd2de93

Xu Song authored Apr 15, 2024

* Add test for parse_json_file

* Change Path to PathLike

* Fix `Import block is un-sorted or un-formatted`

* revert parse_json_file

* Fix ruff format

* Add parse_json_file test

8fd2de93

12 Apr, 2024 2 commits
- Fixed config.json download to go to user-supplied cache directory (#30189) · b109257f
  ulatekh authored Apr 12, 2024
```
* Fixed config.json download to go to user-supplied cache directory.

* Simplied implementation suggested by @amyeroberts
```
  b109257f
- Fix/Update for doctest (#30216) · db7d1554
  Yih-Dar authored Apr 12, 2024
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  db7d1554