Commits · 51bcadc10a569847b93a30dbe3a077037ae63bad · chenpangpang / transformers

08 Apr, 2024 2 commits

[#29174] ImportError Fix: Trainer with PyTorch requires accelerate>=0.20.1 Fix (#29888) · 0201f642

Utkarsha Gupte authored Apr 08, 2024



* ImportError: Trainer with PyTorch requires accelerate>=0.20.1 Fix

Adding the evaluate and accelerate installs at the beginning of the cell to fix the issue

* ImportError Fix: Trainer with PyTorch requires accelerate>=0.20.1

* Import Error Fix

* Update installation.md

* Update quicktour.md

* rollback other lang changes

* Update _config.py

* updates for other languages

* fixing error

* Tutorial Update

* Update tokenization_utils_base.py

* Just use an optimizer string to pass the doctest?

---------
Co-authored-by: Matt <rocketknight1@gmail.com>

0201f642

doc: Correct spelling mistake (#30107) · 1fc34aa6
Cylis authored Apr 08, 2024

1fc34aa6

13 Mar, 2024 1 commit
- [docs] Remove broken ChatML format link from chat_templating.md (#29643) · f738ab3b
  Aaron Jimenez authored Mar 13, 2024
```
* remove ChatML link from en/

* remove ChatML link in ja/

* remove ChatML link in zh/
```
  f738ab3b
07 Mar, 2024 1 commit
- v4.39 deprecations 🧼 (#29492) · ffe60fdc
  Joao Gante authored Mar 07, 2024
  
  ffe60fdc
05 Mar, 2024 2 commits
- [i18n-zh] Translate add_new_pipeline.md into Chinese (#29432) · 638c423c
  Michael authored Mar 06, 2024
```
* [i18n-zh] Translate add_new_pipeline.md into Chinese

* apply suggestions from Fan-Lin
```
  638c423c
- Generate: inner decoding methods are no longer public (#29437) · 87a0783d
  Joao Gante authored Mar 05, 2024
  
  87a0783d
28 Feb, 2024 1 commit
- [i18n-zh] Sync source/zh/index.md (#29331) · 2209b7af
  Michael authored Feb 29, 2024
```
* [i18n-zh] Sync source/zh/index.md

* apply review comments
```
  2209b7af
27 Feb, 2024 1 commit

[i18n-zh] Translate fsdp.md into Chinese (#29305) · 83ab0115

Michael authored Feb 28, 2024



* [i18n-zh] Translate fsdp.md into Chinese
Signed-off-by: windsonsea <haifeng.yao@daocloud.io>

* apply suggestions from Fan-Lin

---------
Signed-off-by: windsonsea <haifeng.yao@daocloud.io>

83ab0115

26 Feb, 2024 3 commits

[i18n-zh] Translated task/asr.md into Chinese (#29233) · a44d2dc3

Michael authored Feb 27, 2024



* [zh] Translate a task: asr.md
Signed-off-by: windsonsea <haifeng.yao@daocloud.io>

* apply suggestions from Fan-Lin

---------
Signed-off-by: windsonsea <haifeng.yao@daocloud.io>

a44d2dc3

[i18n-ZH] Translate chat_templating.md into Chinese (#28790) · 734eb254

Ming Xu (徐明) authored Feb 27, 2024



* [Pix2struct] Simplify generation (#22527)

* Add model to doc tests

* Remove generate and replace by prepare_inputs_for_generation

* More fixes

* Remove print statements

* Update integration tests

* Fix generate

* Remove model from auto mapping

* Use auto processor

* Fix integration tests

* Fix test

* Add inference code snippet

* Remove is_encoder_decoder

* Update docs

* Remove notebook link

* Release: v4.28.0

* Revert (for now) the change on `Deta` in #22437 (#22750)

fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

* Patch release: v4.28.1

* update zh chat template.

* Update docs/source/zh/chat_templating.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/zh/_toctree.yml
Co-authored-by: Michael <haifeng.yao@daocloud.io>

* Update docs/source/zh/chat_templating.md
Co-authored-by: Michael <haifeng.yao@daocloud.io>

* Update docs/source/zh/chat_templating.md
Co-authored-by: Michael <haifeng.yao@daocloud.io>

* Update docs/source/zh/chat_templating.md
Co-authored-by: Michael <haifeng.yao@daocloud.io>

* Update docs/source/zh/chat_templating.md
Co-authored-by: Michael <haifeng.yao@daocloud.io>

* Update docs/source/zh/chat_templating.md
Co-authored-by: Michael <haifeng.yao@daocloud.io>

* Update docs/source/zh/chat_templating.md
Co-authored-by: Michael <haifeng.yao@daocloud.io>

---------
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <Sylvain.gugger@gmail.com>
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: Michael <haifeng.yao@daocloud.io>

734eb254

[i18n-zh] Translated torchscript.md into Chinese (#29234) · b4334045
Michael authored Feb 27, 2024
```
Signed-off-by: windsonsea <haifeng.yao@daocloud.io>
```
b4334045

16 Feb, 2024 1 commit
- Update all references to canonical models (#29001) · f497f564
  Lysandre Debut authored Feb 16, 2024
```
* Script & Manual edition

* Update
```
  f497f564
12 Feb, 2024 2 commits
- [i18n-de] Translate CONTRIBUTING.md to German (#28954) · d90acc16
  Klaus Hipp authored Feb 12, 2024
```
* Translate contributing.md to German

* Fix formatting issues in contributing.md

* Address review comments

* Fix capitalization
```
  d90acc16
- [Docs] Add language identifiers to fenced code blocks (#28955) · fe3df9d5
  Klaus Hipp authored Feb 12, 2024
```
Add language identifiers to code blocks
```
  fe3df9d5
06 Feb, 2024 2 commits

[Docs] Add missing language options and fix broken links (#28852) · 1c31b7aa

Klaus Hipp authored Feb 06, 2024

* Add missing entries to the language selector

* Add links to the Colab and AWS Studio notebooks for ONNX

* Use anchor links in CONTRIBUTING.md

* Fix broken hyperlinks due to spaces

* Fix links to OpenAI research articles

* Remove confusing footnote symbols from author names, as they are also considered invalid markup

1c31b7aa

[Docs] Fix backticks in inline code and documentation links (#28875) · 4830f269
Klaus Hipp authored Feb 06, 2024
```
Fix backticks in code blocks and documentation links
```
4830f269

05 Feb, 2024 1 commit

Image Feature Extraction pipeline (#28216) · ba3264b4

amyeroberts authored Feb 05, 2024



* Draft pipeline

* Fixup

* Fix docstrings

* Update doctest

* Update pipeline_model_mapping

* Update docstring

* Update tests

* Update src/transformers/pipelines/image_feature_extraction.py
Co-authored-by: Omar Sanseviero <osanseviero@gmail.com>

* Fix docstrings - review comments

* Remove pipeline mapping for composite vision models

* Add to pipeline tests

* Remove for flava (multimodal)

* safe pil import

* Add requirements for pipeline run

* Account for super slow efficientnet

* Review comments

* Fix tests

* Swap order of kwargs

* Use build_pipeline_init_args

* Add back FE pipeline for Vilt

* Include image_processor_kwargs in docstring

* Mark test as flaky

* Update TODO

* Update tests/pipelines/test_pipelines_image_feature_extraction.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Add license header

---------
Co-authored-by: Omar Sanseviero <osanseviero@gmail.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

ba3264b4

02 Feb, 2024 1 commit

[Docs] Fix spelling and grammar mistakes (#28825) · 721ee783

Klaus Hipp authored Feb 02, 2024

* Fix typos and grammar mistakes in docs and examples

* Fix typos in docstrings and comments

* Fix spelling of `tokenizer` in model tests

* Remove erroneous spaces in decorators

* Remove extra spaces in Markdown link texts

721ee783

15 Jan, 2024 1 commit
- Generate: consolidate output classes (#28494) · 7e0ddf89
  Joao Gante authored Jan 15, 2024
  
  7e0ddf89
12 Jan, 2024 1 commit
- TF: purge `TFTrainer` (#28483) · 4fb3d3a0
  Joao Gante authored Jan 12, 2024
  
  4fb3d3a0
04 Jan, 2024 1 commit

README: install transformers from conda-forge channel (#28313) · 5d36025c

Kevin Herro authored Jan 04, 2024

Switch to the conda-forge channel for transformers installation,
as the huggingface channel does not offer the latest version.

Fixes #28248

5d36025c

03 Jan, 2024 1 commit
- Translate contributing.md into Chinese (#28243) · 3ea88336
  Mayfsz authored Jan 04, 2024
```
* Translate contributing.md into Chinese

* Update review comments
```
  3ea88336
15 Dec, 2023 1 commit
- doc: Correct spelling mistake (#28064) · 70a127a3
  Cylis authored Dec 15, 2023
  
  70a127a3
08 Dec, 2023 1 commit

F.scaled_dot_product_attention support (#26572) · 80377eb0

fxmarty authored Dec 08, 2023



* add sdpa

* wip

* cleaning

* add ref

* yet more cleaning

* and more :)

* wip llama

* working llama

* add output_attentions=True support

* bigcode sdpa support

* fixes

* gpt-bigcode support, require torch>=2.1.1

* add falcon support

* fix conflicts falcon

* style

* fix attention_mask definition

* remove output_attentions from attnmaskconverter

* support whisper without removing any Copied from statement

* fix mbart default to eager renaming

* fix typo in falcon

* fix is_causal in SDPA

* check is_flash_attn_2_available in the models init as well in case the model is not initialized through from_pretrained

* add warnings when falling back on the manual implementation

* precise doc

* wip replace _flash_attn_enabled by config.attn_implementation

* fix typo

* add tests

* style

* add a copy.deepcopy on the config in from_pretrained, as we do not want to modify it inplace

* obey to config.attn_implementation if a config is passed in from_pretrained

* fix is_torch_sdpa_available when torch is not installed

* remove dead code

* Update src/transformers/modeling_attn_mask_utils.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/modeling_attn_mask_utils.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/modeling_attn_mask_utils.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/modeling_attn_mask_utils.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/modeling_attn_mask_utils.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/models/bart/modeling_bart.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* remove duplicate pretraining_tp code

* add dropout in llama

* precise comment on attn_mask

* add fmt: off for _unmask_unattended docstring

* precise num_masks comment

* nuke pretraining_tp in LlamaSDPAAttention following Arthur's suggestion

* cleanup modeling_utils

* backward compatibility

* fix style as requested

* style

* improve documentation

* test pass

* style

* add _unmask_unattended tests

* skip meaningless tests for idefics

* hard_check SDPA requirements when specifically requested

* standardize the use of XXX_ATTENTION_CLASSES

* fix SDPA bug with mem-efficient backend on CUDA when using fp32

* fix test

* rely on SDPA is_causal parameter to handle the causal mask in some cases

* fix FALCON_ATTENTION_CLASSES

* remove _flash_attn_2_enabled occurrences

* fix test

* add OPT to the list of supported flash models

* improve test

* properly test on different SDPA backends, on different dtypes & properly handle separately the pad tokens in the test

* remove remaining _flash_attn_2_enabled occurrence

* Update src/transformers/modeling_utils.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/modeling_utils.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/modeling_utils.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/modeling_attn_mask_utils.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update docs/source/en/perf_infer_gpu_one.md
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* remove use_attn_implementation

* fix docstring & slight bug

* make attn_implementation internal (_attn_implementation)

* typos

* fix tests

* deprecate use_flash_attention_2=True

* fix test

* add back llama that was removed by mistake

* fix tests

* remove _flash_attn_2_enabled occurrences bis

* add check & test that passed attn_implementation is valid

* fix falcon torchscript export

* fix device of mask in tests

* add tip about torch.jit.trace and move bt doc below sdpa

* fix parameterized.expand order

* move tests from test_modeling_attn_mask_utils to test_modeling_utils as a relevant test class is already there

* update sdpaattention class with the new cache

* Update src/transformers/configuration_utils.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/models/bark/modeling_bark.py

* address review comments

* WIP torch.jit.trace fix. left: test both eager & sdpa

* add test for torch.jit.trace for both eager/sdpa

* fix falcon with torch==2.0 that needs to use sdpa

* fix doc

* hopefully last fix

* fix key_value_length that has no default now in mask converter

* is it flaky?

* fix speculative decoding bug

* tests do pass

* fix following #27907

---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

80377eb0

04 Dec, 2023 1 commit
- translate internal folder files to chinese (#27638) · a502b0d4
  jiaqiw09 authored Dec 05, 2023
```
* translate

* update

* update

---------
Co-authored-by: jiaqiw <wangjiaqi50@huawei.com>
```
  a502b0d4
27 Nov, 2023 2 commits

translation main-class files to chinese (#27588) · cad1b119

jiaqiw09 authored Nov 28, 2023



* translate work

* update

* update

* update [[autodoc]]

* Update callback.md

---------
Co-authored-by: jiaqiw <wangjiaqi50@huawei.com>

cad1b119

docs: replace torch.distributed.run by torchrun (#27528) · ce315081

Peter Pan authored Nov 28, 2023



* docs: replace torch.distributed.run by torchrun

 `transformers` now officially supports pytorch >= 1.10.
 The entrypoint `torchrun` is present from 1.10 onwards.
Signed-off-by: Peter Pan <Peter.Pan@daocloud.io>

* Update src/transformers/trainer.py

with @ArthurZucker's suggestion
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

---------
Signed-off-by: Peter Pan <Peter.Pan@daocloud.io>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

ce315081

17 Nov, 2023 2 commits
- translate deepspeed.md to chinese (#27495) · d1a00f9d
  jiaqiw09 authored Nov 18, 2023
```
* translate deepspeed.md

* update
```
  d1a00f9d
- Broken links fixed related to datasets docs (#27569) · ffbcfc01
  V.Prasanna kumar authored Nov 18, 2023
```
fixed the broken links belonging to the datasets library of transformers
```
  ffbcfc01
16 Nov, 2023 2 commits

translate Trainer.md to chinese (#27527) · b074461e
jiaqiw09 authored Nov 16, 2023
```
* translate

* update

* update
```
b074461e

translate model.md to chinese (#27518) · 06343b06

Hz, Ji authored Nov 16, 2023



* translate model.md to chinese

* apply review suggestion
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

06343b06

14 Nov, 2023 1 commit
- translate hpo_train.md and perf_hardware.md to chinese (#27431) · 73bc0c9e
  jiaqiw09 authored Nov 14, 2023
```
* translate

* translate

* update
```
  73bc0c9e
13 Nov, 2023 1 commit
- Perf torch compile (#27422) · eb79b55b
  jiaqiw09 authored Nov 13, 2023
```
* translate perf_torch_compile.md

* translate tf_xla.md

* update
```
  eb79b55b
08 Nov, 2023 2 commits
- translate debugging.md to chinese (#27374) · ced9fd86
  jiaqiw09 authored Nov 08, 2023
```
* update

* update
```
  ced9fd86
- translate big_models.md and performance.md to chinese (#27334) · ef716736
  jiaqiw09 authored Nov 08, 2023
```
* translate performance.md

* translate performance.md and big_models.md

* update translation

* update review
```
  ef716736
07 Nov, 2023 2 commits

translate model_sharing.md and llm_tutorial.md to chinese (#27283) · e2647450

jiaqiw09 authored Nov 07, 2023

* translate model_sharing.md

* translate llm_tutorial.md to chinese

* update wrong translation

* update _toctree.yml

* update typos

* update

e2647450

translate the en tokenizer_summary.md to Chinese (#27291) · f213d5dd
九是否随意的称呼 authored Nov 08, 2023
```
* translate the en tokenizer_summary.md to Chinese

* revise WordPiece

* add to source/zh/_toctree.yml
```
f213d5dd

06 Nov, 2023 1 commit
- [docs] fixed links with 404 (#27327) · 9beb2737
  Maria Khalusova authored Nov 06, 2023
```
* fixed links with 404

* make style
```
  9beb2737
03 Nov, 2023 2 commits
- translate run_scripts.md to chinese (#27246) · cc3e4781
  jiaqiw09 authored Nov 03, 2023
```
* translate run_scripts.md to chinese

* translate run_scripts.md to chinese

* translate run_scripts.md to chinese
```
  cc3e4781
- translate autoclass_tutorial to chinese (#27269) · bf7cfac2
  jiaqiw09 authored Nov 03, 2023
```
* translate autoclass_tutorial.md  to chinese

* translate update
```
  bf7cfac2