- 01 Mar, 2024 10 commits
-
-
David Valente authored
* Correct zero division error in inverse sqrt scheduler * default timescale to 10_000
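The scheduler fix can be sketched as below; `inverse_sqrt_lambda` and its exact warmup/shift handling are a hypothetical stand-in for the library's real schedule, assuming the zero-division came from an unset timescale:

```python
import math

def inverse_sqrt_lambda(step, num_warmup_steps=0, timescale=None):
    # Hypothetical sketch: default the timescale to 10_000 when no
    # warmup is given, so a zero warmup no longer divides by zero.
    if timescale is None:
        timescale = num_warmup_steps or 10_000
    if step < num_warmup_steps:
        # linear warmup
        return step / max(1, num_warmup_steps)
    shift = timescale - num_warmup_steps
    # decay proportional to 1/sqrt(step) after warmup
    return 1.0 / math.sqrt((step + shift) / timescale)
```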
-
Zach Mueller authored
* Fix deprecated arg issue * Trainer check too * Check for dict or dataclass * Simplify, make config always AcceleratorConfig * Upstream to Trainer
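A minimal sketch of the "always AcceleratorConfig" normalization described above; the cut-down dataclass and field names here are illustrative, not the real `transformers` class:

```python
from dataclasses import dataclass

@dataclass
class AcceleratorConfig:
    # Cut-down stand-in for the Trainer's accelerator config.
    split_batches: bool = False
    even_batches: bool = True

def to_accelerator_config(value):
    # Accept None, a plain dict (e.g. parsed from JSON), or an
    # already-built dataclass, and always return an AcceleratorConfig.
    if value is None:
        return AcceleratorConfig()
    if isinstance(value, AcceleratorConfig):
        return value
    if isinstance(value, dict):
        return AcceleratorConfig(**value)
    raise TypeError(f"unsupported accelerator_config type: {type(value)!r}")
```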
-
Marc Sun authored
-
Jingya HUANG authored
enable subfolder
-
amyeroberts authored
* Fix yolos processing * Add back slow marker - protects for pycocotools in slow * Slow decorator goes above copied from header
-
Sanchit Gandhi authored
* [Whisper Tok] Update integration test * make style
-
Arthur authored
* use the generation config * fixup
-
Younes Belkada authored
* fix ESM 8bit * Apply suggestions from code review Co-authored-by:
Arthur <48595927+ArthurZucker@users.noreply.github.com> * fixup --------- Co-authored-by:
Arthur <48595927+ArthurZucker@users.noreply.github.com>
-
Leon Engländer authored
* LlamaForQuestionAnswering self.transformer->self.model * fix "Copied from" string * Llama QA model: set base_model_prefix = "transformer"
-
Song Fuchang authored
Expose `offload_buffers` parameter of `accelerate` to `PreTrainedModel.from_pretrained` method (#28755)
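The change amounts to threading one keyword through to accelerate's dispatch step; the stubs below only illustrate that plumbing and are not the real `from_pretrained` or `dispatch_model` signatures:

```python
def dispatch_model_stub(model, device_map=None, offload_buffers=False):
    # Stand-in for accelerate's dispatch call: just record its arguments.
    return {"device_map": device_map, "offload_buffers": offload_buffers}

def from_pretrained_stub(name, device_map=None, offload_buffers=False):
    # After the change, from_pretrained forwards offload_buffers so that
    # buffers (not only parameters) are offloaded along with the weights.
    model = object()
    return dispatch_model_stub(
        model, device_map=device_map, offload_buffers=offload_buffers
    )
```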
-
- 29 Feb, 2024 6 commits
-
-
Lucain authored
-
NielsRogge authored
Fix issue
-
Yih-Dar authored
* more fixes * more fixes --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Younes Belkada authored
Update test_modeling_llama.py
-
Younes Belkada authored
fix failing tests for peft integration
-
Younes Belkada authored
change starcoder2 path to correct one
-
- 28 Feb, 2024 14 commits
-
-
Michael authored
* [i18n-zh] Sync source/zh/index.md * apply review comments
-
fxmarty authored
* better unmask implementation * comment * typo * bug report pytorch * cleanup * fix import * add back example * retrigger ci * come on
-
Marc Sun authored
* [CI] Quantization workflow * build dockerfile * fix dockerfile * update self-scheduled.yml * test build dockerfile on push * fix torch install * update to python 3.10 * update aqlm version * uncomment build dockerfile * tests if the scheduler works * fix docker * do not trigger on push again * add additional runs * test again * all good * style * Update .github/workflows/self-scheduled.yml Co-authored-by:
Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * test build dockerfile with torch 2.2.0 * fix extra * clean * revert changes * Revert "revert changes" This reverts commit 4cb52b8822da9d1786a821a33e867e4fcc00d8fd. * revert correct change --------- Co-authored-by:
Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
-
jiqing-feng authored
Co-authored-by: Joao Gante <joao@huggingface.co>
-
Daniel Han authored
* Update modeling_llama.py Llama - Force float32 since bfloat16 loses precision on long contexts * Update modeling_llama.py * Update modeling_gemma.py Fix RoPE and logits.float() * @torch.no_grad() * @torch.no_grad() * Cos, Sin to float32 * cos, sin to float32 * Update src/transformers/models/gemma/modeling_gemma.py Co-authored-by:
Arthur <48595927+ArthurZucker@users.noreply.github.com> * Update src/transformers/models/llama/modeling_llama.py Co-authored-by:
Arthur <48595927+ArthurZucker@users.noreply.github.com> * Resolve PR conflicts * Fix RoPE for llama * Revert "Fix RoPE for llama" This reverts commit b860a22dab9bb01cd15cb9a3220abeaefad3e458. * Fix RoPE for llama * RoPE device * Autocast device type * RoPE * RoPE isinstance --------- Co-authored-by:
Arthur <48595927+ArthurZucker@users.noreply.github.com>
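The core of the precision fix above can be sketched in plain Python: compute the rotary (RoPE) angles at full precision and only downcast the resulting cos/sin tables at the end; `rope_cos_sin` and its signature are illustrative, not the model code:

```python
import math

def rope_cos_sin(position, dim, base=10000.0):
    # Angles are computed at full precision; in bfloat16, nearby
    # positions at long context lengths collapse to the same angle,
    # which is the precision loss the fix avoids.
    inv_freq = [base ** (-(2 * i) / dim) for i in range(dim // 2)]
    angles = [position * f for f in inv_freq]
    cos = [math.cos(a) for a in angles]
    sin = [math.sin(a) for a in angles]
    return cos, sin
```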
-
Joao Gante authored
-
Leonardo Emili authored
* Set output_router_logits=False in prepare_inputs_for_generation for mixtral * Add output_router_logits=False to prepare_inputs_for_generation for mixtral * Fix style
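A hedged sketch of the Mixtral change; the function below is a simplified stand-in for the model's `prepare_inputs_for_generation`. Router logits only feed the auxiliary load-balancing loss during training, so generation turns them off to save memory:

```python
def prepare_inputs_for_generation(input_ids, past_key_values=None, **kwargs):
    # Simplified stand-in: with a cache, only the last token is fed.
    if past_key_values is not None:
        input_ids = [row[-1:] for row in input_ids]
    return {
        "input_ids": input_ids,
        "past_key_values": past_key_values,
        # Router logits are only needed for the load-balancing loss
        # during training; disable them at inference.
        "output_router_logits": False,
    }
```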
-
Arthur authored
* remove control flow * update gptneox * update .... * nits * Actually let's just break. Otherwise we are silently failing which imo is not optimal * version BC * fix tests * fix eager causal * nit * add a test * style * nits * nits * more nits for the test * update and fix * make sure cuda graphs are not skipped * read token is needed for meta llama * update! * fixup * compile test should be slow * fix the fix-copies * style
-
Arthur authored
* remove warning * add co-author * update --------- Co-authored-by: hiaoxui <hiaoxui@users.noreply.github.com>
-
Arthur authored
fix wrapper
-
fxmarty authored
* remove numpy usage from owlvit * fix init owlv2 * style
-
Younes Belkada authored
* put hf token in gemma tests * update suggestion * add to flax * revert * fix * fixup * forward contrib credits from discussion --------- Co-authored-by: ArthurZucker <ArthurZucker@users.noreply.github.com>
-
Jared Van Bortel authored
-
RaymondLi0 authored
* Copy model * changes * misc * fixes * add embed and residual dropout (#30) * misc * remove rms norm and gated MLP * remove copied mentions where its not a copy anymore * remove unused _shape * copied from mistral instead * fix copies * fix copies * add not doctested * fix * fix copyright * Update docs/source/en/model_doc/starcoder2.md Co-authored-by:
Arthur <48595927+ArthurZucker@users.noreply.github.com> * Update src/transformers/models/starcoder2/configuration_starcoder2.py Co-authored-by:
Arthur <48595927+ArthurZucker@users.noreply.github.com> * Update src/transformers/models/starcoder2/configuration_starcoder2.py Co-authored-by:
Arthur <48595927+ArthurZucker@users.noreply.github.com> * fix doc * revert some changes * add fa2 tests * fix styling nit * fix * push dummy docs --------- Co-authored-by:
Joel Lamy-Poirier <joel.lamy-poirier@servicenow.com> Co-authored-by:
younesbelkada <younesbelkada@gmail.com> Co-authored-by:
Younes Belkada <49240599+younesbelkada@users.noreply.github.com> Co-authored-by:
Arthur <48595927+ArthurZucker@users.noreply.github.com>
-
- 27 Feb, 2024 10 commits
-
-
Michael authored
* [i18n-zh] Translate fsdp.md into Chinese Signed-off-by:
windsonsea <haifeng.yao@daocloud.io> * apply suggestions from Fan-Lin --------- Signed-off-by:
windsonsea <haifeng.yao@daocloud.io>
-
Sadra Barikbin authored
Co-authored-by: Joao Gante <joao@huggingface.co>
-
Raushan Turganbay authored
-
Marc Sun authored
* Add compatibility with mps device * fix * typo and style
-
Yih-Dar authored
update Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Fanli Lin authored
* add xpu for benchmark * no auto_map * use require_torch_gpu * use gpu * revert * revert * fix style
-
fxmarty authored
fix
-
Merve Noyan authored
* Image Feature Extraction docs * Update docs/source/en/tasks/image_feature_extraction.md Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update image_feature_extraction.md * Update docs/source/en/tasks/image_feature_extraction.md Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update docs/source/en/tasks/image_feature_extraction.md Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Address comments * Update docs/source/en/tasks/image_feature_extraction.md Co-authored-by:
Maria Khalusova <kafooster@gmail.com> * Update docs/source/en/tasks/image_feature_extraction.md Co-authored-by:
Maria Khalusova <kafooster@gmail.com> * Update docs/source/en/tasks/image_feature_extraction.md Co-authored-by:
Maria Khalusova <kafooster@gmail.com> * Update docs/source/en/tasks/image_feature_extraction.md Co-authored-by:
Maria Khalusova <kafooster@gmail.com> * Update docs/source/en/tasks/image_feature_extraction.md Co-authored-by:
Maria Khalusova <kafooster@gmail.com> * Update docs/source/en/tasks/image_feature_extraction.md Co-authored-by:
Maria Khalusova <kafooster@gmail.com> * Update docs/source/en/tasks/image_feature_extraction.md Co-authored-by:
Maria Khalusova <kafooster@gmail.com> * Update docs/source/en/tasks/image_feature_extraction.md Co-authored-by:
Maria Khalusova <kafooster@gmail.com> * Update image_feature_extraction.md * Update image_feature_extraction.md Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com> --------- Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com> Co-authored-by:
Maria Khalusova <kafooster@gmail.com>
-
Andrei Panferov authored
Cleaner Cache `dtype` and `device` extraction for CUDA graph generation for quantizers compatibility (#29079) * input_layernorm as the beacon of hope * cleaner dtype extraction * AQLM + CUDA graph test * is available check * shorter text test
-
regisss authored
-