Commits · bb1d0d0d9e7ca356cf5673031183e955cc160158 · chenpangpang / transformers

14 Dec, 2023 7 commits
- Fix languages covered by M4Tv2 (#28019) · bb1d0d0d
  Yoach Lacombe authored Dec 14, 2023
```
* correct language assessment  + add tests

* Update src/transformers/models/seamless_m4t_v2/modeling_seamless_m4t_v2.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* make style + simplify and enrich test

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
```
- SeamlessM4T: `test_retain_grad_hidden_states_attentions` is flaky (#28035) · e2b16485
  Joao Gante authored Dec 14, 2023
  
- Generate: assisted decoding now uses `generate` for the assistant (#28030) · 9e5c28c5
  Joao Gante authored Dec 14, 2023
```
generate refactor
```
- Fix AMD push CI not triggered (#28029) · dde6c427
  Yih-Dar authored Dec 14, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
- [`core` / `modeling`] Fix training bug with PEFT + GC (#28031) · 73de5108
  Younes Belkada authored Dec 14, 2023
```
fix training bug
```
- [`SeamlessM4TTokenizer`] Safe import (#28026) · 2788f8d8
  Arthur authored Dec 14, 2023
```
safe import
```
- well well well (#28011) · 131a528b
  Arthur authored Dec 14, 2023
  
13 Dec, 2023 10 commits

add `modules_in_block_to_quantize` arg in GPTQconfig (#27956) · 17506d12

Marc Sun authored Dec 13, 2023



* add inside_layer_modules arg

* fix

* change to modules_to_quantize_inside_block

* fix

* rename again

* Apply suggestions from code review
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* better docstring

* fix again with less explanation

* Update src/transformers/utils/quantization_config.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* style

---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>


Add model_docs from cpmant.md to deformable_detr.md (#27884) · fe44b1f1

Rockerz authored Dec 13, 2023



* upfaste

* Update

* Update docs/source/ja/model_doc/deformable_detr.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/ja/model_doc/data2vec.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/ja/model_doc/cvt.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* add suggestions

* Toctree update

* remove git references

* Update docs/source/ja/_toctree.yml
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/ja/model_doc/decision_transformer.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>


Dev version · 3ed3e319
Lysandre authored Dec 13, 2023


[Doc] Spanish translation of glossary.md (#27958) · 815ea8e8

Aaron Jimenez authored Dec 13, 2023

* Add glossary to es/_toctree.yml

* Add glossary.md to es/

* A section translated

* B and C section translated

* Fix typo in en/glossary.md C section

* D section translated | Add an extra line in en/glossary.md

* E and F section translated | Fix typo in en/glossary.md

* Fix words preentrenado

* H and I section translated | Fix typo in en/glossary.md

* L section translated

* M and N section translated

* P section translated

* R section translated

* S section translated

* T section translated

* U and Z section translated | Fix TensorParallel link in both files

* Fix word


Fix bug with rotating checkpoints (#28009) · 93766251

Zach Mueller authored Dec 13, 2023

* Fix bug

* Write test

* Keep back old modification for grad accum steps

* Whitespace...

* Whitespace again

* Race condition

* Wait for everyone


[`CI slow`] Fix expected values (#27999) · ec43d687
Arthur authored Dec 13, 2023
```
* fix expected values

* style

* test is slow
```

Fix PatchTSMixer slow tests (#27997) · 749f94e4

Arindam Jati authored Dec 13, 2023



* fix slow tests

* revert formatting

---------
Co-authored-by: Arindam Jati <arindam.jati@ibm.com>
Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com>


Adds VIP-llava to transformers (#27932) · c7f076a0

Younes Belkada authored Dec 13, 2023

* v1

* add-new-model-like

* revert

* fix forward and conversion script

* revert

* fix copies

* fixup

* fix

* Update docs/source/en/index.md

* Apply suggestions from code review

* push

* fix

* fixes here and there

* up

* fixup and fix tests

* Apply suggestions from code review

* add docs

* fixup

* fixes

* docstring

* add docstring

* fixup

* docstring

* fixup

* nit

* docs

* more copies

* fix copies

* nit

* update test


[`Whisper`] raise better errors (#27971) · 371fb0b7
Arthur authored Dec 13, 2023
```
* [`Whisper`] raise better errors
fixes #27893

* update torch as well
```
[`Tokenizer Serialization`] Fix the broken serialisation (#27099) · 230ac352
Arthur authored Dec 13, 2023
```
* nits

* nits

* actual fix

* style

* ze fix

* fix fix fix style
```

12 Dec, 2023 7 commits
- fix typo in dvclive callback (#27983) · f4db565b
  Dave Berenbaum authored Dec 12, 2023
  
- [doc] fix typo (#27981) · 99361430
  Stas Bekman authored Dec 12, 2023
  
- Fix SDPA correctness following torch==2.1.2 regression (#27973) · 78172dcd
  fxmarty authored Dec 12, 2023
```
* fix sdpa with non-contiguous inputs for gpt_bigcode

* fix other archs

* add currently comment

* format
```
- Better key error for AutoConfig (#27976) · 5e4ef0a0
  Matt authored Dec 12, 2023
```
* Improve the error printed when loading an unrecognized architecture

* Improve the error printed when loading an unrecognized architecture

* Raise a ValueError instead because KeyError prints weirdly

* make fixup
```
- Fix link in README.md of Image Captioning (#27969) · a49f4aca
  saswatmeher authored Dec 12, 2023
```
Update the link for vision encoder decoder doc used by
FlaxVisionEncoderDecoderModel link.
```
- Hot-fix-mixtral-loss (#27948) · 680c610f
  Arthur authored Dec 12, 2023
```
* fix loss computation

* compute on GPU if possible
```
- Generate: `assisted_decoding` now accepts arbitrary candidate generators (#27750) · 4b759da8
  Joao Gante authored Dec 12, 2023
```
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
```
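
The AutoConfig commit above (5e4ef0a0) switches to `ValueError` "because KeyError prints weirdly". A minimal sketch of that Python behaviour follows. The message text is made up for illustration and is not the one used in transformers.

```python
# Hypothetical multi-line error message (not the real transformers text).
message = "Unrecognized model type 'foo'.\nCheck the spelling or upgrade the library."

# str() of a KeyError with a single argument is the repr() of that argument,
# so the text is wrapped in quotes and the newline is shown as an escaped "\n".
key_error_text = str(KeyError(message))

# str() of a ValueError is the message itself, with a real line break.
value_error_text = str(ValueError(message))

print(key_error_text)
print(value_error_text)
```

Because a traceback's last line uses the same string conversion, a long explanatory message reads cleanly under `ValueError` but appears quoted and on a single line under `KeyError`.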
11 Dec, 2023 16 commits

fixed typos (issue 27919) (#27920) · e6604247

Anthony Susevski authored Dec 11, 2023



* fixed typos (issue 27919)

* Update docs/source/en/tasks/knowledge_distillation_for_image_classification.md
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>


Support PeftModel signature inspect (#27865) · e5079b0b

dancingpipi authored Dec 12, 2023



* Support PeftModel signature inspect

* Use get_base_model() to get the base model

---------
Co-authored-by: shujunhua1 <shujunhua1@jd.com>


[docs] Fused AWQ modules (#27896) · 35478182
Steven Liu authored Dec 11, 2023
```
streamline
```
Update bounding box format everywhere (#27944) · 67b1335c
NielsRogge authored Dec 11, 2023
```
Update formats
```
[`Mixtral`] Change mistral op order (#27955) · 54d0b1c2
Younes Belkada authored Dec 11, 2023
```
up
```

fix no sequence length models error (#27522) · 4850aaba

Adam Louly authored Dec 11, 2023

* fix no sequence length models error

* block size check

---------

Co-authored-by: Adam Louly <adamlouly@microsoft.com@orttrainingdev9.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net>


Fix for stochastic depth decay rule in the TimeSformer implementation (#27875) · 4b4b8642
Ashish Tawari authored Dec 11, 2023
```
Update modeling_timesformer.py

Fixing typo to correct the stochastic depth decay rule
```
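
For context on the TimeSformer commit above (4b4b8642): a stochastic depth decay rule assigns each layer its own drop-path rate. The sketch below shows the linear rule commonly used in vision transformers, in plain Python. It is an assumption about the general technique, not a copy of `modeling_timesformer.py`, and `linear_drop_path_rates` is a hypothetical helper name.

```python
def linear_drop_path_rates(max_rate: float, depth: int) -> list[float]:
    """Return one drop-path rate per layer, rising linearly from 0 to max_rate."""
    if depth < 1:
        raise ValueError("depth must be at least 1")
    if depth == 1:
        # A single layer gets the starting rate of the schedule.
        return [0.0]
    # Layer i (0-based) gets max_rate * i / (depth - 1).
    return [max_rate * i / (depth - 1) for i in range(depth)]


# Example: five layers with a maximum rate of 0.1.
rates = linear_drop_path_rates(0.1, 5)
print(rates)
```

Under this rule each layer has to use its own entry of the schedule. Applying one shared rate to every layer would give up the depth-dependent decay.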
fix bug in mask2former: cost matrix is infeasible (#27897) · c0a354d8
Chenhao Xu authored Dec 12, 2023
```
fix bug: cost matrix is infeasible
```

Fix a couple of typos and add an illustrative test (#26941) · 7e35f370

rjenc29 authored Dec 11, 2023

* fix a typo and add an illustrative test

* appease black

* reduce code duplication and add Annotion type back with a pending deprecation warning

* remove unused code

* change warning type

* black formatting fix

* change enum deprecation approach to support 3.8 and earlier

* add stacklevel

* fix black issue

* fix ruff issues

* fix ruff issues

* move tests to own mixin

* include yolos

* fix black formatting issue

* fix black formatting issue

* use logger instead of warnings and include target version for deprecation


Add deepspeed test to amd scheduled CI (#27633) · 39acfe84

Ella Charlaix authored Dec 11, 2023



* add deepspeed scheduled test for amd

* fix image

* add dockerfile

* add comment

* enable tests

* trigger

* remove trigger for this branch

* trigger

* change runner env to trigger the docker build image test

* use new docker image

* remove test suffix from docker image tag

* replace test docker image with original image

* push new image

* Trigger

* add back amd tests

* fix typo

* add amd tests back

* fix

* comment until docker image build scheduled test fix

* remove deprecated deepspeed build option

* upgrade torch

* update docker & make tests pass

* Update docker/transformers-pytorch-deepspeed-amd-gpu/Dockerfile

* fix

* tmp disable test

* precompile deepspeed to avoid timeout during tests

* fix comment

* trigger deepspeed tests with new image

* comment tests

* trigger

* add sklearn dependency to fix slow tests

* enable back other tests

* final update

---------
Co-authored-by: Felix Marty <felix@hf.co>
Co-authored-by: Félix Marty <9808326+fxmarty@users.noreply.github.com>
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>


Fix AMD scheduled CI not triggered (#27951) · 0f59d2f1
Yih-Dar authored Dec 11, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
In PreTrainedTokenizerBase add missing word in error message (#27949) · 417bb914
Peter Götz authored Dec 11, 2023
```
"text input must of type" -> "text input must be of type"
```
Fix parameter count in readme for mixtral 45b (#27945) · 5cec306c
Timon Käch authored Dec 11, 2023
```
fix parameter count in readme
```
Update import message (#27946) · 921a6bf2
NielsRogge authored Dec 11, 2023
```
* Update import message

* Update message
```
Fix test for auto_find_batch_size on multi-GPU (#27947) · 44127ec6
Zach Mueller authored Dec 11, 2023
```
* Fix test for multi-GPU

* With CPU handle
```

Docs for AutoBackbone & Backbone (#27456) · b911c1f1

Merve Noyan authored Dec 11, 2023



* Initial commit for AutoBackbone & Backbone

* Added timm and clarified out_indices

* Swapped the example to out_indices

* fix toctree

* Update autoclass_tutorial.md

* Update backbones.md

* Update autoclass_tutorial.md

* Add dummy torch input instead

* Add dummy torch input

* Update autoclass_tutorial.md

* Update backbones.md

* minor fix

* Update docs/source/en/main_classes/backbones.md
Co-authored-by: Maria Khalusova <kafooster@gmail.com>

* Update docs/source/en/autoclass_tutorial.md
Co-authored-by: Maria Khalusova <kafooster@gmail.com>

* Added illustrations and explained backbone & neck

* Update docs/source/en/main_classes/backbones.md
Co-authored-by: Maria Khalusova <kafooster@gmail.com>

* Update backbones.md

---------
Co-authored-by: Maria Khalusova <kafooster@gmail.com>
