- 10 Oct, 2023 1 commit
-
jiqing-feng authored
* control first downsample stride
* reduce first only works for ResNetBottleNeckLayer
* fix param name
* fix style
-
- 09 Oct, 2023 10 commits
-
Isaac Chung authored
fix docstrings for vanilla CLIP
-
Lysandre Debut authored
* Fix stale bot
* Comments
-
Alex Bzdel authored
* removed DonutImageProcessor from objects_to_ignore
* added docstring for DonutImageProcessor
* re-adding donut file
* moved docstring to correct location
-
Isaac Chung authored
fix docstring for CLIPImageProcessor
-
Isaac Chung authored
* fix docstrings for CLIP configs
* black formatted
-
tom white authored
* fix typos in idefics.md

  Two typos found in reviewing this documentation.
  1) max_new_tokens=4 is not sufficient to generate "Vegetables" as indicated; you will get only "Veget". (Incidentally, some mention of how to select this value might be useful, as it seems to change in each example.)
  2) inputs = processor(prompts, return_tensors="pt").to(device), as inputs need to be on the same device (as they are in all other examples on the page).
* Update idefics.md

  Change device to cuda explicitly to match other examples.
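The second fix above is the usual pattern of moving every input tensor onto the model's device before calling `generate()`. A minimal sketch with plain torch tensors standing in for the processor's output (not the actual idefics processor, which returns a `BatchEncoding` with its own `.to(device)`):

```python
import torch

device = "cuda" if torch.cuda.is_available() else "cpu"

# Stand-in for what a processor returns: a dict of CPU tensors.
inputs = {
    "input_ids": torch.tensor([[101, 2023, 102]]),
    "attention_mask": torch.tensor([[1, 1, 1]]),
}

# Move every tensor to the model's device before generation.
inputs = {name: tensor.to(device) for name, tensor in inputs.items()}

assert all(t.device.type == device for t in inputs.values())
```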
-
Yih-Dar authored
fix avoid oom
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
D. Carpintero authored
* fix OpenAI GPT, GPT-2 links
* fix Llama2 link
-
Shreyas S authored
Update test_integration.py
Fixed malapropism: clone -> copy
-
NielsRogge authored
* Convert checkpoints
* Update doc test
* Address comment
-
- 06 Oct, 2023 11 commits
-
Jabasukuriputo Wang authored
-
Yih-Dar authored
example fix docstring
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Arthur authored
* make sure eos and bos are properly handled for fast tokenizer
* fix code llama as well
* nits
* fix the conversion script as well
* fix failing test
-
statelesshz authored
* remove SharedDDP as it was deprecated
* apply review suggestion
* make style
* Oops, forgot to remove the compute_loss context manager in Seq2SeqTrainer
* remove the unnecessary conditional statement
* keep the logic of IPEX
* clean code
* mix precision setup & make fixup

Co-authored-by: statelesshz <jihuazhong1@huawei.com>
-
Yih-Dar authored
* fix
* fix
* Fix
* Fix

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
rui-ren authored
-
Matt authored
-
fxmarty authored
* remove unnecessary unsqueeze-squeeze in llama
* correct other models
* fix
* revert gpt_neox_japanese
* fix copies
* fix test
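The commit above removes `unsqueeze`/`squeeze` pairs that round-trip a tensor back to its original shape. A minimal sketch of why such a pair is a no-op (standalone torch with made-up shapes, not the actual modeling code):

```python
import torch

# Stand-in for a (batch, seq_len, hidden) activation in a LLaMA-style model.
hidden_states = torch.randn(2, 8, 16)

# Inserting a singleton dimension and immediately removing it
# returns a tensor with identical shape and contents.
round_tripped = hidden_states.unsqueeze(1).squeeze(1)

assert round_tripped.shape == hidden_states.shape
assert torch.equal(round_tripped, hidden_states)
```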
-
Tianqi Liu authored
* Update tokenization_code_llama_fast.py
* Update test_tokenization_code_llama.py
* Update test_tokenization_code_llama.py
-
Towdo authored
-
Ramiro Leal-Cavazos authored
* Remove unnecessary `view` of `position_ids` in `modeling_llama`

  When `position_ids` is `None`, its value is generated using `torch.arange`, which creates a tensor of size `(seq_length + past_key_values_length) - past_key_values_length = seq_length`. The tensor is then unsqueezed, resulting in a tensor of shape `(1, seq_length)`. This means that the last `view` to a tensor of shape `(-1, seq_length)` is a no-op. This commit removes the unnecessary view.
* Remove no-op `view` of `position_ids` in rest of transformer models
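The shape reasoning in the commit message can be checked in isolation. A sketch with standalone torch and made-up lengths (not the actual `modeling_llama` code):

```python
import torch

seq_length, past_key_values_length = 5, 3

# How the model builds position_ids when the caller passes None:
position_ids = torch.arange(
    past_key_values_length,
    seq_length + past_key_values_length,
    dtype=torch.long,
)
position_ids = position_ids.unsqueeze(0)  # shape: (1, seq_length)

# The view to (-1, seq_length) that the commit removes changes nothing:
viewed = position_ids.view(-1, seq_length)
assert viewed.shape == position_ids.shape == (1, seq_length)
assert torch.equal(viewed, position_ids)
```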
-
- 05 Oct, 2023 11 commits
-
Yih-Dar authored
Fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Maria Khalusova authored
* build the table in index.md with links to the model_doc
* removed list generation on index.md
* fixed missing models
* make style
-
Yih-Dar authored
Fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
eajechiloae authored
don't close clearml task if it was created externally
-
Marvin Gabler authored
* feat: close #26566, changed model & config files to accept arbitrary in and out channels
* updated docstrings
* fix: linter error
* fix: update Copy docstrings
* fix: linter update
* fix: rename num_channels_in to num_channels to prevent breaking changes
* fix: make num_channels_out None per default
* Update src/transformers/models/swin2sr/configuration_swin2sr.py (Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>)
* fix: update tests to include num_channels_out
* fix: linter
* fix: remove normalization with precomputed rgb values when #input_channels != #output_channels

Co-authored-by: marvingabler <marvingabler@outlook.de>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
-
Younes Belkada authored
* fix silent bug in `keep_in_fp32` modules
* final fix
* added a common test
* Trigger CI
* revert
-
Charles Bensimon authored
* Make `ModelOutput` serializable

  Original PR from diffusers: https://github.com/huggingface/diffusers/pull/5234
* Black
-
Yih-Dar authored
* fix
* fix
* fix

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Yun Dai authored
* Set `presents=None` when `use_cache` is set to False for activation ckpt
* Update modeling_falcon.py
* fix black
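The pattern the commit above applies can be sketched without the model: accumulate the per-layer key/value cache only when `use_cache` is True, and return `None` otherwise rather than an empty tuple a caller might mistake for a real cache. Hypothetical helper and names, not the actual Falcon code:

```python
def run_layers(num_layers, use_cache):
    # presents collects each layer's key/value pair only when caching is on.
    presents = () if use_cache else None
    for layer_idx in range(num_layers):
        layer_kv = ("k%d" % layer_idx, "v%d" % layer_idx)  # stand-in for tensors
        if use_cache:
            presents = presents + (layer_kv,)
    return presents

assert run_layers(2, use_cache=True) == (("k0", "v0"), ("k1", "v1"))
assert run_layers(2, use_cache=False) is None
```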
-
Arthur authored
* Faster rotary embedding for GPTNeoX
* there might be unnecessary moves from device
* fixup
* fix dtype issue
* add copied from statements
* fix copies
* oupsy
* add copied from Llama for scaled ones as well
* fixup
* fix
* fix copies
-
Arthur authored
fix
-
- 04 Oct, 2023 7 commits
-
Yeyang authored
* translate installation to zh
* fix translation typo
-
Sanchit Gandhi authored
* fix wav2vec2 doctest
* suggestion
* fix
* final fix
* revert since we need AddedTokens
-
Galland authored
-
Arthur authored
skip flaky
-
Soyoung Yoon authored
Fix bug in convert_t5x_checkpoint_to_pytorch.py
-
Matt authored
-
dg845 authored
Add `# Copied from` statements to audio feature extractors that use the `floats_list` function.
-