Commits · 3bc65505fc0801e3d9ff741ec725fb0cb4d863d6 · chenpangpang / transformers

12 Oct, 2023 1 commit

Fix doctest for `Blip2ForConditionalGeneration` (#26737) · 3bc65505

Yih-Dar authored Oct 12, 2023



* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

3bc65505

11 Oct, 2023 15 commits

Translated the accelerate.md file of the documentation to Chinese (#26161) · e1cec434

TERRY LEE authored Oct 12, 2023



* translate accelerate page

* Update docs/source/zh/accelerate.md
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

e1cec434

add japanese documentation (#26138) · 9b7668c0

Rockerz authored Oct 11, 2023



* udpaet

* update

* Update docs/source/ja/autoclass_tutorial.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* add codes workflows/build_pr_documentation.yml

* Create preprocessing.md

* added traning.md

* Create Model_sharing.md

* add quicktour.md

* new

* ll

* Create benchmark.md

* Create Tensorflow_model

* add

* add community.md

* add create_a_model

* create custom_model.md

* create_custom_tools.md

* create fast_tokenizers.md

* create

* add

* Update docs/source/ja/_toctree.yml
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* md

* add

* commit

* add

* h

* Update docs/source/ja/peft.md
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

* Update docs/source/ja/_toctree.yml
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

* Update docs/source/ja/_toctree.yml
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

* Suggested Update

* add perf_train_gpu_one.md

* added perf based MD files

* Modify toctree.yml and Add transmartion to md codes

* Add `serialization.md` and edit `_toctree.yml`

* add task summary and tasks explained

* Add and Modify files starting from T

* Add testing.md

* Create main_classes files

* delete main_classes folder

* Add toctree.yml

* Update llm_tutorail.md

* Update docs/source/ja/_toctree.yml
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update misspelled filenames

* Update docs/source/ja/_toctree.yml
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/ja/_toctree.yml

* Update docs/source/ja/_toctree.yml

* missplled file names inmrpovements

* Update _toctree.yml

* close tip block

* close another tip block

* Update docs/source/ja/quicktour.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/ja/pipeline_tutorial.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/ja/pipeline_tutorial.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/ja/preprocessing.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/ja/peft.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/ja/add_new_model.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/ja/testing.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/ja/task_summary.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/ja/tasks_explained.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update glossary.md

* Update docs/source/ja/transformers_agents.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/ja/llm_tutorial.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/ja/create_a_model.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/ja/torchscript.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/ja/benchmarks.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/ja/troubleshooting.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/ja/troubleshooting.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/ja/troubleshooting.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/ja/add_new_model.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update perf_torch_compile.md

* Update Year to default in en documentation

* Final Update

---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

9b7668c0

[docstring] Fix docstring for `CodeLlamaTokenizer` (#26709) · 797a1bab
Bojun-Feng authored Oct 11, 2023
```
* update check_docstrings

* update docstring
```
797a1bab

[docstring] Fix docstring for `LlamaTokenizer` and `LlamaTokenizerFast` (#26669) · aaccf184

Minho Ryang authored Oct 12, 2023

* [docstring] Fix docstring for `LlamaTokenizer` and `LlamaTokenizerFast`

* [docstring] Fix docstring typo at `LlamaTokenizer` and `LlamaTokenizerFast`

aaccf184

Revert #20715 (#26734) · e58cbed5

Yih-Dar authored Oct 11, 2023



* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

e58cbed5

Update docker files to use `torch==2.1.0` (#26735) · b219ae6b

Yih-Dar authored Oct 11, 2023



Update docker files to use torch 2.1
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

b219ae6b

Fix checkpoint path in `no_trainer` scripts (#26733) · 1d6a8474
Zach Mueller authored Oct 11, 2023
```
checkpoint path
```
1d6a8474
Fix stale bot for locked issues (#26711) · 6ecb2ab6
Lysandre Debut authored Oct 11, 2023

6ecb2ab6
fix the model card issue as `use_cuda_amp` is no more available (#26731) · 69873d52
Sourab Mangrulkar authored Oct 11, 2023

69873d52

[docstring] `SwinModel` docstring fix (#26679) · cc44ca80

Shivanand authored Oct 11, 2023



* remove from utils

* updated doc string

* only in the model

* Update src/transformers/models/swin/modeling_swin.py
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>

* Update src/transformers/models/swin/modeling_swin.py
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>

---------
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>

cc44ca80

[Assistant Generation] Improve Encoder Decoder (#26701) · da69de17

Patrick von Platen authored Oct 11, 2023

* [Assistant Generation] Improve enc dec

* save more

* Fix logit processor checks

* Clean

* make style

* fix deprecation

* fix generation test

* Apply suggestions from code review

* fix biogpt

* make style

da69de17

`Copied from` for test files (#26713) · 5334796d

Yih-Dar authored Oct 11, 2023



* copied statement for test files

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

5334796d

Update docs to explain disabling callbacks using report_to (#26155) · 9f406392
Ben Gubler authored Oct 11, 2023
```
* feat: update callback doc to explain disabling callbacks using report_to

* docs: update report_to docstring
```
9f406392

In assisted decoding, pass model_kwargs to model's forward call (fix... · dcc49d8a

Billy Bradley authored Oct 11, 2023

In assisted decoding, pass model_kwargs to model's forward call (fix prepare_input_for_generation in all models) (#25242)

* In assisted decoding, pass model_kwargs to model's forward call

Previously, assisted decoding would ignore any additional kwargs
that it doesn't explicitly handle. This was inconsistent with other
generation methods, which pass the model_kwargs through
prepare_inputs_for_generation and forward the returned dict to the
model's forward call.

The prepare_inputs_for_generation method needs to be amended in all
models, as previously it only kept the last input ID when a past_key_values
was passed.

* Improve variable names in _extend_attention_mask

* Refactor extending token_type_ids into a function

* Replace deepcopy with copy to optimize performance

* Update new persimmon model with llama changes for assisted generation

* Update new mistral model for assisted generation with prepare_inputs_for_generation

* Update position_ids creation in falcon prepare_inputs_for_generation to support assisted generation

dcc49d8a

Make Whisper Encoder's sinusoidal PE non-trainable by default (#26032) · 1e3c9dda

Thien Tran authored Oct 11, 2023



* set encoder's PE as non-trainable

* freeze flax

* init sinusoids

* add test for non-trainable embed positions

* simplify TF encoder embed_pos

* revert tf

* clean up

* add sinusoidal init for jax

* make consistent sinusoidal function

* fix dtype

* add default dtype

* use numpy for sinusoids. fix jax

* add sinusoid init for TF

* fix

* use custom embedding

* use specialized init for each impl

* fix sinusoids init. add test for pytorch

* fix TF dtype

* simplify sinusoid init for flax and tf

* add tests for TF

* change default dtype to float32

* add sinusoid test for flax

* Update src/transformers/models/whisper/modeling_flax_whisper.py
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* Update src/transformers/models/whisper/modeling_tf_whisper.py
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* move sinusoidal init to _init_weights

---------
Co-authored-by: sanchit-gandhi <sanchit@huggingface.co>
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

1e3c9dda

10 Oct, 2023 6 commits

[JAX] Replace uses of `jnp.array` in types with `jnp.ndarray`. (#26703) · fc639143

Roy Hvaara authored Oct 10, 2023

`jnp.array` is a function, not a type:
https://jax.readthedocs.io/en/latest/_autosummary/jax.numpy.array.html


so it never makes sense to use `jnp.array` in a type annotation. Presumably the intent was to write `jnp.ndarray` aka `jax.Array`.
Co-authored-by: Peter Hawkins <phawkins@google.com>

fc639143

Fix source_prefix default value (#26654) · 3eceaa36
jheitmann authored Oct 10, 2023

3eceaa36
fix a typo in flax T5 attention - attention_mask variable is misnamed (#26663) · 975003ea
théo gigant authored Oct 10, 2023
```
* fix a typo in flax t5 attention

* fix the typo in flax longt5 attention
```
975003ea

[docstring] Fix docstring for `LlamaConfig` (#26685) · e8fdd787

Pavarissy authored Oct 10, 2023

* Your commit message here

* fix LlamaConfig docstring

* run make fixup

* fix formatting after review

reformat of the file to prevent script issues

* rerun make fixup after reformat

e8fdd787

Fix Typo: table in deepspeed.md (#26705) · a9862a0f
Tuowei Wang authored Oct 10, 2023

a9862a0f

Control first downsample stride in ResNet (#26374) · 592f2eab

jiqing-feng authored Oct 10, 2023

* control first downsample stride

* reduce first only works for ResNetBottleNeckLayer

* fix param name

* fix style

592f2eab

09 Oct, 2023 10 commits

[docstring] Fix docstrings for `CLIP` (#26691) · a5e6df82
Isaac Chung authored Oct 09, 2023
```
fix docstrings for vanilla clip
```
a5e6df82
Fix stale bot (#26692) · 87b4ade9
Lysandre Debut authored Oct 09, 2023
```
* Fix stale bot

* Comments
```
87b4ade9

[docstring] Fix docstring for DonutImageProcessor (#26641) · 3257946f

Alex Bzdel authored Oct 09, 2023

* removed donutimageprocessor from objects_to_ignore

* added docstring for donutimageprocessor

* readding donut file

* moved docstring to correct location

3257946f

[docstring] Fix docstring for `CLIPImageProcessor` (#26676) · d2f06dff
Isaac Chung authored Oct 09, 2023
```
fix docstring for CLIPImageProcessor
```
d2f06dff
[docstring] Fix docstring CLIP configs (#26677) · 3763101f
Isaac Chung authored Oct 09, 2023
```
* fix docstrings for CLIP configs

* black formatted
```
3763101f

fix typos in idefics.md (#26648) · c7f01bee

tom white authored Oct 09, 2023

* fix typos in idefics.md

Two typos found in reviewing this documentation.

1) max_new_tokens=4, is not sufficient to generate "Vegetables" as indicated - you will get only "Veget". (incidentally - some mention of how to select this value might be useful as it seems to change in each example)

2) inputs = processor(prompts, return_tensors="pt").to(device) as inputs need to be on the same device (as they are in all other examples on the page)

* Update idefics.md

Change device to cuda explicitly to match other examples

c7f01bee

Avoid CI OOM (#26639) · 740fc6a1

Yih-Dar authored Oct 09, 2023



fix avoid oom
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

740fc6a1

fix links in README.md for the GPT, GPT-2, and Llama2 Models (#26640) · 8835bff6
D. Carpintero authored Oct 09, 2023
```
* fix OpenAI GPT, GPT-2 links

* fix Llama2 link
```
8835bff6
Fixed malapropism error (#26660) · 86a4e5a9
Shreyas S authored Oct 09, 2023
```
Update test_integration.py

Fixed malapropism clone>copy
```
86a4e5a9
[DINOv2] Convert more checkpoints (#26177) · 2629c8f3
NielsRogge authored Oct 09, 2023
```
* Convert checkpoints

* Update doc test

* Address comment
```
2629c8f3

06 Oct, 2023 8 commits
- docs(zh): review and punctuation & space fix (#26627) · 897a826d
  Jabasukuriputo Wang authored Oct 06, 2023
  
  897a826d
- [docstring] Fix docstring for `AlbertConfig` (#26636) · 360ea8fc
  Yih-Dar authored Oct 06, 2023
```
example fix docstring
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  360ea8fc
- [`LlamaTokenizerFast`] Adds edge cases for the template processor (#26606) · 9ad815e4
  Arthur authored Oct 06, 2023
```
* make sure eos and bos are properly handled for fast tokenizer

* fix code llama as well

* nits

* fix the conversion script as well

* fix failing test
```
  9ad815e4
- remove SharedDDP as it is deprecated (#25702) · 27597fea
  statelesshz authored Oct 06, 2023
```
* remove SharedDDP as it was drepracated

* apply review suggestion

* make style

* Oops,forgot to remove the compute_loss context manager in Seq2SeqTrainer.

* remove the unnecessary conditional statement

* keep the logic of IPEX

* clean code

* mix precision setup & make fixup

---------
Co-authored-by: statelesshz <jihuazhong1@huawei.com>
```
  27597fea
- Fix failing `MusicgenTest .test_pipeline_text_to_audio` (#26586) · e840aa67
  Yih-Dar authored Oct 06, 2023
```
* fix

* fix

* Fix

* Fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  e840aa67
- fix RoPE t range issue for fp16 (#26602) · 87499420
  rui-ren authored Oct 06, 2023
  
  87499420
- Update chat template docs with more tips on writing a template (#26625) · ea52ed9d
  Matt authored Oct 06, 2023
  
  ea52ed9d
- Remove unnecessary unsqueeze - squeeze in rotary positional embedding (#26162) · 64845307
  fxmarty authored Oct 06, 2023
```
* remove unnecessary unsqueeze-squeeze in llama

* correct other models

* fix

* revert gpt_neox_japanese

* fix copie

* fix test
```
  64845307