Commits · 1c7e5e236823cd38faac8115f96205a82c17fff9 · chenpangpang / transformers

21 Jul, 2023 1 commit
- fix fsdp checkpointing issues (#24926) · 1c7e5e23
  Sourab Mangrulkar authored Jul 21, 2023
```
* fix fsdp load

* Update trainer.py

* remove saving duplicate state_dict
```
  1c7e5e23
20 Jul, 2023 15 commits

Fallback for missing attribute `Parameter.ds_numel` (#24942) · 9ef5256d
Apoorv Khandelwal authored Jul 20, 2023
```
* [trainer] fallback for deepspeed param count

* [trainer] more readable numel count
```
9ef5256d
Contrastive Search peak memory reduction (#24120) · caf5e369
Benjamin Badger authored Jul 20, 2023
```
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
```
caf5e369
Change logic for logging in the examples (#24956) · aa1b09c5
Zach Mueller authored Jul 20, 2023
```
Change logic
```
aa1b09c5
[`RWKV`] Add Gradient Checkpointing support for RWKV (#24955) · 89a1f342
Younes Belkada authored Jul 20, 2023
```
add GC support for RWKV
```
89a1f342

Bump aiohttp from 3.8.1 to 3.8.5 in /examples/research_projects/decision_transformer (#24954) · 9f912ef6

dependabot[bot] authored Jul 20, 2023

Bump aiohttp in /examples/research_projects/decision_transformer

Bumps [aiohttp](https://github.com/aio-libs/aiohttp) from 3.8.1 to 3.8.5.
- [Release notes](https://github.com/aio-libs/aiohttp/releases)
- [Changelog](https://github.com/aio-libs/aiohttp/blob/v3.8.5/CHANGES.rst)
- [Commits](https://github.com/aio-libs/aiohttp/compare/v3.8.1...v3.8.5

)

---
updated-dependencies:
- dependency-name: aiohttp
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

9f912ef6

fix type annotations for arguments in training_args (#24550) · e75cb0cb

Shauray Singh authored Jul 20, 2023

* testing

* example script

* fix typehinting

* some tests

* make test

* optional update

* Union of arguments

* does this fix the issue

* remove reports

* set default to False

* documentation change

* None support

* does not need None

* Fix typing annotations for FSDP and DeepSpeed in TrainingArguments (#24549)

* Fix typing annotations for FSDP and DeepSpeed in TrainingArguments

* Change dict to Dict

* Revert "Fix typing annotations for FSDP and DeepSpeed in TrainingArguments" (#24574)

Revert "Fix typing annotations for FSDP and DeepSpeed in TrainingArguments (#24549)"

This reverts commit c5e29d43

.

* Fix typing annotations for FSDP and DeepSpeed in TrainingArguments (#24549)

* Fix typing annotations for FSDP and DeepSpeed in TrainingArguments

* Change dict to Dict

* merge

* hacky fix

* fixup

---------
Co-authored-by: Max Ryabinin <mryabinin0@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

e75cb0cb

[DOCS] Example for `LogitsProcessor` class (#24848) · 0c41765d

Shauray Singh authored Jul 20, 2023

* make docs

* fixup

* resolved

* remove debugs

* Revert "fixup"

This reverts commit 5e0f636aae0bf8707bc8bdaa6a9427fbf66834ed.

* prev (ignore)

* fixup broke some files

* remove files

* reverting modeling_reformer

* lang fix

0c41765d

Fix `main_input_name` in `src/transformers/keras_callbacks.py` (#24916) · 35c04596
Yih-Dar authored Jul 20, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
35c04596
Update processing_vision_text_dual_encoder.py (#24950) · 85514c17
Premtim Sa authored Jul 20, 2023
```
Fixing small typo: kwrags -> kwargs
```
85514c17

Bump pygments from 2.11.2 to 2.15.0 in /examples/research_projects/decision_transformer (#24949) · 98598066

dependabot[bot] authored Jul 20, 2023

Bump pygments in /examples/research_projects/decision_transformer

Bumps [pygments](https://github.com/pygments/pygments) from 2.11.2 to 2.15.0.
- [Release notes](https://github.com/pygments/pygments/releases)
- [Changelog](https://github.com/pygments/pygments/blob/master/CHANGES)
- [Commits](https://github.com/pygments/pygments/compare/2.11.2...2.15.0

)

---
updated-dependencies:
- dependency-name: pygments
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

98598066

Generate: sequence bias can handle same terminations (#24822) · 89136ff7
Joao Gante authored Jul 20, 2023

89136ff7
replace no_cuda with use_cpu in test_pytorch_examples (#24944) · 37d8611a
statelesshz authored Jul 20, 2023
```
* replace no_cuda with use_cpu in test_pytorch_examples

* remove codes that never be used

* fix style
```
37d8611a

Deprecate unused OpenLlama architecture (#24922) · 79444f37

Tom Aarsen authored Jul 20, 2023

* Resolve typo in check_repo.py

* Specify encoding when opening modeling files

* Deprecate the OpenLlama architecture

* Add disclaimer pointing to Llama

I'm open to different wordings here

* Match the capitalisation of LLaMA

79444f37

Add multi-label text classification support to pytorch example (#24770) · 8fd8c8e4

ranchlai authored Jul 20, 2023

* Add text classification example

* set the problem type and finetuning task

* ruff reformated

* fix bug for unseting label_to_id for regression

* update README.md

* fixed finetuning task

* update comment

* check if label exists in feature before removing

* add useful logging

8fd8c8e4

🌐

[i18n-KO] Translated`tasks/document_question_answering.md` to Korean (#24588) · 7381987f

Jungnerd authored Jul 20, 2023



* docs: ko: `document_question_answering.md`

* fix: resolve suggestions
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>

* fix: resolve suggestions
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>

---------
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>

7381987f

19 Jul, 2023 8 commits

[doc] `image_processing_vilt.py` wrong default documented (#24931) · 6112b1c6
Stas Bekman authored Jul 19, 2023
```
[doc] image_processing_vilt.py wrong default
```
6112b1c6
[`Llama2`] replace `self.pretraining_tp` with `self.config.pretraining_tp` (#24906) · ee4250a3
Younes Belkada authored Jul 19, 2023
```
* add possibility to disable TP

* fixup

* adapt from offline discussions
```
ee4250a3
Fix minor llama2.md model doc typos (#24909) · 3a43794d
Travis Cline authored Jul 19, 2023
```
Update llama2.md

 Fix typos in the llama2 model doc
```
3a43794d
fix typo in BARK_PRETRAINED_MODEL_ARCHIVE_LIST (#24902) · 99c1268e
lee1jun authored Jul 19, 2023
```
fix typo in BARK_PRETRAINED_MODEL_ARCHIVE_LIST

suno/barh should be suno/bark
```
99c1268e
Fixed issue where ACCELERATE_USE_CPU="False" results in bool(True) (#24907) · aa4afa67
Madhava Jay authored Jul 19, 2023
```
- This results in cpu mode on Apple Silicon mps
```
aa4afa67
Fix `test_model_parallelism` for `FalconModel` (#24914) · 243b2ea3
Yih-Dar authored Jul 19, 2023
```
* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
243b2ea3

Update tested versions in READMEs (#24895) · c0359702

Eliah Kagan authored Jul 19, 2023

* Update supported Python and PyTorch versions in readme

* Update Python, etc. versions in non-English readmes

These were more out of date than in the English readme. This
updates all the versions the readmes claim the repository is tested
with to the same versions stated in the English readme.

Those versions are current at least in the case of the Python and
PyTorch versions (and less out of date for the others).

* Propagate trailing whitespace fix to model list

This runs "make fix-copies". The only change is the removal of
whitespace. No actual information or wording is changed.

* Update tested TensorFlow to 2.6 in all readmes

Per pinning in setup.py

Unlike Python and PyTorch, the minimum supported TensorFlow version
has not very recently changed, but old versions were listed in all
READMEs.

c0359702

Avoid some pipeline tasks to use `use_cache=True` (#24893) · 129cb6d5

Yih-Dar authored Jul 19, 2023



* fix

* fix

* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

129cb6d5

18 Jul, 2023 12 commits

Check for accelerate env var when doing CPU only (#24890) · 476be08c
Zach Mueller authored Jul 18, 2023
```
Check for use-cpu
```
476be08c
Disable ipex env var if false (#24885) · a982c022
Zach Mueller authored Jul 18, 2023
```
Disable ipex if in use
```
a982c022

[`Llama2`] Add support for Llama 2 (#24891) · 07360b6c

Arthur authored Jul 18, 2023



* add llama

* add other readmes

* update padding id in readme

* add link to paper

* fix paths and tokenizer

* more nits

* styling

* fit operation in 2 lines when possible

* nits

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* add form

* update reademe

* update readme, we don't have a default pad token

* update test and tokenization

* LLaMA instead of Llama

* nits

* add expected text

* add greeedy output

* styling

* Update src/transformers/models/llama/modeling_llama.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* sequential device map

* skip relevant changes

---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

07360b6c

Separate CircleCI cache between `main` and `pull` (or other branches) (#24886) · 30c172fc
Yih-Dar authored Jul 18, 2023
```
* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
30c172fc
check if eval dataset is dict (#24877) · dd49404a
Hwijeen Ahn authored Jul 18, 2023
```
* check if eval dataset is dict

* formatting
```
dd49404a
[`Blip`] Fix blip output name (#24889) · 5c5cb4ee
Younes Belkada authored Jul 18, 2023
```
* fix blip output name

* add property

* oops

* fix failing test
```
5c5cb4ee
[`InstructBlip`] Fix int8/fp4 issues (#24888) · a9e067a4
Younes Belkada authored Jul 18, 2023
```
* fix dtype issue

* revert `.float()`

* fix copies
```
a9e067a4

Add DINOv2 (#24016) · 3ec10e6c

NielsRogge authored Jul 18, 2023

* First draft

* More improvements

* Convert patch embedding layer

* Convert all weights

* Make conversion work

* Improve conversion script

* Fix style

* Make all tests pass

* Add image processor to auto mapping

* Add swiglu ffn

* Add image processor to conversion script

* Fix conversion of giant model

* Fix documentation

* Fix style

* Fix tests

* Address comments

* Address more comments

* Remove unused arguments

* Remove more arguments

* Rename parameters

* Include mask token

* Address comments

* Add docstring

* Transfer checkpoints

* Empty commit

3ec10e6c

Enable `ZeroShotAudioClassificationPipelineTests::test_small_model_pt` (#24882) · 57da42ad
Yih-Dar authored Jul 18, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
57da42ad
add ascend npu accelerator support (#24879) · 9c875839
statelesshz authored Jul 18, 2023
```
* Add Ascend NPU accelerator support

* fix style warining
```
9c875839

Fix CircleCI cache (#24880) · f14c7f99

Yih-Dar authored Jul 18, 2023



* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

f14c7f99

[`Docs`] Clarify 4bit docs (#24878) · ca974aff

Younes Belkada authored Jul 18, 2023



* clarify 4bit docs

* Apply suggestions from code review
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>

---------
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>

ca974aff

17 Jul, 2023 4 commits
- Remove `tests/onnx` (#24868) · 2ab75add
  Yih-Dar authored Jul 17, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  2ab75add
- Skip Add model like job (#24865) · d561408c
  Sylvain Gugger authored Jul 17, 2023
  
  d561408c
- Skip failing `ZeroShotAudioClassificationPipelineTests::test_small_model_pt` for now (#24867) · 870dfc15
  Yih-Dar authored Jul 17, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  870dfc15
- deprecate no_cuda (#24863) · 9dc965bb
  Marc Sun authored Jul 17, 2023
```
* deprecate no_cuda

* style

* remove doc

* remove doc 2

* fix style
```
  9dc965bb