Commits · d30cf3d02fc58aeee56447a5f51b60d10557aab8 · chenpangpang / transformers

26 Jul, 2023 7 commits

Fix past CI after #24334 (#25113) · d30cf3d0
Yih-Dar authored Jul 26, 2023
```
update
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
d30cf3d0
update `use_auth_token` -> `token` (#25083) · 224da5df
Yih-Dar authored Jul 26, 2023
```
* update

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
224da5df

fix "UserWarning: Creating a tensor from a list of numpy.ndarrays is … (#24772) · c53c8e49

Leo authored Jul 26, 2023



fix "UserWarning: Creating a tensor from a list of numpy.ndarrays is extremely slow. Please consider converting the list to a single numpy.ndarray with numpy.array() before converting to a tensor."
Co-authored-by: 刘长伟 <hzliuchw@corp.netease.com>

c53c8e49

Add descriptive docstring to TemperatureLogitsWarper (#24892) · 04a5c859

David Reguera authored Jul 26, 2023

* Add descriptive docstring to TemperatureLogitsWarper

It addresses https://github.com/huggingface/transformers/issues/24783



* Remove niche features
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Commit suggestion
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Refactor the examples to simpler ones

* Add a missing comma
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Make args description more compact
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Remove extra text after making description more compact
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Fix linter

---------
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

04a5c859

Fix `PvtModelIntegrationTest::test_inference_fp16` (#25106) · 31acba56
Yih-Dar authored Jul 26, 2023
```
update
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
31acba56

🌐

[i18n-KO] Translated pipeline_webserver.md to Korean (#24828) · ee63520a

Kihoon Son authored Jul 26, 2023



* translated pipeline_webserver.md
Co-Authored-By: Hyeonseo Yun <0525yhs@gmail.com>
Co-Authored-By: Wonhyeong Seo <wonhseo@kakao.com>
Co-Authored-By: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-Authored-By: Gabriel Yang <gabrielwithhappy@gmail.com>
Co-Authored-By: Nayeon Han <nayeon2.han@gmail.com>
Co-Authored-By: Jungnerd <46880056+jungnerd@users.noreply.github.com>

* Update pipeline_webserver.md

* Apply suggestions from code review
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>
Co-authored-by: Sangam Lee <74291999+augustinLib@users.noreply.github.com>
Co-authored-by: Kim haewon <ehdvkf02@naver.com>

---------
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-authored-by: Gabriel Yang <gabrielwithhappy@gmail.com>
Co-authored-by: Nayeon Han <nayeon2.han@gmail.com>
Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>
Co-authored-by: Sangam Lee <74291999+augustinLib@users.noreply.github.com>
Co-authored-by: Kim haewon <ehdvkf02@naver.com>

ee63520a

documentation for llama2 models (#25102) · 277d3aed
Shauray Singh authored Jul 26, 2023
```
* fix documentation

* changes
```
277d3aed

25 Jul, 2023 28 commits

fix tied_params for meta tensor (#25101) · a5cc30d7
Marc Sun authored Jul 25, 2023
```
* fix tied_params for meta tensor

* remove duplicate
```
a5cc30d7

Bump certifi from 2022.12.7 to 2023.7.22 in /examples/research_projects/visual_bert (#25097) · f1deb21f

dependabot[bot] authored Jul 25, 2023

Bump certifi in /examples/research_projects/visual_bert

Bumps [certifi](https://github.com/certifi/python-certifi) from 2022.12.7 to 2023.7.22.
- [Commits](https://github.com/certifi/python-certifi/compare/2022.12.07...2023.07.22

)

---
updated-dependencies:
- dependency-name: certifi
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

f1deb21f

Bump certifi from 2022.12.7 to 2023.7.22 in... · 45bde362

dependabot[bot] authored Jul 25, 2023

Bump certifi from 2022.12.7 to 2023.7.22 in /examples/research_projects/decision_transformer (#25098)

Bump certifi in /examples/research_projects/decision_transformer

Bumps [certifi](https://github.com/certifi/python-certifi) from 2022.12.7 to 2023.7.22.
- [Commits](https://github.com/certifi/python-certifi/compare/2022.12.07...2023.07.22

)

---
updated-dependencies:
- dependency-name: certifi
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

45bde362

Bump certifi from 2022.12.7 to 2023.7.22 in /examples/research_projects/lxmert (#25096) · 6b8dbc28

dependabot[bot] authored Jul 25, 2023

Bump certifi in /examples/research_projects/lxmert

Bumps [certifi](https://github.com/certifi/python-certifi) from 2022.12.7 to 2023.7.22.
- [Commits](https://github.com/certifi/python-certifi/compare/2022.12.07...2023.07.22

)

---
updated-dependencies:
- dependency-name: certifi
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

6b8dbc28

Fix doctest (#25031) · da5ff18a

Yih-Dar authored Jul 25, 2023



fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

da5ff18a

[`T5`, `MT5`, `UMT5`] Add [T5, MT5, UMT5]ForSequenceClassification (#24726) · 8f36ab3e

Sebastian Husch Lee authored Jul 25, 2023

* Initial addition of t5forsequenceclassification

* Adding imports and adding tests

* Formatting

* Running make fix-copies

* Adding mt5forseq

* Formatting

* run make fix-copies

* Adding to docs

* Add model_parallel

* Fix bug

* Fix

* Remove TODO

* Fixing tests for T5ForSequenceClassification

* Undo changes to dependency_versions_table.py

* Change classification head to work with T5Config directly

* Change seq length to let tests pass

* PR comments for formatting

* Formatting

* Initial addition of UMT5ForSequenceClassification

* Adding to inits and formatting

* run make fix-copies

* Add doc for UMT5ForSeqClass

* Update UMT5 config

* Fix docs

* Skip torch fx test for SequenceClassification

* Formatting

* Add skip to UMT5 tests as well

* Fix umt5 tests

* Running make fix-copies

* PR comments

* Fix for change to sentence_representation

* Rename seq_len to hidden_size since that's what it is

* Use base_model to follow format of the rest of the library

* Update docs

* Extract the decoder_input_ids changes and make one liner

* Make one-liner

8f36ab3e

Hotfix for failing `MusicgenForConditionalGeneration` tests (#25091) · 21150cb0
Yih-Dar authored Jul 25, 2023
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
21150cb0

[ `PreTrainedTokenizerFast`] Keep properties from fast tokenizer (#25053) · f9cc3338

Arthur authored Jul 25, 2023

* draft solution

* use `setdefault`

* nits

* add tests and fix truncation issue

* fix test

* test passes locally

* quality

* updates

* update tsets

f9cc3338

Edit err message and comment in `test_model_is_small` (#25087) · 0779fc8e
Connor Henderson authored Jul 25, 2023
```
* Edit err message and comment in

* put back 80M comment
```
0779fc8e
[`TF`] Also apply patch to support left padding (#25085) · 2fac3422
Arthur authored Jul 25, 2023
```
* tf versions

* apply changes to other models

* 3 models slipped through the cracks
```
2fac3422

[ `ForSequenceClassification`] Support `left` padding (#24979) · f1045227

Arthur authored Jul 25, 2023

* support left padding

* nit

* Update src/transformers/models/gpt_neox/modeling_gpt_neox.py

* Update src/transformers/models/gpt_neox/modeling_gpt_neox.py

f1045227

Allow generic composite models to pass more kwargs (#24927) · 1e662f0f

Yih-Dar authored Jul 25, 2023



* fix

* Update src/transformers/generation/utils.py
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* update

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

1e662f0f

🌐

[i18n-KO] Translated `perf_infer_cpu.md` to Korean (#24920) · b51312e2

김준재_T3056 authored Jul 25, 2023



* docs: ko: perf_infer_cpu.md

* feat: chatgpt draft

* fix: manual edits

* Update docs/source/ko/_toctree.yml

* Update docs/source/ko/perf_infer_cpu.md

* Update docs/source/ko/perf_infer_cpu.md

이 부분은 저도 걸리적거렸던 부분입니다. 반영하겠습니다!
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>

* Update docs/source/ko/perf_infer_cpu.md

동의합니다! 제가 원본에 너무 얽매여 있었네요!
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>

* Update docs/source/ko/perf_infer_cpu.md

말씀하신대로 원문에 너무 집착했던것 같습니다
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>

* Update docs/source/ko/perf_infer_cpu.md

더 나은 어휘 사용에 감사드립니다!
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>

* Update docs/source/ko/perf_infer_cpu.md

이 당시 '주기'란 용어를 생각해내질 못했네요...
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>

* Update docs/source/ko/perf_infer_cpu.md

좀 더 자연스러운 문맥이 됐네요!
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>

* Update docs/source/ko/perf_infer_cpu.md

굳이 원본 형식에 얽매일 필요가 없군요!
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>

* Update docs/source/ko/perf_infer_cpu.md
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>

---------
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>

b51312e2

[DOCS] add example NoBadWordsLogitsProcessor (#25046) · b99f7bd4
Gema Parreño authored Jul 25, 2023
```
* add example NoBadWordsLogitsProcessor

* fix L764 & L767

* make style
```
b99f7bd4

[`MPT`] Add MosaicML's `MPT` model to transformers (#24629) · dcb183f4

Arthur authored Jul 25, 2023



* draft add new model like

* some cleaning of the config

* nits

* add nested configs

* nits

* update

* update

* added layer norms + triton kernels

* consider only LPLayerNorm for now.

* update

* all keys match.

* Update

* fixing nits here and there

* working forward pass.

* removed einops dependency

* nits

* format

* add alibi

* byebye head mask

* refactor attention

* nits.

* format

* fix nits.

* nuke ande updates

* nuke tokenizer test

* don't reshape query with kv heads

* added a bit of documentation.

* remove unneeded things

* nuke more stuff

* nit

* logits match - same generations

* rm unneeded methods

* 1 remaining failing CI test

* nit

* fix nits

* fix docs

* fix docs

* rm tokenizer

* fixup

* fixup

* fixup and fix tests

* fixed configuration object.

* use correct activation

* few minor fixes

* clarify docs a bit

* logits match à 1e-12

* skip and unskip a test

* added some slow tests.

* fix readme

* add more details

* Update docs/source/en/model_doc/mpt.md
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Apply suggestions from code review
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* fix configuration issues

* more fixes in config

* added more models

* Apply suggestions from code review
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* remove unneeded position ids

* fix some  comments

* Apply suggestions from code review
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* revert suggestion

* mpt alibi + added batched generation

* Update src/transformers/models/mpt/__init__.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* remove init config

* Update src/transformers/models/mpt/configuration_mpt.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* fix nit

* add another slow test

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* fits in one line

* some refactor because make fixup doesn't pass

* add ft notebook

* update md

* correct doc path

---------
Co-authored-by: younesbelkada <younesbelkada@gmail.com>
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

dcb183f4

Fix: repeat per sample for SAM image embeddings (#25074) · 1dbc1440
Xiaoke Huang authored Jul 25, 2023
```
Repeat per sample for SAM image embeddings
```
1dbc1440
🌐 [i18n-KO] Translated `hpo_train.md` to Korean (#24968) · cb8abee5
Harheem Kim authored Jul 25, 2023
```
* dos: ko: hpo_train.mdx

* feat: chatgpt draft

* fix: manual edits

* fix: resolve suggestions
```
cb8abee5

[`generate`] Only warn users if the `generation_config`'s `max_length` is set... · f2c1df93

Arthur authored Jul 25, 2023

[`generate`]  Only warn users if the `generation_config`'s `max_length` is set to the default value (#25030)

* check max length is default

* nit

* update warning: no-longer deprecate

* comment in the configuration_utils in case max length's default gets changed in the futur

f2c1df93

replace `per_gpu_eval_batch_size` with `per_device_eval_batch_size` in readme... · c879318c

Alan Ji authored Jul 25, 2023

replace `per_gpu_eval_batch_size` with `per_device_eval_batch_size` in readme of multiple-choice task (#25078)

replace `per_gpu_eval_batch_size` with `per_device_eval_batch_size`
in readme of multiple-choice

c879318c

Fix broken link in README_hd.md (#25067) · 25e443c0
Susnato Dhar authored Jul 25, 2023
```
Update README_hd.md
```
25e443c0
Set `TF32` flag for PyTorch cuDNN backend (#25075) · 6bc61aa7
Xuehai Pan authored Jul 25, 2023

6bc61aa7
fix: add TOC anchor link (#25066) · 5dba88b2
Injin Paek authored Jul 25, 2023

5dba88b2
Fix last models for common tests that are too big. (#25058) · f295fc8a
Sylvain Gugger authored Jul 25, 2023
```
* Fix last models for common tests that are too big.

* Remove print statement
```
f295fc8a

🌐

[i18n-KO] Translated `perf_hardware.md` to Korean (#24966) · ee1eb3b3

Sangam Lee authored Jul 25, 2023



* docs: ko: perf_hardware.md

* feat: nmt draft

* fix: manual edits

* fix: resolve suggestions
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>

* fix: resolve suggestions
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>

* fix: resolve suggestions
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>

* fix: resolve suggestions
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>

* fix: resolve suggestions
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>

* fix: resolve suggestions
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>

* fix: resolve suggestions
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>

* fix: resolve suggestions
Co-authored-by: Haewon Kim <ehdvkf02@naver.com>

* Fix: manual edits

* fix: manual edits

* fix: manual edits

* fix: manual edits

* fix: fix rendering error of perf_hardware.md

---------
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>
Co-authored-by: Haewon Kim <ehdvkf02@naver.com>

ee1eb3b3

🌐

[i18n-KO] Translated `<tf_xla>.md` to Korean (#24904) · f6fe1d55

Haewon Kim authored Jul 25, 2023

* docs: ko: tf_xla.md

* feat: chatgpt draft

* fix: manual edits

* fix: manual edits

* fix: manual edits

* fix: resolve suggestions

f6fe1d55

[Docs] fix rope_scaling doc string (#25072) · faf25c04
Kashif Rasul authored Jul 25, 2023
```
fix rope_scaling doc string
```
faf25c04
Generate - add beam indices output in contrained beam search (#25042) · c0742b15
Joao Gante authored Jul 25, 2023

c0742b15
[`RWKV`] Add note in doc on `RwkvStoppingCriteria` (#25055) · c53a6eae
Arthur authored Jul 25, 2023
```
* Add note in doc on `RwkvStoppingCriteria`

* give some breathing space to the code
```
c53a6eae

24 Jul, 2023 5 commits

Better error message when signal is not supported on OS (#25049) · d2295708
Sylvain Gugger authored Jul 24, 2023
```
* Better error message when signal is not supported on OS

* Address review comments
```
d2295708

🌐

[i18n-KO] Translated `perf_train_cpu.md` to Korean (#24911) · c0d1c330

seank021 authored Jul 25, 2023



* dos: ko: perf_train_cpu.md

* feat: chatgpt draft

* fix: manual edits

* fix: resolve suggestions

* fix: manual edits
Co-authored-by: Haewon Kim <ehdvkf02@naver.com>

---------
Co-authored-by: Haewon Kim <ehdvkf02@naver.com>

c0d1c330

[`8bit`] Fix 8bit corner case with Blip2 8bit (#25047) · b08f41e6
Younes Belkada authored Jul 24, 2023
```
fix 8bit corner case with Blip2 8bit
```
b08f41e6

compute_loss in trainer failing to label shift for PEFT model when label... · 3611fc90

Nate Brake authored Jul 24, 2023


compute_loss in trainer failing to label shift for PEFT model when label smoothing enabled. (#25044)

* added PeftModelForCausalLM to MODEL_FOR_CAUSAL_LM_MAPPING_NAMES dict

* check for PEFT model in compute_loss section

---------
Co-authored-by: Nathan Brake <nbrake3@mmm.com>

3611fc90

Pvt model (#24720) · a03d13c8

Rinat authored Jul 24, 2023

* pull and push updates

* add docs

* fix modeling

* Add and run test

* make copies

* add task

* fix tests and fix small issues

* Checks on a Pull Request

* fix docs

* add desc pvt.md

a03d13c8