- 13 Sep, 2023 11 commits
-
-
Joao Gante authored
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Joao Gante authored
-
김준재_T3056 authored
-
Younes Belkada authored
* Final fix RWKV 4bit
* fixup
* add a test
* add more clarifications
-
Vaibhav Srivastav authored
update names mapping for spectrogram and waveform models
-
Juarez Bochi authored
Add missing space in utils.py; the warning read as "... to control thegeneration length. We ..."
-
Younes Belkada authored
* fix 4bit `num_parameters`
* stronger check
-
amyeroberts authored
Fix docstring typo
-
Sourab Mangrulkar authored
* fix the deepspeed tests * resolve comment
-
Sourab Mangrulkar authored
-
- 12 Sep, 2023 12 commits
-
-
Tanay Mehta authored
* add: check to remove metaspace from marian tokenizer
* fix: metaspace character being removed from everywhere
* fix: remove redundant check at top
* add: test for marian tokenizer decode fix
* fix: simplified the test
-
Joao Gante authored
-
Phuc Van Phan authored
-
Wang, Yi authored
* enable optuna multi-objectives feature
* Apply suggestions from code review
* update hpo doc
* update docstring
* extend direction to List[str] type
* Update src/transformers/integrations/integration_utils.py

Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
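The `direction`-to-`List[str]` extension described above can be sketched as a small normaliser. This is a hypothetical helper for illustration, not the actual `integration_utils.py` code:

```python
def normalize_direction(direction):
    """Accept either a single optuna direction or a list of them.

    Hypothetical sketch of the List[str] extension: single-objective
    callers keep passing a string, multi-objective runs pass a list.
    """
    allowed = {"minimize", "maximize"}
    directions = [direction] if isinstance(direction, str) else list(direction)
    if not directions or any(d not in allowed for d in directions):
        raise ValueError(f"invalid direction(s): {direction!r}")
    return directions
```

With a normaliser like this, downstream code can treat every run as multi-objective and simply check `len(directions)`.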
-
MinJae Kang authored
* docs: ko-contributing.md
* feat: chatGPT draft
* feat: manual edits
* feat: change linked document
* fix: resolve review suggestions (applied repeatedly)
* feat: delete file to resolve error

Co-authored-by: Haewon Kim <ehdvkf02@naver.com>
Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com>
-
Maria Khalusova authored
* tts guide updates with a pipeline
* Apply suggestions from code review
* Update docs/source/en/tasks/text-to-speech.md

Co-authored-by: Yoach Lacombe <52246514+ylacombe@users.noreply.github.com>
Co-authored-by: Vaibhav Srivastav <vaibhavs10@gmail.com>
-
MinJae Kang authored
* docs: ko-llama2.md
* feat: chatGPT draft and manual edits
* feat: added inline TOC
* fix: inline TOC
* fix: resolve review suggestions

Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>
-
pokjay authored
* Fix issues in test_exponential_decay_length_penalty: fix tests which were broken and add validation of negative scores. The current test didn't take into account that ExponentialDecayLengthPenalty updates the score in place, resulting in updates to the base tested tensor. In addition, the gt assert had empty tensors due to indexing along the batch dimension. The test is currently expected to fail, to show the ExponentialDecayLengthPenalty issues with negative scores.
* Fix ExponentialDecayLengthPenalty negative logits issue: in cases where the scores are negative, ExponentialDecayLengthPenalty decreases the score of eos_token_id instead of increasing it. To fix this issue we compute the penalty of the absolute value and add it to the original score.
* Add examples for ExponentialDecayLengthPenalty
* Fix styling issue in ExponentialDecayLengthPenalty doc
* Apply suggestions from code review
* Style and quality fix
* Fix example outputs

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
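The negative-score fix described above (penalising the absolute value so that a negative eos score still moves upward) can be sketched in plain Python. Names and the decay formula here are illustrative, not the exact transformers implementation:

```python
def decayed_eos_score(score: float, cur_len: int, regulation_start: int,
                      regulation_factor: float) -> float:
    # Before the fix, scaling a negative eos score by a growing factor
    # pushed it further down; adding a penalty based on abs(score)
    # always raises it once the decay kicks in.
    if cur_len <= regulation_start or regulation_factor <= 1.0:
        return score
    penalty = abs(score) * (regulation_factor ** (cur_len - regulation_start) - 1)
    return score + penalty
```

Because the penalty term is non-negative, the eos score increases with length whether the original logit was positive or negative.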
-
larekrow authored
-
Joao Gante authored
-
Younes Belkada authored
import tensorflow inside relevant methods in trainer_utils
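Moving an import inside the method is the standard lazy-import pattern; a minimal sketch, using a stdlib module as a stand-in for tensorflow:

```python
def tensor_summary(values):
    # Importing inside the function keeps module import cheap; the heavy
    # dependency is only loaded when the method is actually called.
    import statistics  # stand-in for a heavy library such as tensorflow
    return {"mean": statistics.mean(values), "n": len(values)}
```

Callers that never hit this code path never pay the import cost, which is exactly the point of the change.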
-
Arthur authored
* initial commit
* updates
* nits
* update conversion script
* update conversion script
* use path to load
* add tips etc
* some modeling logic
* modeling update
* more nits
* nits
* normal layer norm
* update config and doc
* nits
* update doc, remove unused
* update
* fix inits and stuff
* fixup
* revert wrong changes
* updates
* more nits
* add default config values to the configuration file
* fixup happy
* update
* 2 tests left
* update readmes
* more nits
* slow test and more documentation
* update readme
* fix licences
* styling
* use fast if possible when saving tokenizer
* remove todo
* remove tokenization tests
* small last nits
* Apply suggestions from code review
* nits to skip the timeout doctest
* fix integration test
* fix test
* update eos token
* update to allow fast tokenization
* styling
* fix CodeLlama as well for the update post processor
* Apply suggestions from code review
* add more copied from statements
* update
* doc passes doctest
* remove `# final layer norm?`
* change docstring prompt
* update
* Update README.md
* don't doctest the conversion script as it requires more packages
* don't init a model in the config
* oups
* fix doctest

Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
- 11 Sep, 2023 4 commits
-
-
Phuc Van Phan authored
* docs: add space to docs
* docs: remove redundant space
-
Patrick von Platen authored
* improve import time
* Update src/transformers/integrations/__init__.py
* sort imports
-
Phuc Van Phan authored
-
Hang authored
only the main process should call `_save` when using DeepSpeed ZeRO-3
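The guard this change introduces can be sketched as follows; the names are hypothetical, and the real logic lives in the Trainer's save path:

```python
def save_on_main_process(state_dict, is_main_process, save_fn):
    # Under ZeRO-3, parameters are sharded across ranks and gathered into
    # a full state dict; writing it to disk must happen on the main
    # process only, otherwise every rank races to write the same file.
    if is_main_process:
        save_fn(state_dict)
        return True
    return False
```

A non-main rank simply returns without touching the filesystem.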
-
- 09 Sep, 2023 1 commit
-
-
Arthur authored
* skip failing tests until #26054 is merged * fixup
-
- 08 Sep, 2023 5 commits
-
-
Arthur authored
* fix `set_infilling_processor` to properly reset
* Add docstring!
* fixups
* more details in the documentation about the tokenization
* style
-
Harheem Kim authored
* docs: ko-llama.md
* fix: chatgpt draft
* feat: manual edits
* fix: resolve suggestions
-
Angela Yi authored
* Ignore warning if tracing with dynamo
* fix import error
* separate to function
* add test
-
Thien Tran authored
* add missing doc for activation dropout
* fix doc for SEW-D dropout
* deprecate hidden_dropout for SEW-D
-
Alexander Krauck authored
This commit corrects the dropout implementation in Graphormer, aligning it with the original implementation and improving performance. Specifically:

1. The `attention_dropout` variable, intended for use in GraphormerMultiheadAttention, was defined but not used. This has been corrected to use `attention_dropout` instead of the regular `dropout`.
2. The `activation_dropout` for the activations in the feed-forward layers was missing; the regular `dropout` was used instead. This commit adds `activation_dropout` to the feed-forward layers.

These changes ensure the dropout implementation matches the original Graphormer and delivers empirically better performance.
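The mapping that the fix establishes, which dropout rate applies at which site, can be sketched with a hypothetical helper (not the actual Graphormer code):

```python
def dropout_rate(site, dropout, attention_dropout, activation_dropout):
    # After the fix: attention probabilities use attention_dropout,
    # feed-forward activations use activation_dropout, and all other
    # sites keep the regular dropout.
    rates = {"attention": attention_dropout, "activation": activation_dropout}
    return rates.get(site, dropout)
```

Before the fix, every site effectively received the regular `dropout` rate, which is what made the model diverge from the original implementation.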
-
- 07 Sep, 2023 7 commits
-
-
dumpmemory authored
* fix loss inconsistent after resume #25340
* fix typo
* clean code
* reformatted code
* adjust code according to comments
* adjust check_dataloader_randomsampler location
* return sampler only
* handle sampler is None
* Update src/transformers/trainer_pt_utils.py (thanks @amyeroberts)

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
-
MyungHa Kwon authored
fix typo
-
raghavanone authored
* Fix vilt config init parameter to match the ones in documentation * Fix the documentation
-
Muskan Kumar authored
* Added HerBERT to README.md
* Update README.md to contain HerBERT (#26016)
* Resolved #26016: updated READMEs and index.md to contain HerBERT; ran make fix-copies
-
Sanchit Gandhi authored
* fix tokenizer
* make bs even
* fix multi gpu test
* style
* model forward
* fix torch import
* revert tok pin
-
CokeDong authored
* Add tgs metrics
* bugfix and black formatting
* workaround for tokens counting
* formatting and bugfix
* Fix
* Add opt-in for tgs metrics
* make style and fix error
* Fix doc
* fix docbuild
* hf-doc-build
* fix
* test
* Update src/transformers/training_args.py (renaming)
* Fix some symbol
* test
* Update src/transformers/trainer_utils.py (match naming patterns)
* Update src/transformers/training_args.py
* Update src/transformers/trainer.py
* Fix reviews
* Fix
* Fix black

Co-authored-by: Zach Mueller <muellerzr@gmail.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
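Assuming "tgs" stands for tokens per GPU per second (the commit does not spell the acronym out), the metric presumably reduces to something like this sketch:

```python
def tokens_per_gpu_per_second(total_tokens, num_gpus, elapsed_seconds):
    # Throughput metric: total tokens processed, divided evenly across
    # devices, per wall-clock second of training.
    if num_gpus <= 0 or elapsed_seconds <= 0:
        raise ValueError("num_gpus and elapsed_seconds must be positive")
    return total_tokens / num_gpus / elapsed_seconds
```

Making such a metric opt-in matters because counting tokens per batch adds a small per-step cost, which is the "workaround for tokens counting" mentioned above.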
-
Yih-Dar authored
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-