Commits · 9960506cbee4642a75beda7753cdf23a4f493784 · chenpangpang / transformers

08 Feb, 2023 2 commits

Fix multiple `eos_token_id`s in model.generate(...) (#21461) · 9960506c

Motoki Wu authored Feb 08, 2023

* add tests with multiple eos_token_ids

* make math.prod instead of sum

* make fixup

* fix long and also use np.prod since math.prod does not exist <python 3.8

* make fixup

* add prod util

* use prod util instead of np.prod

* make fixup

* previous .long location

* use tensor ops

* remove prod

* remove prod

* update device

* make fixup

* fix none

9960506c

Generate: TF `compute_transition_scores` (#21341) · 1d9c26a4
Joao Gante authored Feb 08, 2023

1d9c26a4

07 Feb, 2023 3 commits

Cleanup quality (#21493) · 67d07487

Sylvain Gugger authored Feb 07, 2023

* Remove mentions of flake8/isort

* Clean up inits

* Deall with all other inits

* Last special rule for dummy files

67d07487

Generate: TF can now generate from embeddings in encoder-decoder models (#21475) · 1e4cf8bb
Joao Gante authored Feb 07, 2023

1e4cf8bb

[CI ] Remove `past` in favor of `pat_key_values` (#21443) · 12eb528b

Arthur authored Feb 07, 2023

* fix past renamed to past_key_value

* update more `past`that were ski^êd

* fixup

* remove changes made to rag

* refactor `_reorder_cache` to use `past_key_values`

* fix git `prepare_inputs_for_generation` to pass tests when false is needed in use_cache

12eb528b

06 Feb, 2023 2 commits

Update quality tooling for formatting (#21480) · 6f79d264

Sylvain Gugger authored Feb 06, 2023

* Result of black 23.1

* Update target to Python 3.7

* Switch flake8 to ruff

* Configure isort

* Configure isort

* Apply isort with line limit

* Put the right black version

* adapt black in check copies

* Fix copies

6f79d264

Generate: TF can now accept custom logits processors (#21454) · 49433310
Joao Gante authored Feb 06, 2023

49433310

03 Feb, 2023 1 commit
- 🚨🚨 Generate: standardize beam search behavior across frameworks (#21368) · f21af262
  Joao Gante authored Feb 03, 2023
  
  f21af262
02 Feb, 2023 1 commit

Fixes bug in the creation of ExponentialDecayLengthPenalty (#21423) · 6a3d1a98

Jorge C. Gomes authored Feb 02, 2023

input_ids_seq_length doesn't exist in the GenerationConfig, it exists as local variable in the function.

Setting exponential_decay_length_penalty therefore results in an error:
`AttributeError: 'GenerationConfig' object has no attribute 'input_ids_seq_length'`

This simple change fixes this issue, and the exponential_decay_length_penalty works as expected.

6a3d1a98

01 Feb, 2023 1 commit
- Generate: decoder-only models can generate with `inputs_embeds` (#21405) · 92ce53aa
  Joao Gante authored Feb 01, 2023
  
  92ce53aa
30 Jan, 2023 1 commit
- Generate: Relaxed `max_length` and `max_new_tokens` coexistence (#21347) · 42b60f8b
  Joao Gante authored Jan 30, 2023
```
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
```
  42b60f8b
27 Jan, 2023 2 commits

Little cleanup: let huggingface_hub manage token retrieval (#21333) · 8f3b4a1d

Lucain authored Jan 27, 2023

* Let huggingface_hub manage token retrieval

* flake8

* code quality

* adapt in every PushToHubMixin children

* add explicit return type

8f3b4a1d

[Whisper] another patch (#21324) · 0dff407d

Arthur authored Jan 27, 2023

* another patch

* fix timestamp test modeling

* let it be negative when the token is None

0dff407d

26 Jan, 2023 1 commit
- Generate: better `compute_transition_scores` examples (#21323) · 2b8feffa
  Joao Gante authored Jan 26, 2023
  
  2b8feffa
25 Jan, 2023 3 commits

[WHISPER] Small patch (#21307) · 6f3faf38

Arthur authored Jan 25, 2023

* add small patch

* update tests, forced decoder ids is not prioritary against generation config

* fix two new tests

6f3faf38

Small fix to ExponentialDecayLengthPenalty docstring (#21308) · 140c6ede

Nick Hill authored Jan 25, 2023

Currently, it incorrectly states that the exponential_decay_length_penalty tuple parameter is optional.

Also changed the corresponding type hint to be more specific.

140c6ede

[Whisper] Refactor whisper (#21252) · 255257f3

Arthur authored Jan 25, 2023

* update whisper logit processor

* add generate for whisper

* remove part of the whisper specific code from pipeline

* update logit processes

* major update

* enforce first timestamp

* update generate

* add more tests

* update new decoding strategy

* Apply suggestions from code review

* update docstring

* fixup

* default config will not have multilingual ar

* update expected tokenizer size, see pull on the hub for whisper-tiny

255257f3

24 Jan, 2023 1 commit

[GenerationConfig] add additional kwargs handling (#21269) · 94a7edd9

Arthur authored Jan 24, 2023



* add additional kwargs handling

* fix issue when serializing

* correct order of kwargs removal for serialization in from dict

* add `dict_torch_dtype_to_str` in case a dtype is needed for generation

* add condition when adding the kwargs : not from config

* Add comment based on review
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* add test function

* default None when poping arg
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

94a7edd9

23 Jan, 2023 1 commit
- Generate: precision fix in compute_transition_scores doctests (#21251) · c8d719ff
  Joao Gante authored Jan 23, 2023
  
  c8d719ff
20 Jan, 2023 1 commit
- Generate: documented function to compute the transition scores (#21191) · af37d183
  Joao Gante authored Jan 20, 2023
```
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
```
  af37d183
19 Jan, 2023 1 commit

Add hallucination filter (#18675) · b9403e95

Karim Foda authored Jan 19, 2023



* Add hallucination penalty

* Make quality changes

* Inverse penalty

* Fix imports & quality

* Fix name spelling issue

* set encoder_repetition_penalty and fix quality

* Fix failing test

* Add to config_common_kwargs

* Fix modelling_rag error

* Update src/transformers/generation_logits_process.py
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Remove breakpoint

* Make style fixes

* Update encoder_repetition_penalty default value

* Merge latest main changes

* Make fixup changes

* Add EncoderRepetitionPenaltyLogitsProcessor to generation/__init__.py

* Fix repo-inconsistency

* Remove venv

* Remove tensorflow-macos & add tests

* Add documentation

* Fix quality issues

* move encoder_repetition_penalty to config

* Update src/transformers/configuration_utils.py
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Update src/transformers/generation/configuration_utils.py
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Remove encoder_repetition_penalty from tests

* Fix type error

* Fix format error
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

b9403e95

17 Jan, 2023 6 commits

Add Epsilon- and Eta-Sampling (#21121) · 865da84a

Sherman Siu authored Jan 17, 2023

* Add epsilon- and eta-sampling.

Add epsilon- and eta-sampling, following the official code from https://github.com/john-hewitt/truncation-sampling and adapting to be more configurable, as required by Huggingface transformers.

* Add unit tests for epsilon- and eta-sampling.

* Black: fix code formatting.

* Fix docstring spacing.

* Clean up newlines.

* Fix implementation bugs and their associated tests.

* Remove epsilon- and eta-sampling parameters from PretrainedConfig.

* Clarify and clean up the documentation.

* Remove parameters for PretrainedConfig test.

865da84a

Refactoring of the text generate API docs (#21112) · 02488103

Maria Khalusova authored Jan 17, 2023

* initial commit, refactoring the text generation api reference

* removed repetitive code examples

* Refactoring the text generation docs to reduce repetition

* make style

02488103

Whisper Timestamp processor and prediction (#20620) · bb300ac6

Arthur authored Jan 17, 2023



* add draft logit processor

* add template functions

* update timesapmt processor parameters

* draft script

* simplify code

* cleanup

* fixup and clean

* update pipeline

* style

* clean up previous idea

* add tokenization utils

* update tokenizer and asr output

* fit whisper type

* style and update test

* clean test

* style test

* update tests

* update error test

* udpate code (not based on review yet)

* update tokenization

* update asr pipeline

* update code

* cleanup and update test

* fmt

* remove text verificatino

* cleanup

* cleanup

* add model test

* update tests

* update code add docstring

* update code and add docstring

* fix pipeline tests

* add draft logit processor

add template functions

update timesapmt processor parameters

draft script

simplify code

cleanup

fixup and clean

update pipeline

style

clean up previous idea

add tokenization utils

update tokenizer and asr output

fit whisper type

style and update test

clean test

style test

update tests

update error test

udpate code (not based on review yet)

update tokenization

update asr pipeline

update code

cleanup and update test

fmt

remove text verificatino

cleanup

cleanup

add model test

update tests

update code add docstring

update code and add docstring

fix pipeline tests

* Small update.

* Fixup.

* Tmp.

* More support.

* Making `forced_decoder_ids` non mandatory for users to set.

* update and fix first bug

* properly process sequence right after merge if last

* tofo

* allow list inputs + compute begin index better

* start adding tests

* add the 3 edge cases

* style

* format sequences

* fixup

* update

* update

* style

* test passes, edge cases should be good

* update last value

* remove Trie

* update tests and expec ted values

* handle bigger chunk_length

* clean tests a bit

* refactor chunk iter and clean pipeline

* update tests

* style

* refactor chunk iter and clean pipeline

* upade

* resolve comments

* Apply suggestions from code review
Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>

* take stride right into account

* update test expected values

* Update code based on review
Co-authored-by: sgugger <sylvain.gugger@gmail.com>
Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>
Co-authored-by: sgugger <sylvain.gugger@gmail.com>

bb300ac6

Clarify and add missing typical_p argument docstring. (#21095) · 8896ebb9

Sherman Siu authored Jan 17, 2023



* Clarify and add missing typical_p docstring.

* Make the docstring easier to understand.

* Clarify typical_p docstring

Accept the suggestion by @stevhliu for paraphrasing the docstring.
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Use the same docstring as in GenerationConfig

Follow the suggestion suggested by @stevhliu in the pull request conversation.

* Fix docstring spacing.
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

8896ebb9

Small simplification to TopKLogitsWarper (#21130) · 3bbc2451
Nick Hill authored Jan 17, 2023
```
The max of top_k and min_tokens_to_keep performed on every call can just be done once up-front.
```
3bbc2451
Generate: TF contrastive search must pop `use_cache` from `model_kwargs` (#21149) · 7b5e943c
Joao Gante authored Jan 17, 2023

7b5e943c

16 Jan, 2023 1 commit

Add `min_new_tokens` argument in generate() (implementation based on... · fa906a26

Silver authored Jan 16, 2023

Add `min_new_tokens` argument in generate() (implementation based on `MinNewTokensLengthLogitsProcessor`) (#21044)

add a new parameter min_new_tokens for generate()

fa906a26

08 Jan, 2023 1 commit

Replace `past` with `past_key_values` (#20944) · f0577df6

Arthur authored Jan 08, 2023

* start cleanup

* more updates

* more models are affected

* more updates

* update generation utils

* style

* revert change that removed reorder cachce

* update generation utils

* style

* style

* remove reorder cache

f0577df6

05 Jan, 2023 3 commits
- Generate: FLAX uses `GenerationConfig` as the basis for `.generate()` parametrization (#21007) · bc53fc62
  Joao Gante authored Jan 05, 2023
  
  bc53fc62
- Generate: FLAX infers pad token in its absence and has functional example (#21009) · beb24f2a
  Joao Gante authored Jan 05, 2023
  
  beb24f2a
- Generate: post-generate config TF doctest fix (#21018) · 480799f7
  Joao Gante authored Jan 05, 2023
  
  480799f7
04 Jan, 2023 1 commit
- Generate: TF uses `GenerationConfig` as the basis for `.generate()` parametrization (#20994) · a6c850e4
  Joao Gante authored Jan 04, 2023
  
  a6c850e4
03 Jan, 2023 4 commits

Add custom stop token ids for generation (#20727) · 45da7cec

Motoki Wu authored Jan 03, 2023

* Add StopIdStoppingCriteria

* add a working test for stop id criteria

* add to global scope

* add stop_ids to generate

* add pipeline test

* use tokenizer encode in test

* add test to generation utils

* reformat

* fixup

* make-fix-copies

* rename to stop_token_id

* use stop_tokens instead

* add to text to text generation

* make fixup

* make repo-consistency

* Add support for list of ints for eos_token_id inside generation/utils.py

* Instead of having if elses, cast the eos_token_id into a List[int]

* Add List[int] support for logits_process.py

* add List[int] for beam_search.py

* add List[int] for forced_eos_token_id

* revert stop token id stopping criteria changes

* make fixup

* fix tests

* add eos_token_id to generation/utils.py and added tests test_utils.py

* add eos_token_id type hints and fix for pad tokens

* add comments

* remove some prints and remove forced false test

* fix

* put back test_stop_sequence_stopping_criteria

* remove unused import and make fixup

* add a none check

* update docstring

* add more docstring for list ints

* make fixup

45da7cec

Enable `decoder_attention_mask` in `generate` function (#20726) · 15c68c67

samuelpullely authored Jan 03, 2023

* Enable `decoder_attention_mask` in `generate` function

* Make style corrections

* Run `make repo-consistency`

* Add integration test

15c68c67

`MinNewTokensLengthLogitsProcessor` for `.generate` method #20814 (#20892) · 367fdf33

Konstantin Kotik authored Jan 03, 2023



* feat: add min new length logit processor

* test: add min new length logit processor

* docs: add MinNewTokensLengthLogitsProcessor

* feat: import MinNewTokensLengthLogitsProcessor

* fix: update pytorch dummy objects

* refactor & fix: rename attributes and var and get rid of dynamic attribute

* tests: align test with new interface

* docs: fix typo

* docs: minor clarification

* Empty-Commit

* empty commit

* run automated quality edits
Co-authored-by: Joao Gante <joao@huggingface.co>

367fdf33

Generate: delete unused TF `_reorder_cache` (#20964) · 4fd89e49
Joao Gante authored Jan 03, 2023

4fd89e49

02 Jan, 2023 1 commit

Generate: TF XLA beam sample (#20927) · 588faad1

Joao Gante authored Jan 02, 2023

* beam sample in beam search

* rag now works with the updated beam search

* delete legacy (non-XLA) generation code related to beam sample

588faad1

28 Dec, 2022 1 commit
- Generate: correctly detect default max length (#20911) · bbcd9618
  Joao Gante authored Dec 28, 2022
```
correctly detect default max length
```
  bbcd9618
21 Dec, 2022 1 commit
- Generate: post-generate config doctest fix (#20804) · 829e8894
  Joao Gante authored Dec 21, 2022
```
* fix doctests

* revert unwanted change
```
  829e8894