Commits · 92ce53aab859012f7714dae6d6fce7a7d701e75f · chenpangpang / transformers

01 Feb, 2023 1 commit
- Generate: decoder-only models can generate with `inputs_embeds` (#21405) · 92ce53aa
  Joao Gante authored Feb 01, 2023
  
  92ce53aa
31 Jan, 2023 1 commit
- Template for framework-agnostic tests (#21348) · 623346ab
  Joao Gante authored Jan 31, 2023
  
  623346ab
30 Jan, 2023 1 commit
- Generate: Relaxed `max_length` and `max_new_tokens` coexistence (#21347) · 42b60f8b
  Joao Gante authored Jan 30, 2023
```
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
```
  42b60f8b
20 Jan, 2023 1 commit
- Generate: documented function to compute the transition scores (#21191) · af37d183
  Joao Gante authored Jan 20, 2023
```
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
```
  af37d183
04 Jan, 2023 1 commit
- Generate: Fix CI related to #20727 (#21003) · b9104896
  Joao Gante authored Jan 04, 2023
  
  b9104896
03 Jan, 2023 1 commit

Add custom stop token ids for generation (#20727) · 45da7cec

Motoki Wu authored Jan 03, 2023

* Add StopIdStoppingCriteria

* add a working test for stop id criteria

* add to global scope

* add stop_ids to generate

* add pipeline test

* use tokenizer encode in test

* add test to generation utils

* reformat

* fixup

* make-fix-copies

* rename to stop_token_id

* use stop_tokens instead

* add to text to text generation

* make fixup

* make repo-consistency

* Add support for list of ints for eos_token_id inside generation/utils.py

* Instead of having if elses, cast the eos_token_id into a List[int]

* Add List[int] support for logits_process.py

* add List[int] for beam_search.py

* add List[int] for forced_eos_token_id

* revert stop token id stopping criteria changes

* make fixup

* fix tests

* add eos_token_id to generation/utils.py and added tests test_utils.py

* add eos_token_id type hints and fix for pad tokens

* add comments

* remove some prints and remove forced false test

* fix

* put back test_stop_sequence_stopping_criteria

* remove unused import and make fixup

* add a none check

* update docstring

* add more docstring for list ints

* make fixup

45da7cec

21 Nov, 2022 1 commit
- Generate: `model_kwargs` can also be an input to `prepare_inputs_for_generation` (#20353) · 4cf38148
  Joao Gante authored Nov 21, 2022
  
  4cf38148
14 Nov, 2022 1 commit
- Generate: add Bloom fixes for contrastive search (#20213) · 938cb047
  Joao Gante authored Nov 14, 2022
  
  938cb047
09 Nov, 2022 1 commit

Generate: move generation_*.py src files into generation/*.py (#20096) · f270b960

Joao Gante authored Nov 09, 2022

* move generation_*.py src files into generation/*.py

* populate generation.__init__ with lazy loading

* move imports and references from generation.xxx.object to generation.object

f270b960

01 Nov, 2022 1 commit

Generate: contrastive search with full optional outputs (#19963) · 831590f6

Joao Gante authored Nov 01, 2022

* Use beam search functionality; Add extra outputs and test

* Add full tests for contrastive search

* Add error message on unconventional cache format

831590f6

21 Oct, 2022 1 commit
- Generate: contrastive search test updates (#19787) · e0b825a8
  Joao Gante authored Oct 21, 2022
```
* contrastive search test updates

* make fixup
```
  e0b825a8
19 Oct, 2022 1 commit

Adding the state-of-the-art contrastive search decoding methods for the... · 71786b10

GMFTBY authored Oct 19, 2022

Adding the state-of-the-art contrastive search decoding methods for the codebase of generation_utils.py (#19477)

* add: the contrastive search for generaton_utils

* add: testing scripts for contrastive search under examples/text-generation

* update the quality of codes

* revise the docstring; make the generation_contrastive_search.py scripts;

* revise the examples/pytorch/text-generation/run_generation_contrastive_search.py to the auto-APIs format

* revise the necessary documents

* fix: revise the docstring of generation_contrastive_search.py

* Fix the code indentation

* fix: revise the nits and examples in contrastive_search docstring.

* fix the copyright

* delete generation_contrastive_search.py

* revise the logic in contrastive_search

* update the intergration test and the docstring

* run the tests over

* add the slow decorate to the contrastive_search intergrate test

* add more test

* do the style, quality, consistency checks

71786b10

30 Sep, 2022 1 commit
- Add stop sequence to text generation pipeline (#18444) · e3963581
  Karim Foda authored Sep 30, 2022
  
  e3963581
02 Sep, 2022 1 commit
- Generate: validate `model_kwargs` on TF (and catch typos in generate arguments) (#18651) · 9196f48b
  Joao Gante authored Sep 02, 2022
  
  9196f48b
19 Aug, 2022 1 commit
- Generate: add missing `**model_kwargs` in sample tests (#18696) · e95d433d
  Joao Gante authored Aug 19, 2022
  
  e95d433d
12 Aug, 2022 1 commit
- Generate: validate `model_kwargs` (and catch typos in generate arguments) (#18261) · ed1924e8
  Joao Gante authored Aug 12, 2022
```
* validate generate model_kwargs

* generate tests -- not all models have an attn mask
```
  ed1924e8
23 Jul, 2022 1 commit
- Generate: deprecate default `max_length` (#18018) · 7e44226f
  Joao Gante authored Jul 23, 2022
  
  7e44226f
28 Jun, 2022 1 commit
- Update expected values in constrained beam search tests (#17887) · 0b0dd977
  Yih-Dar authored Jun 28, 2022
```
* fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  0b0dd977
21 Jun, 2022 1 commit

Fix `top_k_top_p_filtering` having unexpected behavior (#17744) · 3b00b623

unifyh authored Jun 22, 2022

- Fix `top_k_top_p_filtering` not passing `filter_value` to
   `TopPLogitsWarper` causing any top-p filtered logits to be -inf
   instead of specified value

 - Add corresponding test

3b00b623

10 Jun, 2022 1 commit
- [Generation Test] Make fast test actually fast (#17661) · 13e875cc
  Patrick von Platen authored Jun 10, 2022
  
  13e875cc
19 May, 2022 1 commit
- [Generation] Fix Transition probs (#17311) · 518bd02c
  Patrick von Platen authored May 19, 2022
```
* [Draft] fix transition probs

* up

* up

* up

* make it work

* fix

* finish

* update
```
  518bd02c
12 May, 2022 1 commit

Black preview (#17217) · afe5d42d

Sylvain Gugger authored May 12, 2022

* Black preview

* Fixup too!

* Fix check copies

* Use the same version as the CI

* Bump black

afe5d42d

11 Apr, 2022 1 commit
- Generate: min length can't be larger than max length (#16668) · b0bf3011
  Joao Gante authored Apr 11, 2022
```
* min length must be smaller than max length

* Update min_length in tests
```
  b0bf3011
16 Mar, 2022 1 commit
- Fix generation min length (#16206) · 2410d0f8
  Patrick von Platen authored Mar 16, 2022
```
* up

* fix min lengths
```
  2410d0f8
07 Mar, 2022 1 commit

[Bug Fix] Beam search example in docs fails & a fix (integrating `max_length`... · ef9c3ca3

Chan Woo Kim authored Mar 07, 2022

[Bug Fix] Beam search example in docs fails & a fix (integrating `max_length` in `BeamScorer.finalize()`) (#15555)

* added the test and fix

* had left out a comment

ef9c3ca3

04 Mar, 2022 1 commit

Constrained Beam Search [*With* Disjunctive Decoding] (#15761) · 5c6f57ee

Chan Woo Kim authored Mar 05, 2022



* added classes to get started with constrained beam search

* in progress, think i can directly force tokens now but not yet with the round robin

* think now i have total control, now need to code the bank selection

* technically works as desired, need to optimize and fix design choices leading to undersirable outputs

* complete PR #1 without disjunctive decoding

* removed incorrect tests

* Delete k.txt

* Delete test.py

* Delete test.sh

* revert changes to test scripts

* genutils

* full implementation with testing, no disjunctive yet

* shifted docs

* passing all tests realistically ran locally

* removing accidentally included print statements

* fixed source of error in initial PR test

* fixing the get_device() vs device trap

* fixed documentation docstrings about constrained_beam_search

* fixed tests having failing for Speech2TextModel's floating point inputs

* fix cuda long tensor

* added examples and testing for them and founx & fixed a bug in beam_search and constrained_beam_search

* deleted accidentally added test halting code with assert False

* code reformat

* Update tests/test_generation_utils.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update tests/test_generation_utils.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update tests/test_generation_utils.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update tests/test_generation_utils.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update tests/test_generation_utils.py

* fixing based on comments on PR

* took out the testing code that should but work fails without the beam search moditification ; style changes

* fixing comments issues

* docstrings for ConstraintListState

* typo in PhrsalConstraint docstring

* docstrings improvements

* finished adding what is sort of an opinionated implementation of disjunctive generation, but it revealed errors in inner beam search logic during testing.

* fixed bug found in constrained beam search that used beam_idx that were not global across all the batches

* disjunctive constraint working 100% correctly

* passing all tests

* Accidentally included mlruns

* Update src/transformers/generation_beam_constraints.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update src/transformers/generation_beam_constraints.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* complete overhaul of type complexities and other nits

* strict type checks in generate()

* fixing second round of feedback by narsil

* fixed failing generation test because of type check overhaul

* generation test fail fix

* fixing test fails
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

5c6f57ee

23 Feb, 2022 1 commit

[Test refactor 1/5] Per-folder tests reorganization (#15725) · 29c10a41

Lysandre Debut authored Feb 23, 2022



* Per-folder tests reorganization
Co-authored-by: sgugger <sylvain.gugger@gmail.com>
Co-authored-by: Stas Bekman <stas@stason.org>

29c10a41

09 Feb, 2022 1 commit

Constrained Beam Search [without disjunctive decoding] (#15416) · 2b5603f6

Chan Woo Kim authored Feb 10, 2022



* added classes to get started with constrained beam search

* in progress, think i can directly force tokens now but not yet with the round robin

* think now i have total control, now need to code the bank selection

* technically works as desired, need to optimize and fix design choices leading to undersirable outputs

* complete PR #1 without disjunctive decoding

* removed incorrect tests

* Delete k.txt

* Delete test.py

* Delete test.sh

* revert changes to test scripts

* genutils

* full implementation with testing, no disjunctive yet

* shifted docs

* passing all tests realistically ran locally

* removing accidentally included print statements

* fixed source of error in initial PR test

* fixing the get_device() vs device trap

* fixed documentation docstrings about constrained_beam_search

* fixed tests having failing for Speech2TextModel's floating point inputs

* fix cuda long tensor

* added examples and testing for them and founx & fixed a bug in beam_search and constrained_beam_search

* deleted accidentally added test halting code with assert False

* code reformat

* Update tests/test_generation_utils.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update tests/test_generation_utils.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update tests/test_generation_utils.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update tests/test_generation_utils.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update tests/test_generation_utils.py

* fixing based on comments on PR

* took out the testing code that should but work fails without the beam search moditification ; style changes

* fixing comments issues

* docstrings for ConstraintListState

* typo in PhrsalConstraint docstring

* docstrings improvements
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

2b5603f6

24 Jan, 2022 1 commit

[Beam Search] Correct returned beam scores (#14654) · 8d6acc6c

Patrick von Platen authored Jan 24, 2022

* better

* save intermediate

* finish code

* up

* docs

* Apply suggestions from code review

* up

* add compute transition  beam scores function to model and make sure scores are correct with eos

* apply nicos comments

* Apply suggestions from code review

* another fix

8d6acc6c

30 Dec, 2021 1 commit
- [Generate] correct encoder_outputs are passed without attention_mask (#14980) · c043ce6c
  Patrick von Platen authored Dec 30, 2021
```
* [Generate] correct encoder_outputs are passed without attention_mask

* Apply suggestions from code review

* up
```
  c043ce6c
23 Dec, 2021 1 commit
- [Generate] Remove attention_mask and integrate model_main_input_name (#14856) · fe4197ab
  Patrick von Platen authored Dec 23, 2021
```
* up

* save

* correct

* up

* correct more

* up

* up

* up

* up

* up

* correct

* fix tf

* fix

* remove tokenizer
```
  fe4197ab
21 Dec, 2021 1 commit

Add custom `stopping_criteria` and `logits_processor` to `generate` (#14779) · 5722d058

Leandro von Werra authored Dec 21, 2021



* add custom `stopping_criteria` and `logits_processor` to `generate`

* add tests for custom `stopping_criteria` and `logits_processor`

* fix typo in RAG

* address reviewer comments

* improve custom logits processor/stopping criteria error message

* fix types in merge function signature

* change default for custom list from `None` to empty list

* fix rag generate

* add string split suggestion
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

5722d058

17 Dec, 2021 2 commits

[ImageGPT] Deprecate pixel_values input name to input_ids (#14801) · 84ea427f

Patrick von Platen authored Dec 17, 2021



* [ImageGPT] Deprecate pixel_values input name to input_ids

* up

* Apply suggestions from code review
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* correct

* finish
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

84ea427f

[Generate] Correct input_ids detection (#14815) · 95119ad7
Patrick von Platen authored Dec 17, 2021
```
* [Generate] Correct input_ids detection

* correct
```
95119ad7

16 Dec, 2021 1 commit

[Generate] Make generate multi-modal (#14784) · b18d8534

Patrick von Platen authored Dec 16, 2021



* finish refactor

* refactor

* add tests

* add more tests

* up

* finish tests

* finish

* up

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* improve docstring

* fix docs
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

b18d8534

19 Nov, 2021 1 commit
- [Generation] Allow `inputs_embeds` as an input (#14443) · f25a9332
  Patrick von Platen authored Nov 19, 2021
```
* up

* finalize

* finalize

* finish

* Update src/transformers/generation_utils.py

* apply feedback
```
  f25a9332
08 Oct, 2021 1 commit
- [Generation] Fix max_new_tokens (#13919) · c8b07612
  Patrick von Platen authored Oct 08, 2021
```
* up

* Update src/transformers/generation_stopping_criteria.py

* finish
```
  c8b07612
11 Jun, 2021 1 commit
- Fix head masking generate tests (#12110) · e47765d8
  Patrick von Platen authored Jun 11, 2021
```
* fix_torch_device_generate_test

* remove @

* fix tests
```
  e47765d8
10 Jun, 2021 1 commit

Fix a condition in test_generate_with_head_masking (#11911) · 0eaeae2e

Daniel Stancl authored Jun 10, 2021

* Fix a condition in test_generate_with_head_masking

* Fix usage of head_mask in bigbirg_pegasus

* Fix head masking for speech2text

* Resolve copy mismatch + drop unwanted print statement

* Fix the condition

0eaeae2e

27 May, 2021 1 commit

Adding new argument `max_new_tokens` for generate. (#11476) · 80d712fa

Nicolas Patry authored May 27, 2021

* Adding new argument `max_new_tokens` for generate.

This is a proposal to add a new argument `max_new_tokens` to `generate`.
This include a `MaxNewTokensCriteria` that enables callers that don't
know about the token length ahead (like pipelines callers) to manage
more easily the length of their generated output.

* Adding a test for the user warning when both`max_length` and
`max_new_tokens` are used together.

* Removed redundant `no_grad`.

80d712fa