Commits · 938cb04789afe44169fba3866bfc1d4a3eacd8ee · chenpangpang / transformers

"vscode:/vscode.git/clone" did not exist on "ada0add89be0da299be3a59e52071be4ea6669e8"

14 Nov, 2022 1 commit
- Generate: add Bloom fixes for contrastive search (#20213) · 938cb047
  Joao Gante authored Nov 14, 2022
  
  938cb047
09 Nov, 2022 1 commit

Generate: move generation_*.py src files into generation/*.py (#20096) · f270b960

Joao Gante authored Nov 09, 2022

* move generation_*.py src files into generation/*.py

* populate generation.__init__ with lazy loading

* move imports and references from generation.xxx.object to generation.object

f270b960

01 Nov, 2022 1 commit

Generate: contrastive search with full optional outputs (#19963) · 831590f6

Joao Gante authored Nov 01, 2022

* Use beam search functionality; Add extra outputs and test

* Add full tests for contrastive search

* Add error message on unconventional cache format

831590f6

21 Oct, 2022 1 commit
- Generate: contrastive search test updates (#19787) · e0b825a8
  Joao Gante authored Oct 21, 2022
```
* contrastive search test updates

* make fixup
```
  e0b825a8
19 Oct, 2022 1 commit

Adding the state-of-the-art contrastive search decoding methods for the... · 71786b10

GMFTBY authored Oct 19, 2022

Adding the state-of-the-art contrastive search decoding methods for the codebase of generation_utils.py (#19477)

* add: the contrastive search for generaton_utils

* add: testing scripts for contrastive search under examples/text-generation

* update the quality of codes

* revise the docstring; make the generation_contrastive_search.py scripts;

* revise the examples/pytorch/text-generation/run_generation_contrastive_search.py to the auto-APIs format

* revise the necessary documents

* fix: revise the docstring of generation_contrastive_search.py

* Fix the code indentation

* fix: revise the nits and examples in contrastive_search docstring.

* fix the copyright

* delete generation_contrastive_search.py

* revise the logic in contrastive_search

* update the intergration test and the docstring

* run the tests over

* add the slow decorate to the contrastive_search intergrate test

* add more test

* do the style, quality, consistency checks

71786b10

30 Sep, 2022 1 commit
- Add stop sequence to text generation pipeline (#18444) · e3963581
  Karim Foda authored Sep 30, 2022
  
  e3963581
02 Sep, 2022 1 commit
- Generate: validate `model_kwargs` on TF (and catch typos in generate arguments) (#18651) · 9196f48b
  Joao Gante authored Sep 02, 2022
  
  9196f48b
19 Aug, 2022 1 commit
- Generate: add missing `**model_kwargs` in sample tests (#18696) · e95d433d
  Joao Gante authored Aug 19, 2022
  
  e95d433d
12 Aug, 2022 1 commit
- Generate: validate `model_kwargs` (and catch typos in generate arguments) (#18261) · ed1924e8
  Joao Gante authored Aug 12, 2022
```
* validate generate model_kwargs

* generate tests -- not all models have an attn mask
```
  ed1924e8
23 Jul, 2022 1 commit
- Generate: deprecate default `max_length` (#18018) · 7e44226f
  Joao Gante authored Jul 23, 2022
  
  7e44226f
28 Jun, 2022 1 commit
- Update expected values in constrained beam search tests (#17887) · 0b0dd977
  Yih-Dar authored Jun 28, 2022
```
* fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  0b0dd977
21 Jun, 2022 1 commit

Fix `top_k_top_p_filtering` having unexpected behavior (#17744) · 3b00b623

unifyh authored Jun 22, 2022

- Fix `top_k_top_p_filtering` not passing `filter_value` to
   `TopPLogitsWarper` causing any top-p filtered logits to be -inf
   instead of specified value

 - Add corresponding test

3b00b623

10 Jun, 2022 1 commit
- [Generation Test] Make fast test actually fast (#17661) · 13e875cc
  Patrick von Platen authored Jun 10, 2022
  
  13e875cc
19 May, 2022 1 commit
- [Generation] Fix Transition probs (#17311) · 518bd02c
  Patrick von Platen authored May 19, 2022
```
* [Draft] fix transition probs

* up

* up

* up

* make it work

* fix

* finish

* update
```
  518bd02c
12 May, 2022 1 commit

Black preview (#17217) · afe5d42d

Sylvain Gugger authored May 12, 2022

* Black preview

* Fixup too!

* Fix check copies

* Use the same version as the CI

* Bump black

afe5d42d

11 Apr, 2022 1 commit
- Generate: min length can't be larger than max length (#16668) · b0bf3011
  Joao Gante authored Apr 11, 2022
```
* min length must be smaller than max length

* Update min_length in tests
```
  b0bf3011
16 Mar, 2022 1 commit
- Fix generation min length (#16206) · 2410d0f8
  Patrick von Platen authored Mar 16, 2022
```
* up

* fix min lengths
```
  2410d0f8
07 Mar, 2022 1 commit

[Bug Fix] Beam search example in docs fails & a fix (integrating `max_length`... · ef9c3ca3

Chan Woo Kim authored Mar 07, 2022

[Bug Fix] Beam search example in docs fails & a fix (integrating `max_length` in `BeamScorer.finalize()`) (#15555)

* added the test and fix

* had left out a comment

ef9c3ca3

04 Mar, 2022 1 commit

Constrained Beam Search [*With* Disjunctive Decoding] (#15761) · 5c6f57ee

Chan Woo Kim authored Mar 05, 2022



* added classes to get started with constrained beam search

* in progress, think i can directly force tokens now but not yet with the round robin

* think now i have total control, now need to code the bank selection

* technically works as desired, need to optimize and fix design choices leading to undersirable outputs

* complete PR #1 without disjunctive decoding

* removed incorrect tests

* Delete k.txt

* Delete test.py

* Delete test.sh

* revert changes to test scripts

* genutils

* full implementation with testing, no disjunctive yet

* shifted docs

* passing all tests realistically ran locally

* removing accidentally included print statements

* fixed source of error in initial PR test

* fixing the get_device() vs device trap

* fixed documentation docstrings about constrained_beam_search

* fixed tests having failing for Speech2TextModel's floating point inputs

* fix cuda long tensor

* added examples and testing for them and founx & fixed a bug in beam_search and constrained_beam_search

* deleted accidentally added test halting code with assert False

* code reformat

* Update tests/test_generation_utils.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update tests/test_generation_utils.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update tests/test_generation_utils.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update tests/test_generation_utils.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update tests/test_generation_utils.py

* fixing based on comments on PR

* took out the testing code that should but work fails without the beam search moditification ; style changes

* fixing comments issues

* docstrings for ConstraintListState

* typo in PhrsalConstraint docstring

* docstrings improvements

* finished adding what is sort of an opinionated implementation of disjunctive generation, but it revealed errors in inner beam search logic during testing.

* fixed bug found in constrained beam search that used beam_idx that were not global across all the batches

* disjunctive constraint working 100% correctly

* passing all tests

* Accidentally included mlruns

* Update src/transformers/generation_beam_constraints.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update src/transformers/generation_beam_constraints.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* complete overhaul of type complexities and other nits

* strict type checks in generate()

* fixing second round of feedback by narsil

* fixed failing generation test because of type check overhaul

* generation test fail fix

* fixing test fails
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

5c6f57ee

23 Feb, 2022 1 commit

[Test refactor 1/5] Per-folder tests reorganization (#15725) · 29c10a41

Lysandre Debut authored Feb 23, 2022



* Per-folder tests reorganization
Co-authored-by: sgugger <sylvain.gugger@gmail.com>
Co-authored-by: Stas Bekman <stas@stason.org>

29c10a41

09 Feb, 2022 1 commit

Constrained Beam Search [without disjunctive decoding] (#15416) · 2b5603f6

Chan Woo Kim authored Feb 10, 2022



* added classes to get started with constrained beam search

* in progress, think i can directly force tokens now but not yet with the round robin

* think now i have total control, now need to code the bank selection

* technically works as desired, need to optimize and fix design choices leading to undersirable outputs

* complete PR #1 without disjunctive decoding

* removed incorrect tests

* Delete k.txt

* Delete test.py

* Delete test.sh

* revert changes to test scripts

* genutils

* full implementation with testing, no disjunctive yet

* shifted docs

* passing all tests realistically ran locally

* removing accidentally included print statements

* fixed source of error in initial PR test

* fixing the get_device() vs device trap

* fixed documentation docstrings about constrained_beam_search

* fixed tests having failing for Speech2TextModel's floating point inputs

* fix cuda long tensor

* added examples and testing for them and founx & fixed a bug in beam_search and constrained_beam_search

* deleted accidentally added test halting code with assert False

* code reformat

* Update tests/test_generation_utils.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update tests/test_generation_utils.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update tests/test_generation_utils.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update tests/test_generation_utils.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update tests/test_generation_utils.py

* fixing based on comments on PR

* took out the testing code that should but work fails without the beam search moditification ; style changes

* fixing comments issues

* docstrings for ConstraintListState

* typo in PhrsalConstraint docstring

* docstrings improvements
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

2b5603f6

24 Jan, 2022 1 commit

[Beam Search] Correct returned beam scores (#14654) · 8d6acc6c

Patrick von Platen authored Jan 24, 2022

* better

* save intermediate

* finish code

* up

* docs

* Apply suggestions from code review

* up

* add compute transition  beam scores function to model and make sure scores are correct with eos

* apply nicos comments

* Apply suggestions from code review

* another fix

8d6acc6c

30 Dec, 2021 1 commit
- [Generate] correct encoder_outputs are passed without attention_mask (#14980) · c043ce6c
  Patrick von Platen authored Dec 30, 2021
```
* [Generate] correct encoder_outputs are passed without attention_mask

* Apply suggestions from code review

* up
```
  c043ce6c
23 Dec, 2021 1 commit
- [Generate] Remove attention_mask and integrate model_main_input_name (#14856) · fe4197ab
  Patrick von Platen authored Dec 23, 2021
```
* up

* save

* correct

* up

* correct more

* up

* up

* up

* up

* up

* correct

* fix tf

* fix

* remove tokenizer
```
  fe4197ab
21 Dec, 2021 1 commit

Add custom `stopping_criteria` and `logits_processor` to `generate` (#14779) · 5722d058

Leandro von Werra authored Dec 21, 2021



* add custom `stopping_criteria` and `logits_processor` to `generate`

* add tests for custom `stopping_criteria` and `logits_processor`

* fix typo in RAG

* address reviewer comments

* improve custom logits processor/stopping criteria error message

* fix types in merge function signature

* change default for custom list from `None` to empty list

* fix rag generate

* add string split suggestion
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

5722d058

17 Dec, 2021 2 commits

[ImageGPT] Deprecate pixel_values input name to input_ids (#14801) · 84ea427f

Patrick von Platen authored Dec 17, 2021



* [ImageGPT] Deprecate pixel_values input name to input_ids

* up

* Apply suggestions from code review
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* correct

* finish
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

84ea427f

[Generate] Correct input_ids detection (#14815) · 95119ad7
Patrick von Platen authored Dec 17, 2021
```
* [Generate] Correct input_ids detection

* correct
```
95119ad7

16 Dec, 2021 1 commit

[Generate] Make generate multi-modal (#14784) · b18d8534

Patrick von Platen authored Dec 16, 2021



* finish refactor

* refactor

* add tests

* add more tests

* up

* finish tests

* finish

* up

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* improve docstring

* fix docs
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

b18d8534

19 Nov, 2021 1 commit
- [Generation] Allow `inputs_embeds` as an input (#14443) · f25a9332
  Patrick von Platen authored Nov 19, 2021
```
* up

* finalize

* finalize

* finish

* Update src/transformers/generation_utils.py

* apply feedback
```
  f25a9332
08 Oct, 2021 1 commit
- [Generation] Fix max_new_tokens (#13919) · c8b07612
  Patrick von Platen authored Oct 08, 2021
```
* up

* Update src/transformers/generation_stopping_criteria.py

* finish
```
  c8b07612
11 Jun, 2021 1 commit
- Fix head masking generate tests (#12110) · e47765d8
  Patrick von Platen authored Jun 11, 2021
```
* fix_torch_device_generate_test

* remove @

* fix tests
```
  e47765d8
10 Jun, 2021 1 commit

Fix a condition in test_generate_with_head_masking (#11911) · 0eaeae2e

Daniel Stancl authored Jun 10, 2021

* Fix a condition in test_generate_with_head_masking

* Fix usage of head_mask in bigbirg_pegasus

* Fix head masking for speech2text

* Resolve copy mismatch + drop unwanted print statement

* Fix the condition

0eaeae2e

27 May, 2021 1 commit

Adding new argument `max_new_tokens` for generate. (#11476) · 80d712fa

Nicolas Patry authored May 27, 2021

* Adding new argument `max_new_tokens` for generate.

This is a proposal to add a new argument `max_new_tokens` to `generate`.
This include a `MaxNewTokensCriteria` that enables callers that don't
know about the token length ahead (like pipelines callers) to manage
more easily the length of their generated output.

* Adding a test for the user warning when both`max_length` and
`max_new_tokens` are used together.

* Removed redundant `no_grad`.

80d712fa

19 May, 2021 1 commit
- [T5 failing CI] Fix generate test (#11770) · 43891be1
  Patrick von Platen authored May 19, 2021
```
* fix_torch_device_generate_test

* remove @
```
  43891be1
18 May, 2021 1 commit

Fix usage of head masks by PT encoder-decoder models' `generate()` function (#11621) · 680d181c

Daniel Stancl authored May 19, 2021

* Add missing head masking for generate() function

* Add head_mask, decoder_head_mask and cross_attn_head_mask
into prepare_inputs_for_generation for generate() function
for multiple encoder-decoder models.

* Add test_genereate_with_head_masking

* [WIP] Update the new test and handle special cases

* make style

* Omit ProphetNet test so far

* make fix-copies

680d181c

07 May, 2021 1 commit

Add BigBirdPegasus (#10991) · dc3f6758

Vasudev Gupta authored May 07, 2021



* init bigbird pegasus

* add debugging nb ; update config

* init conversion

* update conversion script

* complete conversion script

* init forward()

* complete forward()

* add tokenizer

* add some slow tests

* commit current

* fix copies

* add docs

* add conversion script for bigbird-roberta-summarization

* remove TODO

* small fixups

* correct tokenizer

* add bigbird core for now

* fix config

* fix more

* revert pegasus-tokenizer back

* make style

* everything working for pubmed; yayygit status

* complete tests finally

* remove bigbird pegasus tok

* correct tokenizer

* correct tests

* add tokenizer files

* finish make style

* fix test

* update

* make style

* fix tok utils base file

* make fix-copies

* clean a bit

* small update

* fix some suggestions

* add to readme

* fix a bit, clean tests

* fix more tests

* Update src/transformers/__init__.py

* Update src/transformers/__init__.py

* make fix-copies

* complete attn switching, auto-padding left

* make style

* fix auto-padding test

* make style

* fix batched attention tests

* put tolerance at 1e-1 for stand-alone decoder test

* fix docs

* fix tests

* correct slow tokenizer conversion

* Apply suggestions from code review
Co-authored-by: Suraj Patil <surajp815@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* complete remaining suggestions

* fix test
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Suraj Patil <surajp815@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

dc3f6758

26 Apr, 2021 1 commit

Remove max length beam scorer (#11378) · 741d48f5

Ashwin Geet D'Sa authored Apr 27, 2021



* removed max_len

* removed max_length from BeamSearchScorer

* correct max length

* finish

* del vim

* finish & add test
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

741d48f5

21 Apr, 2021 1 commit

Removed `max_length` from being mandatory within `generate`. (#11314) · aad95c7c

Nicolas Patry authored Apr 21, 2021

* Removed `max_length` from being mandatory within `generate`.

- Moving on to fully using `StoppingCriteria` for `greedy` and `sample`
modes.
- `max_length` still used for `beam_search` and `group_beam_search`
(Follow up PR)
- Fixes a bug with MaxLengthStoppingCriteria (we should stop as soon a
we hit the max_length, the comparison needs to be or equal, that affects
the tests).
- Added options to use `logits_processor` and `stopping_criteria`
directly within `generate` function (so some users can define their own
`logits_processor` and `stopping_criteria`).
- Modified the backward compat tests to make sure we issue a warning.

* Fix `max_length` argument in `generate`.

* Moving validate to being functional.

- Renamed `smax_length` to `stoppping_max_length`.

* Removing `logits_processor` and `stopping_criteria` from `generate`
arguments.

* Deepcopy.

* Fix global variable name.

aad95c7c

22 Mar, 2021 1 commit
- [Generate] Add save mode logits processor to remove nans and infs if necessary (#10769) · 77bf3fe7
  Patrick von Platen authored Mar 23, 2021
```
* push

* finish

* finish

* make fix copies

* change name
```
  77bf3fe7
12 Mar, 2021 1 commit

Adding new parameter to `generate`: `max_time`. (#9846) · 543d0549

Nicolas Patry authored Mar 12, 2021

* [WIP] Adding new parameter to `generate`:  `max_time`.

Generation by tokens number is sometimes a bit clunky because we don't
know how many tokens are good enough or even how many tokens are in
the payload (for pipelines users for instance). This leads to hard
to understand behavior.

This PR proposes a new argument `max_time` which is a float of seconds
for the allowed time for `generate` to run on.
Ideally combinations of `max_tokens=None`, `max_time=2` could be used to
generate as many tokens as possible within time budget.

NB: Another possible approach consists of passing a callback to `generate`
  putting the caller in charge of the actual decision of when to stop
  generating tokens. It opens the door to 'which args should we pass'
  to this callback. It's hard to imagine other use-cases for this
  early stopping behavior than time (that are not already covered by
  parameters of generate)

* Revamp with StoppingCriteria

* Removing deprecated mentions.

* Forgot arguments to stopping criteria.

* Readding max_length it's not just used as a stopping criteria.

* Default value for `stopping_criteria`.

* Address @patrickvonplaten comments.

- More docstrings
- Actual doc
- Include in global namespace
- Remove TF work.

* Put back `max_length` (deprecation different PR).

* Doc quality.

* Fixing old behavior without `stopping_criteria` but with `max_length`.

Making sure we don't break that in the future.

* Adding more tests for possible inconsistencies between

`max_length` and `stopping_criteria`.

* Fixing the torch imports.

543d0549