Commits · dcc49d8a7ef91c5e1baeb4d510ec4f37bc259760 · chenpangpang / transformers

11 Oct, 2023 1 commit

In assisted decoding, pass model_kwargs to model's forward call (fix... · dcc49d8a

Billy Bradley authored Oct 11, 2023

In assisted decoding, pass model_kwargs to model's forward call (fix prepare_inputs_for_generation in all models) (#25242)

* In assisted decoding, pass model_kwargs to model's forward call

Previously, assisted decoding ignored any additional kwargs
that it did not explicitly handle. This was inconsistent with other
generation methods, which pass the model_kwargs through
prepare_inputs_for_generation and forward the returned dict to the
model's forward call.

The prepare_inputs_for_generation method needs to be amended in all
models, as it previously kept only the last input ID when
past_key_values was passed.

* Improve variable names in _extend_attention_mask

* Refactor extending token_type_ids into a function

* Replace deepcopy with copy to optimize performance

* Update new persimmon model with llama changes for assisted generation

* Update new mistral model for assisted generation with prepare_inputs_for_generation

* Update position_ids creation in falcon prepare_inputs_for_generation to support assisted generation

dcc49d8a
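
The behaviour described in this commit message can be illustrated with a minimal sketch. It is not the code from the PR: it uses plain Python lists instead of tensors, and the names `trim_to_uncached`, `prepare_inputs`, and `past_length` are hypothetical. The assumption is that, when the target model has to score several candidate tokens in a single forward pass, the input preparation step must keep every token the cache does not cover yet rather than only the last one, and must forward any extra kwargs unchanged.

```python
def trim_to_uncached(input_ids, past_length):
    """Return the part of input_ids that the key/value cache does not cover.

    input_ids   -- list of token IDs for one sequence (stand-in for a tensor row)
    past_length -- number of tokens already stored in the cache, or None
    """
    if past_length is None:
        # No cache yet: the model needs the full sequence.
        return input_ids
    if len(input_ids) > past_length:
        # Keep every token after the cached prefix (may be more than one).
        return input_ids[past_length:]
    # Caller already trimmed the input: fall back to keeping the last ID only.
    return input_ids[-1:]


def prepare_inputs(input_ids, past_length=None, **model_kwargs):
    """Hypothetical stand-in for a model's input preparation step."""
    inputs = {"input_ids": trim_to_uncached(input_ids, past_length)}
    # Forward any additional kwargs instead of silently dropping them.
    inputs.update(model_kwargs)
    return inputs


# Three tokens are cached and two candidate tokens still need to be scored.
print(prepare_inputs([11, 12, 13, 14, 15], past_length=3, use_cache=True))
```

Keeping only the last ID would be equivalent to `input_ids[-1:]` in every cached call, which drops the first candidate token in the example above.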

14 Sep, 2023 1 commit

Fix beam search when using model parallel (#24969) · 8881f38a

Dong-Yong Lee authored Sep 15, 2023



* Fix GPTNeoX beam search when using parallelize

* Fix beam search idx device when using model parallel

* remove onnx related stuff
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* fix: move test_beam_search_on_multi_gpu to GenerationTesterMixin

* fix: add right item to _no_split_modules of MegaPreTrainedModel

* fix: add num_beams within parallelized beam_search test
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

8881f38a

12 Sep, 2023 2 commits

Fix ExponentialDecayLengthPenalty negative logits issue (#25594) · 6acc27ee

pokjay authored Sep 12, 2023



* Fix issues in test_exponential_decay_length_penalty

Fix tests which were broken and add validation of negative scores.

The current test didn't take into account that ExponentialDecayLengthPenalty updates the scores in place, which also modified the base Tensor under test.

In addition, the gt assert compared empty Tensors due to indexing along the batch dimension.

The test is currently expected to fail, to expose the ExponentialDecayLengthPenalty issue with negative scores.

* Fix ExponentialDecayLengthPenalty negative logits issue

When the scores are negative, ExponentialDecayLengthPenalty decreases the score of eos_token_id instead of increasing it.
To fix this, we compute the penalty from the absolute value of the score and add it to the original score.

* Add examples for ExponentialDecayLengthPenalty

* Fix styling issue in ExponentialDecayLengthPenalty doc

* Apply suggestions from code review
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Style and quality fix

* Fix example outputs

---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

6acc27ee
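
A small arithmetic sketch of the fix described above, under stated assumptions: the pre-fix behaviour is assumed to be a plain multiplication of the EOS score by `decay_factor ** steps`, and the function and parameter names are hypothetical. Scalars are used instead of tensors to keep the example self-contained.

```python
def multiplicative_penalty(eos_score, decay_factor, steps):
    """Assumed pre-fix form: scale the EOS score directly.

    With a negative score and decay_factor > 1 this makes the score smaller.
    """
    return eos_score * decay_factor ** steps


def additive_abs_penalty(eos_score, decay_factor, steps):
    """Form described in the commit message: penalty from |score|, then added."""
    penalty = abs(eos_score) * (decay_factor ** steps - 1)
    return eos_score + penalty


# Negative score: the multiplicative form lowers it, the additive form raises it.
print(multiplicative_penalty(-2.0, 1.5, 2))  # -4.5
print(additive_abs_penalty(-2.0, 1.5, 2))    # 0.5

# Positive score: both forms agree.
print(multiplicative_penalty(2.0, 1.5, 2))   # 4.5
print(additive_abs_penalty(2.0, 1.5, 2))     # 4.5
```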

Generate: legacy mode is only triggered when `generation_config` is untouched (#25962) · 3319eb54
Joao Gante authored Sep 12, 2023

3319eb54

23 Aug, 2023 1 commit
- Generate: general test for decoder-only generation from `inputs_embeds` (#25687) · 3c2383b1
  Joao Gante authored Aug 23, 2023
```
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
```
  3c2383b1
16 Aug, 2023 1 commit
- Generate: fix default max length warning (#25539) · 3f9cb335
  Joao Gante authored Aug 16, 2023
  
  3f9cb335
10 Aug, 2023 1 commit
- Generation: strict generation config validation at save time (#25411) · 123ad536
  Joao Gante authored Aug 10, 2023
```
* strict gen config save; Add tests

* add note that the warning will be an exception in v4.34
```
  123ad536
09 Aug, 2023 1 commit

aligned sample_beam output selection with beam_search (#25375) · cb3c821c

hukuda222 authored Aug 10, 2023



* aligned sample_beam specs with beam_search

* pull origin main

* Revert "pull origin main"

This reverts commit 06d356f1137bb52272e120a03636598c44449cf3.

* update test_utils.py

* fix format

* remove comment

---------
Co-authored-by: Shogo Fujita <shogo.fujita@legalontech.jp>

cb3c821c

06 Aug, 2023 1 commit
- add CFG for .generate() (#24654) · d5334651
  Guillaume "Vermeille" Sanchez authored Aug 06, 2023
  
  d5334651
26 Jul, 2023 1 commit
- update `use_auth_token` -> `token` (#25083) · 224da5df
  Yih-Dar authored Jul 26, 2023
```
* update

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  224da5df
20 Jul, 2023 2 commits
- Contrastive Search peak memory reduction (#24120) · caf5e369
  Benjamin Badger authored Jul 20, 2023
```
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
```
  caf5e369
- Generate: sequence bias can handle same terminations (#24822) · 89136ff7
  Joao Gante authored Jul 20, 2023
  
  89136ff7
27 Jun, 2023 2 commits

Fix TypeError: Object of type int64 is not JSON serializable (#24340) · 239ace15

Xiaoli Wang authored Jun 27, 2023

* Fix TypeError: Object of type int64 is not JSON serializable

* Convert numpy.float64 and numpy.int64 to float and int for json serialization

* Black reformatted examples/pytorch/token-classification/run_ner_no_trainer.py

* make style

239ace15
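
The conversion mentioned in this commit can be sketched as follows. This is an illustration rather than the code from the PR: `to_builtin` and `dump_metrics` are hypothetical names, and the sketch relies only on the `.item()` method that NumPy scalars provide, so it does not import NumPy itself.

```python
import json


def to_builtin(value):
    """Convert a NumPy-style scalar to a built-in Python number.

    Any object exposing a callable `.item()` (as numpy.int64 and numpy.float64
    do) is converted; everything else is returned unchanged. Multi-element
    arrays are out of scope for this sketch.
    """
    item = getattr(value, "item", None)
    return item() if callable(item) else value


def dump_metrics(metrics):
    """Serialize a flat metrics dict after converting its values."""
    return json.dumps({name: to_builtin(value) for name, value in metrics.items()})


print(dump_metrics({"accuracy": 0.5, "steps": 10}))
```

With NumPy installed, a value such as `numpy.int64(3)` would pass through `.item()` and reach `json.dumps` as a plain `int`.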

Generate: `group_beam_search` requires `diversity_penalty>0.0` (#24456) · 5f3efdf7
Joao Gante authored Jun 27, 2023
```
* add exception

* update docs
```
5f3efdf7
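
A minimal sketch of the kind of check this commit title describes. The function name and error message are hypothetical and are not taken from the library; the only assumption carried over from the title is that group beam search is rejected unless `diversity_penalty` is greater than 0.0.

```python
def validate_group_beam_search_args(num_beam_groups, diversity_penalty):
    """Raise if group beam search is requested without a positive diversity penalty."""
    if num_beam_groups > 1 and diversity_penalty <= 0.0:
        raise ValueError(
            "group beam search (num_beam_groups > 1) requires diversity_penalty > 0.0, "
            f"got diversity_penalty={diversity_penalty}"
        )


# Accepted: groups with a positive penalty, or no grouping at all.
validate_group_beam_search_args(num_beam_groups=2, diversity_penalty=0.5)
validate_group_beam_search_args(num_beam_groups=1, diversity_penalty=0.0)
```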

23 Jun, 2023 1 commit

Replace python random with torch.rand to enable dynamo.export (#24434) · a28325e2

Bowen Bao authored Jun 23, 2023

* Replace python random with torch.rand to enable dynamo.export

* revert changes to flax model code

* Remove unused random import

* Fix torch template

* Move torch.manual_seed(0) to right location

a28325e2

21 Jun, 2023 1 commit
- Generate: add SequenceBiasLogitsProcessor (#24334) · 5f0801d1
  Joao Gante authored Jun 21, 2023
  
  5f0801d1
13 Jun, 2023 1 commit
- Generate: GenerationConfig can overwrite attributes at from_pretrained time (#24238) · b1ea6b4b
  Joao Gante authored Jun 13, 2023
```
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
```
  b1ea6b4b
07 Jun, 2023 1 commit
- Generate: increase left-padding test atol (#23448) · 612b2a1a
  Joao Gante authored Jun 07, 2023
```
increase atol
```
  612b2a1a
24 May, 2023 1 commit

Better TF docstring types (#23477) · f8b25744

Matt authored May 24, 2023

* Rework TF type hints to use | None instead of Optional[] for tf.Tensor

* Rework TF type hints to use | None instead of Optional[] for tf.Tensor

* Don't forget the imports

* Add the imports to tests too

* make fixup

* Refactor tests that depended on get_type_hints

* Better test refactor

* Fix an old hidden bug in the test_keras_fit input creation code

* Fix for the Deit tests

f8b25744

18 May, 2023 2 commits
- Less flaky `test_assisted_decoding_matches_greedy_search` (#23451) · 2406dbdc
  Yih-Dar authored May 18, 2023
```
* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  2406dbdc
- Generate: skip left-padding tests on old models (#23437) · aea7b23b
  Joao Gante authored May 18, 2023
  
  aea7b23b
16 May, 2023 1 commit
- Generate: add test to check KV format (#23403) · 918a06e2
  Joao Gante authored May 16, 2023
```
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
```
  918a06e2
08 May, 2023 1 commit
- Generate: starcoder 🤜 🤛 assisted generation (#23182) · bbfb9fc2
  Joao Gante authored May 08, 2023
```
* starcoder has joined the chat

* indexing that works for all
```
  bbfb9fc2
03 May, 2023 2 commits
- Add support for beam search's num_return_sequences flag in flax (#23082) · c4e32e20
  Mayank Agarwal authored May 03, 2023
```
* add code for numReturnSeq

* add flax support for num return sequences

* Make Fix up for changes

* add test for num return sequences

* lint
```
  c4e32e20
- Generate: slow assisted generation test (#23125) · ce31e3c8
  Joao Gante authored May 03, 2023
  
  ce31e3c8
29 Apr, 2023 1 commit
- Generate: prepare assisted generation for release (#23052) · 849367cc
  Joao Gante authored Apr 29, 2023
  
  849367cc
24 Apr, 2023 1 commit
- Generate: assisted generation with sample (take 2) (#22949) · e4a97f82
  Joao Gante authored Apr 24, 2023
```
* temperature controls speed
```
  e4a97f82
18 Apr, 2023 2 commits

Generate: Add assisted generation (#22211) · 78cda46f

Joao Gante authored Apr 18, 2023

* working mvp

* remove breakpoint

* fix commit

* standardize outputs

* tmp commit

* tests almost ready

* tmp commit

* skip a few models

* Add streaming; Docs and examples

* document limitations

* PR commits

* Amy PR comments

78cda46f

Fix `test_eos_token_id_int_and_list_top_k_top_sampling` (#22826) · 90247d3e
Yih-Dar authored Apr 18, 2023
```
* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
90247d3e

13 Apr, 2023 1 commit
- Generate: handle text conditioning with multimodal encoder-decoder models (#22748) · 9dfd6a4b
  Joao Gante authored Apr 13, 2023
  
  9dfd6a4b
05 Apr, 2023 1 commit
- Generate: `TextIteratorStreamer` timeout (#22576) · 861ff890
  Joao Gante authored Apr 05, 2023
  
  861ff890
04 Apr, 2023 1 commit
- Generate: Add text streamer decoding options (#22544) · 1905384f
  Joao Gante authored Apr 04, 2023
  
  1905384f
03 Apr, 2023 1 commit
- Generate: `TextIteratorStreamer` (streamer for gradio) (#22501) · a55a822a
  Joao Gante authored Apr 03, 2023
```
* haha text go brrr (but in gradio)
```
  a55a822a
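
The iterator-style streaming named in this commit title can be illustrated with a queue-backed sketch. This is not the library's `TextIteratorStreamer`: the class `QueueIteratorStreamer`, its `put`/`end` methods, and `fake_generate` are hypothetical stand-ins. The assumption is that generation runs in a background thread and pushes text chunks, while the consumer (for example a UI loop) iterates over them, optionally with a timeout.

```python
import queue
import threading


class QueueIteratorStreamer:
    """Hypothetical queue-backed streamer that can be consumed as an iterator."""

    _STOP = object()  # sentinel marking the end of the stream

    def __init__(self, timeout=None):
        self._queue = queue.Queue()
        self._timeout = timeout

    def put(self, text):
        """Called by the producer for each new chunk of text."""
        self._queue.put(text)

    def end(self):
        """Called by the producer once generation is finished."""
        self._queue.put(self._STOP)

    def __iter__(self):
        return self

    def __next__(self):
        # Raises queue.Empty if no chunk arrives within the timeout.
        item = self._queue.get(timeout=self._timeout)
        if item is self._STOP:
            raise StopIteration
        return item


def fake_generate(streamer, chunks):
    """Stand-in for a generation call that emits text chunk by chunk."""
    for chunk in chunks:
        streamer.put(chunk)
    streamer.end()


streamer = QueueIteratorStreamer(timeout=5.0)
worker = threading.Thread(target=fake_generate, args=(streamer, ["Hello", ", ", "world"]))
worker.start()
text = "".join(streamer)  # consume chunks as they arrive
worker.join()
print(text)
```

The timeout keeps a consumer from blocking forever if the producer thread fails before calling `end()`.
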
30 Mar, 2023 1 commit
- Generate: basic token streaming (#22449) · 228792a9
  Joao Gante authored Mar 30, 2023
```
* haha tokens go brrrr
```
  228792a9
23 Mar, 2023 1 commit
- Generate: add test for left-padding support (#22322) · 502fec77
  Joao Gante authored Mar 23, 2023
  
  502fec77
22 Mar, 2023 2 commits
- Beef up Llama tests (#22314) · fd3eb3e3
  Joao Gante authored Mar 22, 2023
```
* tmp commit

* beef up llama tests
```
  fd3eb3e3
- Generate: Export TF generate with a TF tokenizer (#22310) · 12febc20
  Joao Gante authored Mar 22, 2023
```
* Export TF generate with a TF tokenizer

* remove unused lines
```
  12febc20
21 Mar, 2023 1 commit

Time to Say Goodbye, torch 1.7 and 1.8 (#22291) · 67c2dbdb

Yih-Dar authored Mar 21, 2023



* time to say goodbye, torch 1.7 and 1.8

* clean up torch_int_div

* clean up is_torch_less_than_1_8-9

* update

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

67c2dbdb

16 Mar, 2023 1 commit

py38 + torch 2 (#22204) · 5110e574

Yih-Dar authored Mar 16, 2023



* py38 + torch 2

* increment cache versions

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

5110e574

09 Mar, 2023 1 commit

Remove set_access_token usage + fail tests if FutureWarning (#22051) · 923110b7

Lucain authored Mar 09, 2023



* Remove set_access_token usage + fail tests if FutureWarning

* do not fail on FutureWarning in CI

---------
Co-authored-by: testbot <lucainp@hf.co>

923110b7