- 09 May, 2023 1 commit
Sylvain Gugger authored
* First draft of RWKV-4
* Add support for generate
* Style post-rebase
* Properly use state
* Write doc
* Fix doc
* More math
* Add model to README, dummies and clean config
* Fix init
* Multiple fixes:
  - fix common tests
  - fix configuration default values
  - add CI test for checking state computation
  - fix some CI tests
* correct tokenizer
* some tweaks:
  - fix config docstring
  - fix failing tests
* fix CI tests:
  - add output_attentions / output_hidden_states
  - override test_initialization
  - fix failing CIs
* fix conversion script:
  - fix sharded case
  - add new arguments
* add slow tests + more fixes on conversion script
* add another test
* final fixes
* change single name variable
* add mock attention mask for pipeline to work
* correct eos token id
* fix nits
* add checkpoints
* Apply suggestions from code review
* add `tie_word_embeddings` in docstring
* change tensor name
* fix final nits
* Trigger CI

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
Co-authored-by: younesbelkada <younesbelkada@gmail.com>
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
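RWKV replaces softmax attention with a recurrence over a small running state, which is why this PR stresses "properly use state" and adds a CI test for state computation. As a rough illustration (a heavily simplified, single-channel, numerically naive sketch of the RWKV-4 WKV mixing; the real kernel uses a max-shift trick for stability and operates on whole channels, and all names here are mine, not the model code's):

```python
import math

def wkv_step(state, k_t, v_t, w, u):
    """One step of a simplified, numerically naive RWKV-4 WKV recurrence.

    `state` carries the running (numerator, denominator) of the decayed,
    key-weighted sum over past values; `w` is the channel decay and `u`
    the extra weight given to the current token.
    """
    num, den = state
    cur = math.exp(u + k_t)
    # output mixes the decayed history with the current token
    out = (num + cur * v_t) / (den + cur)
    # fold the current token into the state with one step of decay
    new_num = math.exp(-w) * num + math.exp(k_t) * v_t
    new_den = math.exp(-w) * den + math.exp(k_t)
    return out, (new_num, new_den)
```

Because the state is a fixed-size pair per channel, generation can run step by step without re-attending over the whole prefix, which is what the generate support above exploits.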
-
- 08 May, 2023 1 commit
Joao Gante authored
* starcoder has joined the chat
* indexing that works for all
-
- 04 May, 2023 1 commit
Joao Gante authored
-
- 29 Apr, 2023 1 commit
Joao Gante authored
-
- 24 Apr, 2023 2 commits
Joao Gante authored
* temperature controls speed
-
Joao Gante authored
-
- 20 Apr, 2023 2 commits
Quentin Ambard authored
-
xloem authored
Generation: only check for eos_token if set

The check on unfinished_sequences.max(), which finds sequences that ended early via eos_token_id, creates a synchronization point even when there is no eos_token, which slows inference down. This change moves the calculation inside the condition that checks for eos_token, so the slowdown can be avoided by leaving that token unset.

Co-authored-by: John Doe <john.doe@example.com>
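The control-flow change can be sketched as follows (a simplified stand-in for the real loop, using Python lists instead of tensors; on a GPU, inspecting the per-sequence flags is what forces a host/device synchronization):

```python
def generation_should_stop(unfinished, eos_token_id, cur_len, max_length):
    """Simplified sketch of the stopping check after this change."""
    if cur_len >= max_length:
        return True
    if eos_token_id is None:
        # no EOS token: skip the per-sequence inspection entirely,
        # avoiding the device synchronization it would force
        return False
    # only when an EOS token exists do we look at the flags
    return max(unfinished) == 0
```

With `eos_token_id=None`, the loop now runs to `max_length` without ever touching the flags, which is exactly the fast path this commit enables.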
-
- 18 Apr, 2023 1 commit
Joao Gante authored
* working mvp
* remove breakpoint
* fix commit
* standardize outputs
* tmp commit
* tests almost ready
* tmp commit
* skip a few models
* Add streaming; Docs and examples
* document limitations
* PR commits
* Amy PR comments
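The streaming idea can be sketched with a toy generator (hypothetical names; the real feature wires a streamer object into `generate` so a frontend can display text as it is produced):

```python
def stream_generate(next_token_fn, prompt, max_new_tokens, eos_token_id=None):
    """Yield generated tokens one at a time (toy stand-in for a streamer)."""
    tokens = list(prompt)
    for _ in range(max_new_tokens):
        tok = next_token_fn(tokens)
        tokens.append(tok)
        yield tok  # the caller can print or display this immediately
        if eos_token_id is not None and tok == eos_token_id:
            break
```

The documented limitation follows directly from the shape of this loop: the caller sees tokens in generation order only, so techniques that revise earlier tokens (e.g. beam search) do not stream naturally.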
-
- 13 Apr, 2023 1 commit
Joao Gante authored
-
- 30 Mar, 2023 1 commit
Joao Gante authored
* haha tokens go brrrr
-
- 29 Mar, 2023 1 commit
Younes Belkada authored
* add conditional generation
* add comments
-
- 27 Mar, 2023 1 commit
Charlie-Bell authored
Edited one line in src/transformers/generation/utils.py: changed dist.world_size() to dist.get_world_size(), since world_size() does not exist in torch.distributed.
-
- 22 Mar, 2023 1 commit
Stas Bekman authored
* [deepspeed zero3] need generate(synced_gpus=True, ...)
* fix
* rework per Sylvain's suggestion
* Apply suggestions from code review

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
- 21 Mar, 2023 1 commit
Yih-Dar authored
* time to say goodbye, torch 1.7 and 1.8
* clean up torch_int_div
* clean up is_torch_less_than_1_8-9
* update

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
- 15 Mar, 2023 1 commit
娴簛鐨勫皬铻冭煿 authored
Fix: unfinished_sequences with correct device

The original code caused errors when running torch.jit.trace because the tensor options were incorrect. This change uses torch.ones to create the tensor with the correct device and dtype, which resolves the issue.
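A sketch of the fix (assuming the usual shape of the generation loop; the variable names follow the commit message, not necessarily the exact source):

```python
import torch

input_ids = torch.tensor([[101, 7, 8], [101, 9, 10]])

# 1 = "still generating". Building the flags with torch.ones and an
# explicit dtype plus the same device as input_ids means torch.jit.trace
# sees consistent tensor options instead of a mismatched construction.
unfinished_sequences = torch.ones(
    input_ids.shape[0], dtype=torch.long, device=input_ids.device
)
```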
-
- 14 Mar, 2023 1 commit
Yih-Dar authored
update values

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
- 10 Mar, 2023 1 commit
Joao Gante authored
fix broken links
-
- 23 Feb, 2023 1 commit
Joao Gante authored
-
- 14 Feb, 2023 2 commits
Joao Gante authored
-
Joao Gante authored
-
- 13 Feb, 2023 2 commits
Joao Gante authored
-
Joao Gante authored
-
- 09 Feb, 2023 1 commit
Motoki Wu authored
fix missing unfinished_sequences
-
- 08 Feb, 2023 2 commits
Motoki Wu authored
* add tests with multiple eos_token_ids
* use math.prod instead of sum
* make fixup
* fix long and also use np.prod, since math.prod does not exist for Python < 3.8
* make fixup
* add prod util
* use prod util instead of np.prod
* make fixup
* previous .long location
* use tensor ops
* remove prod
* remove prod
* update device
* make fixup
* fix none
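The convenience being tested here can be sketched as normalizing `eos_token_id` once, so the rest of the loop only ever deals with a list (a simplified stand-in; the function name is mine):

```python
def normalize_eos_token_id(eos_token_id):
    """Cast a single int (or None) into the list form used downstream."""
    if eos_token_id is None:
        return None
    if isinstance(eos_token_id, int):
        # single token id: wrap it so callers handle exactly one shape
        return [eos_token_id]
    return list(eos_token_id)
```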
-
Joao Gante authored
-
- 07 Feb, 2023 1 commit
Arthur authored
* fix past renamed to past_key_value
* update more `past` that were skipped
* fixup
* remove changes made to rag
* refactor `_reorder_cache` to use `past_key_values`
* fix git `prepare_inputs_for_generation` to pass tests when False is needed in use_cache
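For context, `_reorder_cache` is what lets beam search permute the per-layer key/value cache so each surviving beam keeps the history of the hypothesis it was forked from. A list-based sketch (the real code indexes tensors along the batch/beam dimension):

```python
def reorder_cache(past_key_values, beam_idx):
    """Reorder a (layers x (key, value)) cache along the beam axis."""
    return tuple(
        # for each layer, reindex both the key and the value cache
        tuple([kv[i] for i in beam_idx] for kv in layer_past)
        for layer_past in past_key_values
    )
```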
-
- 03 Feb, 2023 1 commit
Joao Gante authored
-
- 02 Feb, 2023 1 commit
Jorge C. Gomes authored
input_ids_seq_length doesn't exist on the GenerationConfig; it exists only as a local variable in the function. Setting exponential_decay_length_penalty therefore raises an error: `AttributeError: 'GenerationConfig' object has no attribute 'input_ids_seq_length'`. This simple change fixes the issue, and exponential_decay_length_penalty works as expected.
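The behaviour at stake, sketched with plain floats (a hypothetical simplified form: after `start_index` tokens beyond the prompt, the EOS score is scaled up exponentially; note the prompt length must be passed in explicitly, which is precisely what the bug got wrong by reading it off GenerationConfig):

```python
def exponential_decay_eos_boost(
    eos_score, generated_len, input_ids_seq_length, start_index, decay_factor
):
    """Boost the EOS score once generation passes start_index new tokens."""
    steps_past_start = generated_len - input_ids_seq_length - start_index
    if steps_past_start <= 0:
        # still inside the grace period: leave the score untouched
        return eos_score
    # each extra step multiplies the EOS score by the decay factor
    return eos_score * decay_factor ** steps_past_start
```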
-
- 01 Feb, 2023 1 commit
Joao Gante authored
-
- 30 Jan, 2023 1 commit
Joao Gante authored
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
-
- 26 Jan, 2023 1 commit
Joao Gante authored
-
- 23 Jan, 2023 1 commit
Joao Gante authored
-
- 20 Jan, 2023 1 commit
Joao Gante authored
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
-
- 19 Jan, 2023 1 commit
Karim Foda authored
* Add hallucination penalty
* Make quality changes
* Inverse penalty
* Fix imports & quality
* Fix name spelling issue
* set encoder_repetition_penalty and fix quality
* Fix failing test
* Add to config_common_kwargs
* Fix modelling_rag error
* Update src/transformers/generation_logits_process.py
* Remove breakpoint
* Make style fixes
* Update encoder_repetition_penalty default value
* Merge latest main changes
* Make fixup changes
* Add EncoderRepetitionPenaltyLogitsProcessor to generation/__init__.py
* Fix repo-inconsistency
* Remove venv
* Remove tensorflow-macos & add tests
* Add documentation
* Fix quality issues
* move encoder_repetition_penalty to config
* Update src/transformers/configuration_utils.py
* Update src/transformers/generation/configuration_utils.py
* Remove encoder_repetition_penalty from tests
* Fix type error
* Fix format error

Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
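The "hallucination penalty" idea can be sketched as a repetition penalty applied in reverse: with a penalty above 1.0, tokens that appear in the encoder input become more likely, nudging the model to stay grounded in its source text. A simplified list-based sketch (function name and exact scaling are mine, not necessarily the shipped implementation):

```python
def apply_encoder_repetition_penalty(scores, encoder_input_ids, penalty):
    """Boost scores of tokens that occur in the encoder input (penalty > 1)."""
    inv = 1.0 / penalty
    out = []
    for token_id, score in enumerate(scores):
        if token_id in encoder_input_ids:
            # mirror of the usual repetition penalty: negative scores move
            # toward zero, positive scores grow, so the token gets likelier
            out.append(score * inv if score < 0 else score / inv)
        else:
            out.append(score)
    return out
```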
-
- 17 Jan, 2023 2 commits
Sherman Siu authored
* Add epsilon- and eta-sampling, following the official code from https://github.com/john-hewitt/truncation-sampling and adapting it to be more configurable, as required by Hugging Face Transformers.
* Add unit tests for epsilon- and eta-sampling.
* Black: fix code formatting.
* Fix docstring spacing.
* Clean up newlines.
* Fix implementation bugs and their associated tests.
* Remove epsilon- and eta-sampling parameters from PretrainedConfig.
* Clarify and clean up the documentation.
* Remove parameters for PretrainedConfig test.
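Epsilon sampling truncates the distribution by dropping every token whose probability falls below a fixed threshold; eta-sampling additionally adapts that threshold using the entropy of the distribution. A minimal sketch of the epsilon variant on plain probability lists (the real processor works on logits tensors):

```python
def epsilon_sample_filter(probs, epsilon):
    """Zero out tokens with probability < epsilon, then renormalize."""
    kept = [p if p >= epsilon else 0.0 for p in probs]
    total = sum(kept)
    if total == 0.0:
        # degenerate case: keep only the single most likely token
        best = max(range(len(probs)), key=probs.__getitem__)
        return [1.0 if i == best else 0.0 for i in range(len(probs))]
    return [p / total for p in kept]
```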
-
Maria Khalusova authored
* initial commit, refactoring the text generation api reference
* removed repetitive code examples
* Refactoring the text generation docs to reduce repetition
* make style
-
- 16 Jan, 2023 1 commit
Silver authored
Add `min_new_tokens` argument in generate() (implementation based on `MinNewTokensLengthLogitsProcessor`) (#21044)

Add a new parameter min_new_tokens for generate().
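The processor's effect can be sketched like this: while fewer than `min_new_tokens` tokens have been generated past the prompt, EOS candidates are masked out so the model cannot stop early (a simplified list-based stand-in for the logits-processor logic):

```python
def suppress_eos_until_min_new_tokens(scores, eos_token_ids, new_tokens, min_new_tokens):
    """Set EOS scores to -inf until min_new_tokens have been produced."""
    if new_tokens >= min_new_tokens:
        return scores
    # masking EOS at -inf gives it zero probability after the softmax
    return [
        float("-inf") if i in eos_token_ids else s
        for i, s in enumerate(scores)
    ]
```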
-
- 08 Jan, 2023 1 commit
Arthur authored
* start cleanup
* more updates
* more models are affected
* more updates
* update generation utils
* style
* revert change that removed reorder cache
* update generation utils
* style
* style
* remove reorder cache
-
- 03 Jan, 2023 1 commit
Motoki Wu authored
* Add StopIdStoppingCriteria
* add a working test for stop id criteria
* add to global scope
* add stop_ids to generate
* add pipeline test
* use tokenizer encode in test
* add test to generation utils
* reformat
* fixup
* make-fix-copies
* rename to stop_token_id
* use stop_tokens instead
* add to text to text generation
* make fixup
* make repo-consistency
* Add support for list of ints for eos_token_id inside generation/utils.py
* Instead of having if elses, cast the eos_token_id into a List[int]
* Add List[int] support for logits_process.py
* add List[int] for beam_search.py
* add List[int] for forced_eos_token_id
* revert stop token id stopping criteria changes
* make fixup
* fix tests
* add eos_token_id to generation/utils.py and added tests test_utils.py
* add eos_token_id type hints and fix for pad tokens
* add comments
* remove some prints and remove forced false test
* fix
* put back test_stop_sequence_stopping_criteria
* remove unused import and make fixup
* add a none check
* update docstring
* add more docstring for list ints
* make fixup
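With `eos_token_id` accepted as a list, the unfinished-sequence bookkeeping just checks membership against all the ids. A list-based sketch of that per-step update (the real code does this with tensor ops, as the commit notes):

```python
def update_unfinished(unfinished, next_tokens, eos_token_ids):
    """Mark a sequence finished once its next token is any EOS id."""
    return [
        # a sequence stays unfinished (1) only while its new token
        # matches none of the configured EOS ids
        1 if flag and tok not in eos_token_ids else 0
        for flag, tok in zip(unfinished, next_tokens)
    ]
```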
-