1. 09 May, 2023 1 commit
    • Sylvain Gugger's avatar
      Add RWKV-4 (#22797) · b4d4d6fe
      Sylvain Gugger authored
      
      
      * First draft of RWKV-4
      
      * Add support for generate
      
      * Style post-rebase
      
      * Properly use state
      
      * Write doc
      
      * Fix doc
      
      * More math
      
      * Add model to README, dummies and clean config
      
      * Fix init
      
      * multiple fixes:
      
      - fix common tests
      - fix configuraion default values
      - add CI test for checking state computation
      - fix some CI tests
      
      * correct tokenizer
      
      * some tweaks
      
      - fix config docstring
      - fix failing tests
      
      * fix CI tests
      
      - add output_attention / output_hidden_states
      - override test_initialization
      - fix failing CIs
      
      * fix conversion script
      
      - fix sharded case
      - add new arguments
      
      * add slow tests + more fixes on conversion script
      
      * add another test
      
      * final fixes
      
      * change single name variable
      
      * add mock attention mask for pipeline to work
      
      * correct eos token id
      
      * fix nits
      
      * add checkpoints
      
      * Apply suggestions from code review
      Co-authored-by: default avataramyeroberts <22614925+amyeroberts@users.noreply.github.com>
      
      * add `tie_word_embeddings` in docstring
      
      * change tensor name
      
      * fix final nits
      
      * Trigger CI
      
      ---------
      Co-authored-by: default avataryounesbelkada <younesbelkada@gmail.com>
      Co-authored-by: default avatarYounes Belkada <49240599+younesbelkada@users.noreply.github.com>
      Co-authored-by: default avataramyeroberts <22614925+amyeroberts@users.noreply.github.com>
      b4d4d6fe
  2. 08 May, 2023 1 commit
  3. 04 May, 2023 1 commit
  4. 29 Apr, 2023 1 commit
  5. 24 Apr, 2023 2 commits
  6. 20 Apr, 2023 2 commits
  7. 18 Apr, 2023 1 commit
    • Joao Gante's avatar
      Generate: Add assisted generation (#22211) · 78cda46f
      Joao Gante authored
      * working mvp
      
      * remove breakpoint
      
      * fix commit
      
      * standardize outputs
      
      * tmp commit
      
      * tests almost ready
      
      * tmp commit
      
      * skip a few models
      
      * Add streaming; Docs and examples
      
      * document limitations
      
      * PR commits
      
      * Amy PR comments
      78cda46f
  8. 13 Apr, 2023 1 commit
  9. 30 Mar, 2023 1 commit
  10. 29 Mar, 2023 1 commit
  11. 27 Mar, 2023 1 commit
  12. 22 Mar, 2023 1 commit
  13. 21 Mar, 2023 1 commit
  14. 15 Mar, 2023 1 commit
    • 娴簛鐨勫皬铻冭煿's avatar
      Fix: unfinished_sequences with correct device (#22184) · 7b0e2cfd
      娴簛鐨勫皬铻冭煿 authored
      Fix: unfinished_sequences with correct device 
      
      The original code was causing errors when running torch.jit.trace due to the tensor options being incorrect. I fixed this by using torch.ones to create a tensor with the correct device and dtype. This should resolve the issue with running torch.jit.trace.
      7b0e2cfd
  15. 14 Mar, 2023 1 commit
  16. 10 Mar, 2023 1 commit
  17. 23 Feb, 2023 1 commit
  18. 14 Feb, 2023 2 commits
  19. 13 Feb, 2023 2 commits
  20. 09 Feb, 2023 1 commit
  21. 08 Feb, 2023 2 commits
  22. 07 Feb, 2023 1 commit
    • Arthur's avatar
      [CI ] Remove `past` in favor of `pat_key_values` (#21443) · 12eb528b
      Arthur authored
      * fix past renamed to past_key_value
      
      * update more `past`that were ski^锚d
      
      * fixup
      
      * remove changes made to rag
      
      * refactor `_reorder_cache` to use `past_key_values`
      
      * fix git `prepare_inputs_for_generation` to pass tests when false is needed in use_cache
      12eb528b
  23. 03 Feb, 2023 1 commit
  24. 02 Feb, 2023 1 commit
    • Jorge C. Gomes's avatar
      Fixes bug in the creation of ExponentialDecayLengthPenalty (#21423) · 6a3d1a98
      Jorge C. Gomes authored
      input_ids_seq_length doesn't exist in the GenerationConfig, it exists as local variable in the function.
      
      Setting exponential_decay_length_penalty therefore results in an error:
      `AttributeError: 'GenerationConfig' object has no attribute 'input_ids_seq_length'`
      
      This simple change fixes this issue, and the exponential_decay_length_penalty works as expected.
      6a3d1a98
  25. 01 Feb, 2023 1 commit
  26. 30 Jan, 2023 1 commit
  27. 26 Jan, 2023 1 commit
  28. 23 Jan, 2023 1 commit
  29. 20 Jan, 2023 1 commit
  30. 19 Jan, 2023 1 commit
    • Karim Foda's avatar
      Add hallucination filter (#18675) · b9403e95
      Karim Foda authored
      
      
      * Add hallucination penalty
      
      * Make quality changes
      
      * Inverse penalty
      
      * Fix imports & quality
      
      * Fix name spelling issue
      
      * set encoder_repetition_penalty and fix quality
      
      * Fix failing test
      
      * Add to config_common_kwargs
      
      * Fix modelling_rag error
      
      * Update src/transformers/generation_logits_process.py
      Co-authored-by: default avatarJoao Gante <joaofranciscocardosogante@gmail.com>
      
      * Remove breakpoint
      
      * Make style fixes
      
      * Update encoder_repetition_penalty default value
      
      * Merge latest main changes
      
      * Make fixup changes
      
      * Add EncoderRepetitionPenaltyLogitsProcessor to generation/__init__.py
      
      * Fix repo-inconsistency
      
      * Remove venv
      
      * Remove tensorflow-macos & add tests
      
      * Add documentation
      
      * Fix quality issues
      
      * move encoder_repetition_penalty to config
      
      * Update src/transformers/configuration_utils.py
      Co-authored-by: default avatarJoao Gante <joaofranciscocardosogante@gmail.com>
      
      * Update src/transformers/generation/configuration_utils.py
      Co-authored-by: default avatarJoao Gante <joaofranciscocardosogante@gmail.com>
      
      * Remove encoder_repetition_penalty from tests
      
      * Fix type error
      
      * Fix format error
      Co-authored-by: default avatarJoao Gante <joaofranciscocardosogante@gmail.com>
      b9403e95
  31. 17 Jan, 2023 2 commits
    • Sherman Siu's avatar
      Add Epsilon- and Eta-Sampling (#21121) · 865da84a
      Sherman Siu authored
      * Add epsilon- and eta-sampling.
      
      Add epsilon- and eta-sampling, following the official code from https://github.com/john-hewitt/truncation-sampling and adapting to be more configurable, as required by Huggingface transformers.
      
      * Add unit tests for epsilon- and eta-sampling.
      
      * Black: fix code formatting.
      
      * Fix docstring spacing.
      
      * Clean up newlines.
      
      * Fix implementation bugs and their associated tests.
      
      * Remove epsilon- and eta-sampling parameters from PretrainedConfig.
      
      * Clarify and clean up the documentation.
      
      * Remove parameters for PretrainedConfig test.
      865da84a
    • Maria Khalusova's avatar
      Refactoring of the text generate API docs (#21112) · 02488103
      Maria Khalusova authored
      * initial commit, refactoring the text generation api reference
      
      * removed repetitive code examples
      
      * Refactoring the text generation docs to reduce repetition
      
      * make style
      02488103
  32. 16 Jan, 2023 1 commit
  33. 08 Jan, 2023 1 commit
    • Arthur's avatar
      Replace `past` with `past_key_values` (#20944) · f0577df6
      Arthur authored
      * start cleanup
      
      * more updates
      
      * more models are affected
      
      * more updates
      
      * update generation utils
      
      * style
      
      * revert change that removed reorder cachce
      
      * update generation utils
      
      * style
      
      * style
      
      * remove reorder cache
      f0577df6
  34. 03 Jan, 2023 1 commit
    • Motoki Wu's avatar
      Add custom stop token ids for generation (#20727) · 45da7cec
      Motoki Wu authored
      * Add StopIdStoppingCriteria
      
      * add a working test for stop id criteria
      
      * add to global scope
      
      * add stop_ids to generate
      
      * add pipeline test
      
      * use tokenizer encode in test
      
      * add test to generation utils
      
      * reformat
      
      * fixup
      
      * make-fix-copies
      
      * rename to stop_token_id
      
      * use stop_tokens instead
      
      * add to text to text generation
      
      * make fixup
      
      * make repo-consistency
      
      * Add support for list of ints for eos_token_id inside generation/utils.py
      
      * Instead of having if elses, cast the eos_token_id into a List[int]
      
      * Add List[int] support for logits_process.py
      
      * add List[int] for beam_search.py
      
      * add List[int] for forced_eos_token_id
      
      * revert stop token id stopping criteria changes
      
      * make fixup
      
      * fix tests
      
      * add eos_token_id to generation/utils.py and added tests test_utils.py
      
      * add eos_token_id type hints and fix for pad tokens
      
      * add comments
      
      * remove some prints and remove forced false test
      
      * fix
      
      * put back test_stop_sequence_stopping_criteria
      
      * remove unused import and make fixup
      
      * add a none check
      
      * update docstring
      
      * add more docstring for list ints
      
      * make fixup
      45da7cec