1. 27 Oct, 2023 11 commits
  2. 26 Oct, 2023 13 commits
  3. 25 Oct, 2023 8 commits
  4. 24 Oct, 2023 8 commits
    • Tom Aarsen's avatar
      Fix RoPE config validation for FalconConfig + various config typos (#26929) · 6cbc1369
      Tom Aarsen authored
      * Resolve incorrect ValueError in RoPE config for Falcon
      
      * Add broken codeblock tag in Falcon Config
      
      * Fix typo: an float -> a float
      
      * Implement copy functionality for Fuyu and Persimmon
      
      for RoPE scaling validation
      
      * Make style
      6cbc1369
    • JB (Don)'s avatar
      Add a default decoder_attention_mask for EncoderDecoderModel during training (#26752) · a0fd3448
      JB (Don) authored
      * Add a default decoder_attention_mask for EncoderDecoderModel during training
      
      Since we are already creating the default decoder_input_ids from the labels, we should also
      create a default decoder_attention_mask to go with it.
      
      * Fix test constant that relied on manual_seed()
      
      The test was changed to use a decoder_attention_mask that ignores padding instead (which is
      the default one created by BERT when attention_mask is None).
      
      * Create the decoder_attention_mask using decoder_input_ids instead of labels
      
      * Fix formatting in test
      a0fd3448
    • Maria Khalusova's avatar
      [docs] Performance docs refactor p.2 (#26791) · 9333bf07
      Maria Khalusova authored
      
      
      * initial edits
      
      * improvements for clarity and flow
      
      * improvements for clarity and flow, removed the repetead section
      
      * removed two docs that had no content
      
      * Revert "removed two docs that had no content"
      
      This reverts commit e98fa2fa0d8e171163f15cb8a04bdada1053543b.
      
      * Apply suggestions from code review
      Co-authored-by: default avatarSteven Liu <59462357+stevhliu@users.noreply.github.com>
      
      * feedback addressed
      
      * more feedback addressed
      
      * feedback addressed
      
      ---------
      Co-authored-by: default avatarSteven Liu <59462357+stevhliu@users.noreply.github.com>
      9333bf07
    • Patrick von Platen's avatar
      Fix config silent copy in from_pretrained (#27043) · 13ef14e1
      Patrick von Platen authored
      * Fix config modeling utils
      
      * fix more
      
      * fix attn mask bug
      
      * Update src/transformers/modeling_utils.py
      13ef14e1
    • Alex McKinney's avatar
      Device agnostic testing (#25870) · 9da45171
      Alex McKinney authored
      
      
      * adds agnostic decorators and availability fns
      
      * renaming decorators and fixing imports
      
      * updating some representative example tests
      bloom, opt, and reformer for now
      
      * wip device agnostic functions
      
      * lru cache to device checking functions
      
      * adds `TRANSFORMERS_TEST_DEVICE_SPEC`
      if present, imports the target file and updates device to function
      mappings
      
      * comments `TRANSFORMERS_TEST_DEVICE_SPEC` code
      
      * extra checks on device name
      
      * `make style; make quality`
      
      * updates default functions for agnostic calls
      
      * applies suggestions from review
      
      * adds `is_torch_available` guard
      
      * Add spec file to docs, rename function dispatch names to backend_*
      
      * add backend import to docs example for spec file
      
      * change instances of  to
      
      * Move register backend to before device check as per @statelesshz changes
      
      * make style
      
      * make opt test require fp16 to run
      
      ---------
      Co-authored-by: default avatararsalanu <arsalanu@graphcore.ai>
      Co-authored-by: default avatararsalanu <hzji210@gmail.com>
      9da45171
    • Marc Sun's avatar
      Add fuyu device map (#26949) · 41496b95
      Marc Sun authored
      * add _no_split_modules
      
      * style
      
      * fix _no_split_modules
      
      * add doc
      41496b95
    • Leandro von Werra's avatar
      add info on TRL docs (#27024) · b18e3140
      Leandro von Werra authored
      * add info on TRL docs
      
      * add TRL link
      
      * tweak text
      
      * tweak text
      b18e3140
    • amyeroberts's avatar
      Safe import of rgb_to_id from FE modules (#27037) · cb0c6806
      amyeroberts authored
      Safe import from FE modules
      cb0c6806