1. 04 Mar, 2024 5 commits
    • Add UDOP (#22940) · 836921fd
      NielsRogge authored
      
      
      * First draft
      
      * More improvements
      
      * More improvements
      
      * More fixes
      
      * Fix copies
      
      * More improvements
      
      * More fixes
      
      * More improvements
      
      * Convert checkpoint
      
      * More improvements, set up tests
      
      * Fix more tests
      
      * Add UdopModel
      
      * More improvements
      
      * Fix equivalence test
      
      * More fixes
      
      * Redesign model
      
      * Extend conversion script
      
      * Use real inputs for conversion script
      
      * Add image processor
      
      * Improve conversion script
      
      * Add UdopTokenizer
      
      * Add fast tokenizer
      
      * Add converter
      
      * Update README's
      
      * Add processor
      
      * Add fully fledged tokenizer
      
      * Add fast tokenizer
      
      * Use processor in conversion script
      
      * Add tokenizer tests
      
      * Fix one more test
      
      * Fix more tests
      
      * Fix tokenizer tests
      
      * Enable fast tokenizer tests
      
      * Fix more tests
      
      * Fix additional_special_tokens of fast tokenizer
      
      * Fix tokenizer tests
      
      * Fix more tests
      
      * Fix equivalence test
      
      * Rename image to pixel_values
      
      * Rename seg_data to bbox
      
      * More renamings
      
      * Remove vis_special_token
      
      * More improvements
      
      * Add docs
      
      * Fix copied from
      
      * Update slow tokenizer
      
      * Update fast tokenizer design
      
      * Make text input optional
      
      * Add first draft of processor tests
      
      * Fix more processor tests
      
      * Fix decoder_start_token_id
      
      * Fix test_initialization
      
      * Add integration test
      
      * More improvements
      
      * Improve processor, add test
      
      * Add more copied from
      
      * Add more copied from
      
      * Add more copied from
      
      * Add more copied from
      
      * Remove print statement
      
      * Update README and auto mapping
      
      * Delete files
      
      * Delete another file
      
      * Remove code
      
      * Fix test
      
      * Fix docs
      
      * Remove asserts
      
      * Add doc tests
      
      * Include UDOP in exotic model tests
      
      * Add expected tesseract decodings
      
      * Add sentencepiece
      
      * Use same design as T5
      
      * Add UdopEncoderModel
      
      * Add UdopEncoderModel to tests
      
      * More fixes
      
      * Fix fast tokenizer
      
      * Fix one more test
      
      * Remove parallelisable attribute
      
      * Fix copies
      
      * Remove legacy file
      
      * Copy from T5Tokenizer
      
      * Fix rebase
      
      * More fixes, copy from T5
      
      * More fixes
      
      * Fix init
      
      * Use ArthurZ/udop for tests
      
      * Make all model tests pass
      
      * Remove UdopForConditionalGeneration from auto mapping
      
      * Fix more tests
      
      * fixups
      
      * more fixups
      
      * fix the tokenizers
      
      * remove un-necessary changes
      
      * nits
      
      * nits
      
      * replace truncate_sequences_boxes with truncate_sequences for fix-copies
      
      * nit current path
      
      * add a test for input ids
      
      * ids that we should get taken from c9f7a32f57440d90ff79890270d376a1cc0acb68
      
      * nits converting
      
      * nits
      
      * apply ruff
      
      * nits
      
      * nits
      
      * style
      
      * fix slow order of addition
      
      * fix udop fast range as well
      
      * fixup
      
      * nits
      
      * Add docstrings
      
      * Fix gradient checkpointing
      
      * Update code examples
      
      * Skip tests
      
      * Update integration test
      
      * Address comment
      
      * Make fixup
      
      * Remove extra ids from tokenizer
      
      * Skip test
      
      * Apply suggestions from code review
      Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
      
      * Update year
      
      * Address comment
      
      * Address more comments
      
      * Address comments
      
      * Add copied from
      
      * Update CI
      
      * Rename script
      
      * Update model id
      
      * Add AddedToken, skip tests
      
      * Update CI
      
      * Fix doc tests
      
      * Do not use Tesseract for the doc tests
      
      * Remove kwargs
      
      * Add original inputs
      
      * Update casting
      
      * Fix doc test
      
      * Update question
      
      * Update question
      
      * Use LayoutLMv3ImageProcessor
      
      * Update organization
      
      * Improve docs
      
      * Update forward signature
      
      * Make images optional
      
      * Remove deprecated device argument
      
      * Add comment, add add_prefix_space
      
      * More improvements
      
      * Remove kwargs
      
      ---------
      Co-authored-by: ArthurZucker <arthur.zucker@gmail.com>
      Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
    • DeformableDETR support bfloat16 (#29232) · ed74d978
      Donggeun Yu authored
      
      
      * Update ms_deform_attn_cuda.cu
      
      * Update ms_deform_attn_cuda.cuh
      
      * Update modeling_deformable_detr.py
      
      * Update src/transformers/models/deformable_detr/modeling_deformable_detr.py
      Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
      
      * Update modeling_deformable_detr.py
      
      * python utils/check_copies.py --fix_and_overwrite
      
      * Fix dtype mismatch error
      
      * Update test_modeling_deformable_detr.py
      
      * Update test_modeling_deformable_detr.py
      
      * Update modeling_deformable_detr.py
      
      * Update modeling_deformable_detr.py
      
      * Support DeformableDETR with bfloat16
      
      * Add test code
      
      * Use AT_DISPATCH_FLOATING_TYPES_AND2
      
      * Update tests/models/deformable_detr/test_modeling_deformable_detr.py
      Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
      
      * Update tests/models/deformable_detr/test_modeling_deformable_detr.py
      Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
      
      * Fix not found require_torch_bf16 function
      
      ---------
      Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
    • 🚨 Fully revert atomic checkpointing 🚨 (#29370) · 1681a6d4
      Zach Mueller authored
      Fully revert atomic checkpointing
    • Fix OneFormer `post_process_instance_segmentation` for panoptic tasks (#29304) · 8ef98628
      Nick DeGroot authored
      * 🐛 Fix oneformer instance post processing when using panoptic task type
      
      * Add unit test for oneformer instance post processing panoptic bug
      
      ---------
      Co-authored-by: Nick DeGroot <1966472+nickthegroot@users.noreply.github.com>
    • [tests] enable automatic speech recognition pipeline tests on XPU (#29308) · aade711d
      Fanli Lin authored
      * use require_torch_gpu
      
      * enable on XPU
  2. 01 Mar, 2024 5 commits
  3. 29 Feb, 2024 4 commits
  4. 28 Feb, 2024 3 commits
  5. 27 Feb, 2024 3 commits
  6. 26 Feb, 2024 3 commits
  7. 23 Feb, 2024 1 commit
    • Allow remote code repo names to contain "." (#29175) · 371b572e
      Matt authored
      * stash commit
      
      * stash commit
      
      * It works!
      
      * Remove unnecessary change
      
      * We don't actually need the cache_dir!
      
      * Update docstring
      
      * Add test
      
      * Add test with custom cache dir too
      
      * Update model repo path
  8. 22 Feb, 2024 1 commit
  9. 21 Feb, 2024 3 commits
  10. 20 Feb, 2024 10 commits
  11. 19 Feb, 2024 2 commits
    • Bnb test fix for different hardwares (#29066) · 5ce90f32
      Titus authored
      
      
      * generated text on A10G
      
      * generated text in CI
      
      * Apply suggestions from code review
      
      add explanatory comments
      Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
      
      ---------
      Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
    • ENH: added new output_logits option to generate function (#28667) · 08cd694e
      Max Baak authored
      The output_logits option behaves like output_scores, but returns the raw, unprocessed prediction logit scores,
      i.e. the values before they undergo logit processing and/or warping. The regular output scores have that
      processing applied by default.
      
      Having the unprocessed logit scores is useful in certain circumstances. For example, with causal LM models
      one may want to determine the probability of a certain answer, e.g. when asking a question with a yes/no answer.
      In that case the next-token probabilities of both "yes" and "no" (and/or their relative ratio) are of interest
      for classification. These values are needed _before_ logit processing and/or warping because that step can
      a) change the probabilities or b) reject the tokens of interest or reduce the number of tokens to just 1.
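
The difference described above can be illustrated with a minimal, library-free sketch. Everything in it is an assumption made for illustration: the four-token vocabulary, the logit values, and the `top_k_warp` helper, which is only a simple stand-in for a logits warper and not the transformers implementation. In a real run the raw values would presumably come from calling generate with output_logits=True, in the same way output_scores is requested.

```python
import math


def softmax(logits):
    """Convert logits to probabilities; -inf entries get probability 0."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def top_k_warp(logits, k):
    """Hypothetical stand-in for a warper: keep the k largest logits, mask the rest."""
    threshold = sorted(logits, reverse=True)[k - 1]
    return [x if x >= threshold else -math.inf for x in logits]


# Hypothetical vocabulary and next-token logits (illustrative values only).
vocab = ["yes", "no", "maybe", "the"]
raw_logits = [2.0, 1.0, 0.5, -1.0]
yes_id, no_id = vocab.index("yes"), vocab.index("no")

# Probabilities from the raw logits: both answers keep a usable probability.
raw_probs = softmax(raw_logits)
p_yes_raw = raw_probs[yes_id] / (raw_probs[yes_id] + raw_probs[no_id])

# Probabilities after warping with k=1: "no" is rejected entirely.
warped_probs = softmax(top_k_warp(raw_logits, k=1))

print(f"P(yes | yes or no) from raw logits: {p_yes_raw:.3f}")
print(f"P(no) after top-1 warping: {warped_probs[no_id]}")
```

With the raw logits the yes/no ratio is still available for classification, while after warping the "no" token has been masked out and its probability can no longer be recovered.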
      
      For an example use case, see the paper "TabLLM: Few-shot Classification of Tabular Data with Large Language Models"
      by Stefan Hegselmann, Alejandro Buendia, Hunter Lang, Monica Agrawal, Xiaoyi Jiang, and David Sontag.
      https://arxiv.org/abs/2210.10723
      
      
      
      In addition:
      - added dedicated unit test: tests/generation/test_utils/test_return_unprocessed_logit_scores
        which tests the return of logits with output_logits=True in generation.
      - set output_logits=True in all other generation unit tests that also have output_scores=True.
      
      Implemented review feedback from @gante and @amyeroberts
      Co-authored-by: kx79wq <max.baak@ing.com>