1. 10 Jun, 2024 3 commits
    • Yih-Dar's avatar
      Fix Cohere CI (#31263) · 8fff07de
      Yih-Dar authored
      
      
      * [run-slow] cohere
      
      * [run-slow] cohere
      
      * [run-slow] cohere
      
      ---------
      Co-authored-by: default avatarydshieh <ydshieh@users.noreply.github.com>
      8fff07de
    • Pavel Iakubovskii's avatar
      Decorators for deprecation and named arguments validation (#30799) · 517df566
      Pavel Iakubovskii authored
      
      
      * Fix do_reduce_labels for maskformer image processor
      
      * Deprecate reduce_labels in favor to do_reduce_labels
      
      * Deprecate reduce_labels in favor to do_reduce_labels (segformer)
      
      * Deprecate reduce_labels in favor to do_reduce_labels (oneformer)
      
      * Deprecate reduce_labels in favor to do_reduce_labels (maskformer)
      
      * Deprecate reduce_labels in favor to do_reduce_labels (mask2former)
      
      * Fix typo
      
      * Update mask2former test
      
      * fixup
      
      * Update segmentation examples
      
      * Update docs
      
      * Fixup
      
      * Imports fixup
      
      * Add deprecation decorator draft
      
      * Add deprecation decorator
      
      * Fixup
      
      * Add deprecate_kwarg decorator
      
      * Validate kwargs decorator
      
      * Kwargs validation (beit)
      
      * fixup
      
      * Kwargs validation (mask2former)
      
      * Kwargs validation (maskformer)
      
      * Kwargs validation (oneformer)
      
      * Kwargs validation (segformer)
      
      * Better message
      
      * Fix oneformer processor save-load test
      
      * Update src/transformers/utils/deprecation.py
      Co-authored-by: default avataramyeroberts <22614925+amyeroberts@users.noreply.github.com>
      
      * Update src/transformers/utils/deprecation.py
      Co-authored-by: default avataramyeroberts <22614925+amyeroberts@users.noreply.github.com>
      
      * Update src/transformers/utils/deprecation.py
      Co-authored-by: default avatarPablo Montalvo <39954772+molbap@users.noreply.github.com>
      
      * Update src/transformers/utils/deprecation.py
      Co-authored-by: default avatarPablo Montalvo <39954772+molbap@users.noreply.github.com>
      
      * Better handle classmethod warning
      
      * Fix typo, remove warn
      
      * Add header
      
      * Docs and `additional_message`
      
      * Move to filter decorator ot generic
      
      * Proper deprecation for semantic segm scripts
      
      * Add to __init__ and update import
      
      * Basic tests for filter decorator
      
      * Fix doc
      
      * Override `to_dict()` to pop depracated `_max_size`
      
      * Pop unused parameters
      
      * Fix trailing whitespace
      
      * Add test for deprecation
      
      * Add deprecation warning control parameter
      
      * Update generic test
      
      * Fixup deprecation tests
      
      * Introduce init service kwargs
      
      * Revert popping unused params
      
      * Revert oneformer test
      
      * Allow "metadata" to pass
      
      * Better docs
      
      * Fix test
      
      * Add notion in docstring
      
      * Fix notification for both names
      
      * Add func name to warning message
      
      * Fixup
      
      ---------
      Co-authored-by: default avataramyeroberts <22614925+amyeroberts@users.noreply.github.com>
      Co-authored-by: default avatarPablo Montalvo <39954772+molbap@users.noreply.github.com>
      517df566
    • Pablo Montalvo's avatar
      Fix paligemma inverted mask (#31207) · 6b11f89c
      Pablo Montalvo authored
      * pass inverted causal mask
      
      * add sanity check for paligemma finetuning
      
      * [run-slow]paligemma
      6b11f89c
  2. 07 Jun, 2024 6 commits
    • amyeroberts's avatar
      Rename test_model_common_attributes -> test_model_get_set_embeddings (#31321) · 25245ec2
      amyeroberts authored
      * Rename to test_model_common_attributes
      The method name is misleading - it is testing being able to get and set embeddings, not common attributes to all models
      
      * Explicitly skip
      25245ec2
    • BHUVAN M's avatar
      interpolation added for TVP. (#30863) · 3b9174f2
      BHUVAN M authored
      * Update TVP model to interpolate pre-trained image pad prompter encodings
      
      * feat: Add 2D positional embeddings interpolation in TvpVisualInputEmbedding
      
      * added required comments
      
      * Update TVP model to interpolate pre-trained image pad prompter encodings
      
      * feat: Add 2D positional embeddings interpolation in TvpVisualInputEmbedding
      
      * added required comments
      
      * docstring and argument fix
      
      * doc fixes and test case fix suggested in review.
      
      * varibale typo fix
      
      * styling and name fixes for padding interpolation flag.
      3b9174f2
    • Matt's avatar
      Remove ConversationalPipeline and Conversation object (#31165) · 065729a6
      Matt authored
      * Remove ConversationalPipeline and Conversation object, as they have been deprecated for some time and are due for removal
      
      * Update not-doctested.txt
      
      * Fix JA and ZH docs
      
      * Fix JA and ZH docs some more
      
      * Fix JA and ZH docs some more
      065729a6
    • 조준래's avatar
      Implement JSON dump conversion for torch_dtype in TrainingArguments (#31224) · 60861fe1
      조준래 authored
      
      
      * Implement JSON dump conversion for torch_dtype in TrainingArguments
      
      * Add unit test for converting torch_dtype in TrainingArguments to JSON
      
      * move unit test for converting torch_dtype into TrainerIntegrationTest class
      
      * reformating using ruff
      
      * convert dict_torch_dtype_to_str to private method _dict_torch_dtype_to_str
      
      ---------
      Co-authored-by: default avatarjun.4 <jun.4@kakaobrain.com>
      60861fe1
    • Benjamin Badger's avatar
      Extend save_pretrained to offloaded models (#27412) · ff689f57
      Benjamin Badger authored
      
      
      * added hidden subset
      
      * debugged hidden subset contrastive search
      
      * added contrastive search compression
      
      * debugged compressed contrastive search
      
      * memory reduction for contrastive search
      
      * debugged mem red
      
      * added low memory option feature
      
      * debugged mem optmimization output stack
      
      * debugged mem optmimization output stack
      
      * debugged low mem
      
      * added low mem cache
      
      * fixed 2047 tensor view
      
      * debugged 2042 past key val inputs
      
      * reformatted tensors
      
      * changed low mem output
      
      * final clean
      
      * removed subset hidden csearch
      
      * fixed hidden device
      
      * fixed hidden device
      
      * changed compressor dtype
      
      * removed hstate compression
      
      * integrated csearch in generate
      
      * test csearch integration into generation
      
      exit()
      
      * fixed csearch kwarg integration with generation
      
      * final wrap and added doc
      
      * Update src/transformers/generation/utils.py
      Co-authored-by: default avatarJoao Gante <joaofranciscocardosogante@gmail.com>
      
      * Update src/transformers/generation/utils.py
      Co-authored-by: default avatarJoao Gante <joaofranciscocardosogante@gmail.com>
      
      * Update src/transformers/generation/utils.py
      Co-authored-by: default avatarJoao Gante <joaofranciscocardosogante@gmail.com>
      
      * added debug print
      
      * direct hstate cat
      
      * direct hstate cat
      
      * direct hstate cat debug
      
      * direct hstate cat debug
      
      * expanded full hidden state stack
      
      * expanded full hidden state stack
      
      * matched dims for hstates
      
      * matched dims for hstates
      
      * logits fix
      
      * equality test
      
      * equality hidden debug
      
      * debug
      
      * added prints for debug
      
      * added prints for debug
      
      * equality check
      
      * switched squeeze dim
      
      * input format debug
      
      * tracing top_k_ids
      
      * removed trace
      
      * added test context
      
      * added jitter
      
      * added jitter
      
      * added jitter
      
      * returned state
      
      * rebuilt past key value reconstruction
      
      * debugged
      
      * cleaned traces
      
      * added selection for pkv
      
      * changed output to dict
      
      * cleaned
      
      * cleaned
      
      * cleaned up contrastive search test
      
      * moved low_memory kwarg
      
      * debugged
      
      * changed low mem test batch size to 1
      
      * removed output
      
      * debugged test input shape
      
      * reformatted csearch test
      
      * added trace
      
      * removed unsqueeze on final forward pass
      
      * replaced unsqueeze with view
      
      * removed traces
      
      * cleaned
      
      * debugged model kwargs
      
      * removed special models from test
      
      * ran make quality
      
      * Update src/transformers/generation/configuration_utils.py
      Co-authored-by: default avatarJoao Gante <joaofranciscocardosogante@gmail.com>
      
      * Update src/transformers/generation/configuration_utils.py
      Co-authored-by: default avatarJoao Gante <joaofranciscocardosogante@gmail.com>
      
      * refactored
      
      * refactored
      
      * refactored
      
      * make fixup
      
      * renamed flag sequential
      
      * renamed flag sequential
      
      * iterative onloading
      
      * black style and test utils
      
      * added traces for integrated test
      
      * debugged
      
      * added traces
      
      * make style
      
      * removed traces, make style
      
      * included suggestions and added test
      
      * debugged test
      
      * added offload module check and make style
      
      * is_accelerate_available and make style
      
      * added test decorator
      
      * changed test model and config spec
      
      * added offload condition
      
      * added lazy loading for each shard
      
      * debugged
      
      * modified sharding
      
      * debugged
      
      * added traces
      
      * removed safe serialization
      
      * no index overload;
      
      * trace on safe save ptrs
      
      * added ptr condition
      
      * debugged
      
      * debugged ptr
      
      * moved module map init
      
      * remake shard only for offloaded modules
      
      * refactored
      
      * debugged
      
      * refactored
      
      * debugged
      
      * cleaned and make style
      
      * cleaned and make style
      
      * added trace
      
      * sparse module map
      
      * debugged
      
      * removed module map conditional
      
      * refactored
      
      * debug
      
      * debugged
      
      * added traces
      
      * added shard mem trace
      
      * added shard mem trace
      
      * removed underlying storage check
      
      * refactored
      
      * memory leak removal and make style
      
      * cleaned
      
      * swapped test decs and make style
      
      * added mem checks and make style
      
      * added free mem warning
      
      * implemented some suggestions
      
      * moved onloading to accelerate
      
      * refactored for accelerate integration
      
      * cleaned test
      
      * make style
      
      * debugged offload map name
      
      * cleaned and make style
      
      * replaced meta device check for sharding
      
      * cleaned and make style
      
      * implemented some suggestions
      
      * more suggestions
      
      * update warning
      Co-authored-by: default avatarMarc Sun <57196510+SunMarc@users.noreply.github.com>
      
      * more suggestions
      
      * make style
      
      * new make style
      
      * Update src/transformers/modeling_utils.py
      Co-authored-by: default avatarMarc Sun <57196510+SunMarc@users.noreply.github.com>
      
      * Update src/transformers/modeling_utils.py
      Co-authored-by: default avatarMarc Sun <57196510+SunMarc@users.noreply.github.com>
      
      * Update src/transformers/modeling_utils.py
      Co-authored-by: default avatarMarc Sun <57196510+SunMarc@users.noreply.github.com>
      
      * Update src/transformers/modeling_utils.py
      Co-authored-by: default avataramyeroberts <22614925+amyeroberts@users.noreply.github.com>
      
      ---------
      Co-authored-by: default avatarJoao Gante <joaofranciscocardosogante@gmail.com>
      Co-authored-by: default avatarMarc Sun <57196510+SunMarc@users.noreply.github.com>
      Co-authored-by: default avataramyeroberts <22614925+amyeroberts@users.noreply.github.com>
      ff689f57
    • Cyril Vallez's avatar
      Fix jetmoe model (#31279) · 8bcf9c8d
      Cyril Vallez authored
      * Fix jetmoe model
      
      * Remove skip-tests
      8bcf9c8d
  3. 06 Jun, 2024 7 commits
  4. 05 Jun, 2024 2 commits
  5. 04 Jun, 2024 7 commits
  6. 03 Jun, 2024 6 commits
  7. 31 May, 2024 2 commits
  8. 30 May, 2024 2 commits
  9. 29 May, 2024 2 commits
  10. 28 May, 2024 3 commits