1. 07 Jun, 2024 6 commits
    • amyeroberts's avatar
      Rename test_model_common_attributes -> test_model_get_set_embeddings (#31321) · 25245ec2
      amyeroberts authored
      * Rename to test_model_common_attributes
      The method name is misleading - it is testing being able to get and set embeddings, not common attributes to all models
      
      * Explicitly skip
      25245ec2
    • BHUVAN M's avatar
      interpolation added for TVP. (#30863) · 3b9174f2
      BHUVAN M authored
      * Update TVP model to interpolate pre-trained image pad prompter encodings
      
      * feat: Add 2D positional embeddings interpolation in TvpVisualInputEmbedding
      
      * added required comments
      
      * Update TVP model to interpolate pre-trained image pad prompter encodings
      
      * feat: Add 2D positional embeddings interpolation in TvpVisualInputEmbedding
      
      * added required comments
      
      * docstring and argument fix
      
      * doc fixes and test case fix suggested in review.
      
      * varibale typo fix
      
      * styling and name fixes for padding interpolation flag.
      3b9174f2
    • Matt's avatar
      Remove ConversationalPipeline and Conversation object (#31165) · 065729a6
      Matt authored
      * Remove ConversationalPipeline and Conversation object, as they have been deprecated for some time and are due for removal
      
      * Update not-doctested.txt
      
      * Fix JA and ZH docs
      
      * Fix JA and ZH docs some more
      
      * Fix JA and ZH docs some more
      065729a6
    • 조준래's avatar
      Implement JSON dump conversion for torch_dtype in TrainingArguments (#31224) · 60861fe1
      조준래 authored
      
      
      * Implement JSON dump conversion for torch_dtype in TrainingArguments
      
      * Add unit test for converting torch_dtype in TrainingArguments to JSON
      
      * move unit test for converting torch_dtype into TrainerIntegrationTest class
      
      * reformating using ruff
      
      * convert dict_torch_dtype_to_str to private method _dict_torch_dtype_to_str
      
      ---------
      Co-authored-by: default avatarjun.4 <jun.4@kakaobrain.com>
      60861fe1
    • Benjamin Badger's avatar
      Extend save_pretrained to offloaded models (#27412) · ff689f57
      Benjamin Badger authored
      
      
      * added hidden subset
      
      * debugged hidden subset contrastive search
      
      * added contrastive search compression
      
      * debugged compressed contrastive search
      
      * memory reduction for contrastive search
      
      * debugged mem red
      
      * added low memory option feature
      
      * debugged mem optmimization output stack
      
      * debugged mem optmimization output stack
      
      * debugged low mem
      
      * added low mem cache
      
      * fixed 2047 tensor view
      
      * debugged 2042 past key val inputs
      
      * reformatted tensors
      
      * changed low mem output
      
      * final clean
      
      * removed subset hidden csearch
      
      * fixed hidden device
      
      * fixed hidden device
      
      * changed compressor dtype
      
      * removed hstate compression
      
      * integrated csearch in generate
      
      * test csearch integration into generation
      
      exit()
      
      * fixed csearch kwarg integration with generation
      
      * final wrap and added doc
      
      * Update src/transformers/generation/utils.py
      Co-authored-by: default avatarJoao Gante <joaofranciscocardosogante@gmail.com>
      
      * Update src/transformers/generation/utils.py
      Co-authored-by: default avatarJoao Gante <joaofranciscocardosogante@gmail.com>
      
      * Update src/transformers/generation/utils.py
      Co-authored-by: default avatarJoao Gante <joaofranciscocardosogante@gmail.com>
      
      * added debug print
      
      * direct hstate cat
      
      * direct hstate cat
      
      * direct hstate cat debug
      
      * direct hstate cat debug
      
      * expanded full hidden state stack
      
      * expanded full hidden state stack
      
      * matched dims for hstates
      
      * matched dims for hstates
      
      * logits fix
      
      * equality test
      
      * equality hidden debug
      
      * debug
      
      * added prints for debug
      
      * added prints for debug
      
      * equality check
      
      * switched squeeze dim
      
      * input format debug
      
      * tracing top_k_ids
      
      * removed trace
      
      * added test context
      
      * added jitter
      
      * added jitter
      
      * added jitter
      
      * returned state
      
      * rebuilt past key value reconstruction
      
      * debugged
      
      * cleaned traces
      
      * added selection for pkv
      
      * changed output to dict
      
      * cleaned
      
      * cleaned
      
      * cleaned up contrastive search test
      
      * moved low_memory kwarg
      
      * debugged
      
      * changed low mem test batch size to 1
      
      * removed output
      
      * debugged test input shape
      
      * reformatted csearch test
      
      * added trace
      
      * removed unsqueeze on final forward pass
      
      * replaced unsqueeze with view
      
      * removed traces
      
      * cleaned
      
      * debugged model kwargs
      
      * removed special models from test
      
      * ran make quality
      
      * Update src/transformers/generation/configuration_utils.py
      Co-authored-by: default avatarJoao Gante <joaofranciscocardosogante@gmail.com>
      
      * Update src/transformers/generation/configuration_utils.py
      Co-authored-by: default avatarJoao Gante <joaofranciscocardosogante@gmail.com>
      
      * refactored
      
      * refactored
      
      * refactored
      
      * make fixup
      
      * renamed flag sequential
      
      * renamed flag sequential
      
      * iterative onloading
      
      * black style and test utils
      
      * added traces for integrated test
      
      * debugged
      
      * added traces
      
      * make style
      
      * removed traces, make style
      
      * included suggestions and added test
      
      * debugged test
      
      * added offload module check and make style
      
      * is_accelerate_available and make style
      
      * added test decorator
      
      * changed test model and config spec
      
      * added offload condition
      
      * added lazy loading for each shard
      
      * debugged
      
      * modified sharding
      
      * debugged
      
      * added traces
      
      * removed safe serialization
      
      * no index overload;
      
      * trace on safe save ptrs
      
      * added ptr condition
      
      * debugged
      
      * debugged ptr
      
      * moved module map init
      
      * remake shard only for offloaded modules
      
      * refactored
      
      * debugged
      
      * refactored
      
      * debugged
      
      * cleaned and make style
      
      * cleaned and make style
      
      * added trace
      
      * sparse module map
      
      * debugged
      
      * removed module map conditional
      
      * refactored
      
      * debug
      
      * debugged
      
      * added traces
      
      * added shard mem trace
      
      * added shard mem trace
      
      * removed underlying storage check
      
      * refactored
      
      * memory leak removal and make style
      
      * cleaned
      
      * swapped test decs and make style
      
      * added mem checks and make style
      
      * added free mem warning
      
      * implemented some suggestions
      
      * moved onloading to accelerate
      
      * refactored for accelerate integration
      
      * cleaned test
      
      * make style
      
      * debugged offload map name
      
      * cleaned and make style
      
      * replaced meta device check for sharding
      
      * cleaned and make style
      
      * implemented some suggestions
      
      * more suggestions
      
      * update warning
      Co-authored-by: default avatarMarc Sun <57196510+SunMarc@users.noreply.github.com>
      
      * more suggestions
      
      * make style
      
      * new make style
      
      * Update src/transformers/modeling_utils.py
      Co-authored-by: default avatarMarc Sun <57196510+SunMarc@users.noreply.github.com>
      
      * Update src/transformers/modeling_utils.py
      Co-authored-by: default avatarMarc Sun <57196510+SunMarc@users.noreply.github.com>
      
      * Update src/transformers/modeling_utils.py
      Co-authored-by: default avatarMarc Sun <57196510+SunMarc@users.noreply.github.com>
      
      * Update src/transformers/modeling_utils.py
      Co-authored-by: default avataramyeroberts <22614925+amyeroberts@users.noreply.github.com>
      
      ---------
      Co-authored-by: default avatarJoao Gante <joaofranciscocardosogante@gmail.com>
      Co-authored-by: default avatarMarc Sun <57196510+SunMarc@users.noreply.github.com>
      Co-authored-by: default avataramyeroberts <22614925+amyeroberts@users.noreply.github.com>
      ff689f57
    • Cyril Vallez's avatar
      Fix jetmoe model (#31279) · 8bcf9c8d
      Cyril Vallez authored
      * Fix jetmoe model
      
      * Remove skip-tests
      8bcf9c8d
  2. 06 Jun, 2024 7 commits
  3. 05 Jun, 2024 2 commits
  4. 04 Jun, 2024 7 commits
  5. 03 Jun, 2024 6 commits
  6. 31 May, 2024 2 commits
  7. 30 May, 2024 2 commits
  8. 29 May, 2024 2 commits
  9. 28 May, 2024 6 commits