1. 17 May, 2024 3 commits
    • Enable device map (#30870) · 3802e786
      Darshana S authored
      * added_no_split_modules
      
      * added LlavaNextVisionAttention to _no_split_modules
    • Remove deprecated logic and warnings (#30743) · 57c965a8
      amyeroberts authored
      * Remove deprecated logic and warnings
      
      * Add back some code that seems to be important...
      
      * Let's just add all the nllb stuff back; removing it is a bit more involved
      
      * Remove kwargs
      
      * Remove more kwargs
    • TEST: Add llama logits tests (#30835) · 3d7d3a87
      Younes Belkada authored
      * add llama logits test
      
      * fix
      
      * fix tests
      
      * fix for a10
      
      * format
      
      * format
      
      * fix
      
      * [run-slow] remove fmt: skip
      
      * Your commit message
      
      * test commit
      
      * Revert "test commit"
      
      This reverts commit b66e01e55f5e31d4c0479cac4bcacc0f123dc9d2.
      
      * [run-slow]llama
      
      * Update tests/models/llama/test_modeling_llama.py
      
      * [run-slow]llama
      
      * empty commit
  2. 16 May, 2024 10 commits
  3. 15 May, 2024 14 commits
  4. 14 May, 2024 10 commits
  5. 13 May, 2024 3 commits
    • CI: update to ROCm 6.0.2 and test MI300 (#30266) · 37bba2a3
      fxmarty authored
      
      
      * update to ROCm 6.0.2 and test MI300
      
      * add callers for mi300
      
      * update dockerfile
      
      * fix trainer tests
      
      * remove apex
      
      * style
      
      * Update tests/trainer/test_trainer_seq2seq.py
      
      * Update tests/trainer/test_trainer_seq2seq.py
      
      * Update tests/trainer/test_trainer_seq2seq.py
      
      * Update tests/trainer/test_trainer_seq2seq.py
      
      * update to torch 2.3
      
      * add workflow dispatch target
      
      * we may need branches: mi300-ci after all
      
      * nit
      
      * fix docker build
      
      * nit
      
      * add check runner
      
      * remove docker-gpu
      
      * fix issues
      
      * fix
      
      ---------
      Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>
      Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
    • skip low_cpu_mem_usage tests (#30782) · 539ed75d
      Marc Sun authored
    • Deprecate models script (#30184) · 0f8fefd4
      amyeroberts authored
      * Add utility for finding candidate models for deprecation
      
      * Update model init
      
      * Make into configurable script
      
      * Fix path
      
      * Add sorting of base object alphabetically
      
      * Tidy
      
      * Refactor __init__ alpha ordering
      
      * Update script with logging
      
      * fix import
      
      * Fix logger
      
      * Fix logger
      
      * Get config file before moving files
      
      * Take models from CLI
      
      * Split models into lines to make them easier to feed to deprecate_models script
      
      * Update
      
      * Use posix path
      
      * Print instead
      
      * Add example in module docstring
      
      * Fix up
      
      * Add clarifying comments; add models to DEPRECATE_MODELS
      
      * Address PR comments
      
      * Don't update relative paths on the same level