1. 21 Mar, 2024 1 commit
  2. 20 Mar, 2024 1 commit
    • Matt's avatar
      Support sharded safetensors in TF (#29350) · 11ef35e8
      Matt authored
      
      
      * Initial commit (still lots of unfinished bits)
      
      * (Still untested) add safetensors sharding to save_pretrained
      
      * Fix savetensors saving, update default shard size to match PT
      
      * Add proper loading of TF-format safetensors
      
      * Revert default size in case that changes things
      
      * Fix incorrect index name
      
      * Update loading priority
      
      * Update tests
      
      * Make the tests a little more stringent
      
      * Expand tests
      
      * Add sharded cross-test
      
      * Fix argument name
      
      * One more test fix
      
      * Adding mlx to the list of allowed formats
      
      * Remove irrelevant block for safetensors
      
      * Refactor warning logging into a separate function
      
      * Remove unused skip_logger_warnings arg
      
      * Update src/transformers/modeling_tf_utils.py
      Co-authored-by: default avataramyeroberts <22614925+amyeroberts@users.noreply.github.com>
      
      * Move function def
      
      ---------
      Co-authored-by: default avataramyeroberts <22614925+amyeroberts@users.noreply.github.com>
      11ef35e8
  3. 15 Mar, 2024 1 commit
  4. 30 Jan, 2024 1 commit
    • Matt's avatar
      Add tf_keras imports to prepare for Keras 3 (#28588) · 415e9a09
      Matt authored
      * Port core files + ESM (because ESM code is odd)
      
      * Search-replace in modelling code
      
      * Fix up transfo_xl as well
      
      * Fix other core files + tests (still need to add correct import to tests)
      
      * Fix cookiecutter
      
      * make fixup, fix imports in some more core files
      
      * Auto-add imports to tests
      
      * Cleanup, add imports to sagemaker tests
      
      * Use correct exception for importing tf_keras
      
      * Fixes in modeling_tf_utils
      
      * make fixup
      
      * Correct version parsing code
      
      * Ensure the pipeline tests correctly revert to float32 after each test
      
      * Ensure the pipeline tests correctly revert to float32 after each test
      
      * More tf.keras -> keras
      
      * Add dtype cast
      
      * Better imports of tf_keras
      
      * Add a cast for tf.assign, just in case
      
      * Fix callback imports
      415e9a09
  5. 14 Dec, 2023 1 commit
  6. 13 Nov, 2023 2 commits
  7. 31 Oct, 2023 1 commit
  8. 15 Jun, 2023 1 commit