    Improve TF weight loading, especially PT crossloading (#21792) · acfb714b
    Matt authored
    * First commit for the improved PT-TF weight loading
    
    * Remove workarounds from TFEncoderDecoder tests
    
    * Allow a custom weight renaming function in from_pretrained and use that to clean up EncoderDecoder
    
    * make fixup
    
    * First attempt at visionencoderdecoder
    
    * Disable tensorfloat32 in tests to get consistent outputs
    
    * Quick fix to tf_vision_encoder_decoder tests
    
    * make fixup
    
    * Update Blenderbot tests
    
    * Remove unused arg in modeling_tf_opt
    
    * load_tf_sharded_weights had strict=True! This meant transfer learning was impossible, so I'm setting it to False.
    
    * Support prefixes when loading sharded TF checkpoints
    
    * make fixup
    
    * Add test to load sharded models with a weight prefix
    
    * Fix sharded weight loading test
    
    * Add a test for transfer from a sharded checkpoint
    
    * make fixup
    
    * Add test to check that crossloading from PT with a prefix works
    
    * Refactor from_pretrained in the encoderdecoder classes
    
    * missmatched -> mismatched
    
    * Explicitly check for None
    
    * No comments showing my very impressive and attractive knowledge of Py3.9+
    
    * Disable TF32 across all TF tests
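
    The prefix, renaming, and strict=False items above can be pictured with a
    small, framework-free sketch. It is not the code from this commit: the
    function name match_weights, its parameters, and the "/"-separated weight
    names are all assumptions made for illustration. It only shows the general
    idea of normalising checkpoint names (optional rename, optional prefix
    strip) and then reporting missing/unexpected keys instead of failing,
    unless strict matching is requested.

```python
from typing import Callable, Dict, List, Optional, Tuple


def match_weights(
    model_names: List[str],
    checkpoint: Dict[str, object],
    prefix: str = "",
    rename: Optional[Callable[[str], str]] = None,
    strict: bool = False,
) -> Tuple[Dict[str, object], List[str], List[str]]:
    """Hypothetical helper: map checkpoint entries onto model weight names.

    Returns (matched, missing_keys, unexpected_keys).
    """
    normalised: Dict[str, object] = {}
    for name, value in checkpoint.items():
        # Apply the caller-supplied renaming function first, if any.
        if rename is not None:
            name = rename(name)
        # Strip a leading "<prefix>/" so prefixed checkpoints can load
        # into a model whose weights are named without that prefix.
        if prefix and name.startswith(prefix + "/"):
            name = name[len(prefix) + 1:]
        normalised[name] = value

    wanted = set(model_names)
    matched = {n: normalised[n] for n in model_names if n in normalised}
    missing = [n for n in model_names if n not in normalised]
    unexpected = [n for n in normalised if n not in wanted]

    # With strict=True any mismatch is an error, which rules out transfer
    # learning (new heads are always "missing" from the old checkpoint).
    if strict and (missing or unexpected):
        raise ValueError(
            f"missing keys: {missing}; unexpected keys: {unexpected}"
        )
    return matched, missing, unexpected


if __name__ == "__main__":
    ckpt = {"encoder/dense/kernel": 1, "encoder/dense/bias": 2}
    names = ["dense/kernel", "dense/bias", "classifier/kernel"]
    print(match_weights(names, ckpt, prefix="encoder"))
```

    With strict left at False, the new classifier weight is simply reported as
    missing and can be freshly initialised, which is the transfer-learning case
    the commit message refers to.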
test_modeling_tf_common.py 118 KB