"examples/research_projects/distillation/train.py" did not exist on "9f995b99d4c4067662c3bd4f1274315c0839deeb"
  • Suraj Patil's avatar
    Fix Flax params dtype (#13098) · e92190c0
    Suraj Patil authored
    
    
    * fix inits
    
    * fix embed dtype
    
    * fix embed dtype
    
    * add test to check default dtype
    
    * quality
    
    * add type conversion methods for flax models
    
    * more robust casting
    
    * cast sinusoidal positions
    
    * update pegasus
    
    * update albert
    
    * update test
    
    * make sure dtype is passed to every module
    
    * style
    
    * fix electra dense
    
    * fix t5
    
    * quality
    
    * add more tests
    
    * better name
    
    * use the dtype for lm head computation
    
    * fix albert
    
    * style
    
    * fix albert embed dtype
    
    * more tests
    
    * fix vision enc-dec
    
    * cleanup
    
    * fix embed dtype pegasus
    
    * fix default param test
    
    * doc
    
    * update template
    
    * fix final_logits_bias dtype
    
    * Apply suggestions from code review
    Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
    
    * fix doc
    
    * fix doc
    
    * add detailed docstring for dtype parameter
    
    * remove un-necessary import
    Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
    e92190c0
test_modeling_flax_common.py 35.3 KB