• Jiarui Fang's avatar
    [hotfix] fx get comm size bugs (#1233) · 0e199d71
    Jiarui Fang authored
    
    
    * init a checkpoint dir
    
    * [checkpoint]support resume for cosinewarmuplr
    
    * [checkpoint]add unit test
    
    * fix some bugs but still not OK
    
    * fix bugs
    
    * make it faster
    
    * [checkpoint]support generalized scheduler
    
    * polish
    
    * [tensor] torch function return colotensor
    
    * polish
    
    * fix bugs
    
    * remove debug info
    
    * polish
    
    * polish
    
    * [tensor] test_model pass unittests
    
    * polish
    
    * [hotfix] fx get comm size bug
    Co-authored-by: default avatarZhaoYi1222 <zhaoyi9499@gmail.com>
    0e199d71
test_model.py 12 KB