• FoolPlayer's avatar
    [shardformer] Align bert value (#3907) · f1cb5ac6
    FoolPlayer authored
    * add bert align test, fix dist loss bug
    
    * forward and backward align
    
    * add ignore index
    
    * add shardformer CI
    
    * add gather_output optional for user in shardconfig
    
    * update readme with optional gather_ouput
    
    * add dist crossentropy loss test, remove unused files
    
    * remove unused file
    
    * remove unused file
    
    * rename the file
    
    * polish code
    f1cb5ac6
test_distcrossentropy.py 1.41 KB