    [gemini] support amp o3 for gemini (#4872) · df635641
    Hongxin Liu authored
    * [gemini] support no reuse fp16 chunk
    
    * [gemini] support no master weight for optim
    
    * [gemini] support no master weight for gemini ddp
    
    * [test] update gemini tests
    
    * [test] update gemini tests
    
    * [plugin] update gemini plugin
    
    * [test] fix gemini checkpointio test
    
    * [test] fix gemini checkpoint io
gemini_ddp.py 36.9 KB