• ver217's avatar
    Optimize pipeline schedule (#94) · 96780e6e
    ver217 authored
    
    
    * add pipeline shared module wrapper and update load batch
    
    * added model parallel process group for amp and clip grad (#86)
    
    * added model parallel process group for amp and clip grad
    
    * update amp and clip with model parallel process group
    
    * remove pipeline_prev/next group (#88)
    
    * micro batch offload
    
    * optimize pipeline gpu memory usage
    
    * pipeline can receive tensor shape (#93)
    
    * optimize pipeline gpu memory usage
    
    * fix grad accumulation step counter
    
    * rename classes and functions
    Co-authored-by: default avatarFrank Lee <somerlee.9@gmail.com>
    96780e6e
add_your_parallel.md 4.36 KB