"docs/source/en/vscode:/vscode.git/clone" did not exist on "e0a1c1321ce6751686e184476d520e173c1d6b8e"
Optimize pipeline schedule (#94)
* add pipeline shared module wrapper and update load batch
* added model parallel process group for amp and clip grad (#86)
* added model parallel process group for amp and clip grad
* update amp and clip with model parallel process group
* remove pipeline_prev/next group (#88)
* micro batch offload
* optimize pipeline gpu memory usage
* pipeline can receive tensor shape (#93)
* optimize pipeline gpu memory usage
* fix grad accumulation step counter
* rename classes and functions
Co-authored-by:
Frank Lee <somerlee.9@gmail.com>
Showing
Please register or sign in to comment