[Distributed][PP] only create embedding & lm head when necessary (#6455)
original title: [Distributed][Model] Rank-based Component Creation for Pipeline Parallelism Memory Optimization
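
The idea in the title can be illustrated with a minimal sketch. This is not the code from the PR. The function name `plan_stage_components`, the dictionary keys, and the even layer split are all hypothetical and chosen only for illustration. The sketch assumes the embedding is needed only on the first pipeline stage and the LM head only on the last, and it ignores cases such as tied embedding and LM head weights.

```python
def plan_stage_components(pp_rank: int, pp_size: int, num_layers: int) -> dict:
    """Decide which model components a pipeline-parallel rank should build.

    Hypothetical helper for illustration only: the first stage owns the
    token embedding, the last stage owns the LM head, and each stage owns
    a contiguous slice of the decoder layers.
    """
    if not 0 <= pp_rank < pp_size:
        raise ValueError(f"pp_rank {pp_rank} out of range for pp_size {pp_size}")

    # Assumed partitioning: split layers as evenly as possible and give
    # the leftover layers to the earliest stages.
    base, extra = divmod(num_layers, pp_size)
    start = pp_rank * base + min(pp_rank, extra)
    end = start + base + (1 if pp_rank < extra else 0)

    return {
        "embed_tokens": pp_rank == 0,        # only the first stage embeds tokens
        "layers": list(range(start, end)),   # decoder layers owned by this stage
        "lm_head": pp_rank == pp_size - 1,   # only the last stage produces logits
    }


if __name__ == "__main__":
    for rank in range(4):
        print(rank, plan_stage_components(rank, pp_size=4, num_layers=10))
```

Under these assumptions, middle ranks allocate neither the embedding nor the LM head, which is where the memory saving would come from. With `pp_size=1` the single rank builds both.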