    Minimal gpt pipeline parallel (builds off of minimal_bert_pipeline_parallel)... · ab7af058
    Rishi Puri authored
    
    Minimal gpt pipeline parallel (builds off of minimal_bert_pipeline_parallel) including cpu-offloading (#1222)
    
    * minimal bert pipeline parallel test
    
    * first draft of gpt minimal test
    
    * framework to scale up the gpt2 test for variety of distributed setups
    
    * adding gpt_minimal_test to list of multigpu tests
    Co-authored-by: Eddie Yan <eddiey@nvidia.com>
    Co-authored-by: riship <riship@nvidia.com>
gpt_scaling_test.py 2.81 KB