"vscode:/vscode.git/clone" did not exist on "73a89efb30cda676f255f8792a0cad70dc10bf71"
  • Kai Zhang's avatar
    Auto scale config for multi-node training · e87ed5f0
    Kai Zhang authored
    Summary:
    Pull Request resolved: https://github.com/facebookresearch/d2go/pull/62
    
    Lightning trainer set max step to cfg.SOLVER.MAX_ITER. However, this is the max iteration for all nodes, in multi-node training, we need to scale it down, as well as eval period and other configs.
    This diff calls `auto_scale_world_size` before passing the config to trainer.
    
    Reviewed By: wat3rBro
    
    Differential Revision: D28140877
    
    fbshipit-source-id: 2639ae58773a4ec2a0cc59dfefb2f5d9b1afe1a8
    e87ed5f0
lightning_train_net.py 7.23 KB