    [Model] Support Multi-GPU for Transformer model (#356) · 29dd22e6
    Lingfan Yu authored
    * multi-process version of transformer
    
    * lots of fixes
    
    * fix bugs and accum gradients for multiple batches
    
    * many fixes
    
    * minor
    
    * upd
    
    * set torch device
    
    * fix bugs
    
    * fix and minor
    
    * comments and clean up
    
    * uncomment viz code
models.py 9.97 KB