• Toby Boyd's avatar
    Transformer instrumented for benchmarking (#6734) · 40543869
    Toby Boyd authored
    * Add first benchmark and return stats.
    
    * Remove print statements update training steps.
    
    * Revert print T: in print statement.
    
    * Remove print(stats)
    
    * add 2 gpu accuracy test for base.
    
    * Fixed total_batch_size when using gpu + gFile deprecations.
    
    * 8 GPU test name fix
    
    * Add 4 and 8 GPU tests.
    
    * typo fixes.
    
    * Clean up test names and methods.
    
    * bleu uncased.  docstring format fix.
    40543869
tokenizer.py 22.1 KB