1. 25 Apr, 2022 1 commit
    • Bug - Fix bug of duration feature for model benchmarks in distributed mode. (#347) · b5b1c3da
      user4543 authored
      **Description**
      Fix bug of duration feature for model benchmarks in distributed mode.
      
      **Major Revision**
      - Add all_reduce to sync the result of is_finished (the function that decides whether the model benchmark should stop) at each step.
        - This avoids ranks disagreeing on when the duration ends; otherwise some ranks may enter one more step and never finish.
      - Add torch.cuda.synchronize() before and after the step time measurement in train_step() for all model benchmarks.
        - Some operations in train_step() may be asynchronous, resulting in incorrect step time records (for example, LSTM).
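
The rank-disagreement problem described above can be illustrated with a small pure-Python simulation. This is only a sketch under stated assumptions, not the code from this commit: `steps_per_rank` and `local_limits` are hypothetical names, and `max` over the per-rank flags stands in for an all_reduce with a MAX reduction (the commit message does not say which reduction operator is used).

```python
def steps_per_rank(local_limits, sync):
    """Return how many steps each simulated rank runs before stopping.

    local_limits[r] is the step at which rank r's own clock says the
    benchmark duration has elapsed (hypothetical input for illustration).
    """
    n = len(local_limits)
    steps = [0] * n
    running = [True] * n
    while any(running):
        # Every rank that is still running completes one more step.
        for r in range(n):
            if running[r]:
                steps[r] += 1
        # Each rank evaluates its local "is finished" flag.
        flags = [steps[r] >= local_limits[r] for r in range(n)]
        if sync:
            # Stand-in for an all_reduce (MAX assumed): if any rank is
            # finished, every rank sees "finished".
            agreed = max(flags)
            flags = [agreed] * n
        for r in range(n):
            if flags[r]:
                running[r] = False
    return steps


# Without syncing, rank 1 runs one extra step; in a real distributed job
# it would wait forever on a collective that rank 0 never joins.
print(steps_per_rank([10, 11], sync=False))  # [10, 11]
# With syncing, both ranks stop after the same step.
print(steps_per_rank([10, 11], sync=True))   # [10, 10]
```

Unequal step counts in the unsynced case correspond to the hang described in the commit message; equal counts in the synced case show every rank leaving the loop together.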
  2. 22 Mar, 2022 1 commit
  3. 17 Mar, 2022 1 commit
  4. 06 Mar, 2022 1 commit
  5. 16 Jun, 2021 1 commit
    • Bug bash - Fix bugs and refine log in single GPU benchmarks (#97) · ddbc51a1
      Yifan Xiong authored
      Fix bugs and refine log in single GPU benchmarks:
      
      * Fix none framework issue
      * Fix empty parameter bug
      * Remove missed mobilenet_v3 models
      * Change benchmark registration log to debug level
      * Add pid in logging
      * Add missing benchmarks in default config
      * Fix deprecated logging warn
  6. 26 Apr, 2021 1 commit
  7. 20 Apr, 2021 1 commit
  8. 16 Apr, 2021 1 commit
  9. 26 Mar, 2021 1 commit