1. 23 Apr, 2023 1 commit
    • aspanday's avatar
      Updating BLOCK_SIZE to 1024 in all optimizers. (#103) · 06053e19
      aspanday authored
      * Updating BLOCK_SIZE to 1024.
      tests/L0/run_optimizers/test_fused_optimizer.py test passes except for bfloat16 for Adam. There seems to be a bug in this test that needs to be resolved.
      For now skipping test_bfloat16 for Adam in the unittest.
      Ran 17 other tests and ALL other tests pass!
      More details on the effects of these changes can be found here -  https://confluence.amd.com/display/MLSE/Apex+Kernel+Optimization
      
      .
      This commit changes BLOCK_SIZE=1024 ONLY FOR different optimizers.
      L2norm kernels (part of LAMB optimizer algorithm) still maintain BLOCK_SIZE=512 otherwise Allclose fails.
      
      * Updating tests/L0/run_optimizers/test_fused_optimizer.py with @skipifRocm to skip test_bfloat16 in Adam.
      Co-authored-by: default avataraspanday <aspanday@amd.com>
      06053e19
  2. 25 Feb, 2021 1 commit
  3. 05 Aug, 2020 1 commit
  4. 21 May, 2020 1 commit
  5. 20 May, 2020 1 commit
  6. 12 May, 2020 1 commit
  7. 03 Jul, 2019 1 commit
  8. 27 May, 2019 1 commit
  9. 10 May, 2019 1 commit
  10. 26 Apr, 2019 2 commits
  11. 25 Apr, 2019 1 commit
  12. 22 Apr, 2019 1 commit
  13. 18 Apr, 2019 1 commit
  14. 11 Mar, 2019 2 commits
  15. 09 Mar, 2019 1 commit
  16. 08 Mar, 2019 5 commits