"vscode:/vscode.git/clone" did not exist on "d5c10e7a962871ad8bd0371312ef04296d6a36f8"
  1. 08 May, 2023 1 commit
  2. 23 Apr, 2023 3 commits
  3. 25 Jan, 2023 1 commit
    • aspanday's avatar
      Updating BLOCK_SIZE to 1024 in all optimizers. (#103) · 14db5c27
      aspanday authored
      * Updating BLOCK_SIZE to 1024.
      tests/L0/run_optimizers/test_fused_optimizer.py test passes except for bfloat16 for Adam. There seems to be a bug in this test that needs to be resolved.
      For now skipping test_bfloat16 for Adam in the unittest.
      Ran 17 other tests and ALL other tests pass!
      More details on the effects of these changes can be found here -  https://confluence.amd.com/display/MLSE/Apex+Kernel+Optimization
      
      .
      This commit changes BLOCK_SIZE=1024 ONLY FOR different optimizers.
      L2norm kernels (part of LAMB optimizer algorithm) still maintain BLOCK_SIZE=512 otherwise Allclose fails.
      
      * Updating tests/L0/run_optimizers/test_fused_optimizer.py with @skipifRocm to skip test_bfloat16 in Adam.
      Co-authored-by: default avataraspanday <aspanday@amd.com>
      14db5c27
  4. 06 Dec, 2022 2 commits
  5. 10 Aug, 2022 1 commit
  6. 08 Aug, 2022 1 commit
  7. 23 Jun, 2022 1 commit
    • Tim Moon's avatar
      Move distributed Adam unit test to contrib dir (#1406) · 57f890a7
      Tim Moon authored
      * Increase default bucket size in distributed Adam
      
      * Move distributed Adam unit test to contrib tests
      
      Integrate into unit testing framework
      
      * Tweak hyperparameters for dist Adam optimizer test
      
      Improves numerical stability so we can keep tight tolerances. Adopting suggestions from @crcrpar.
      
      * Use distributed test infrastructure in distributed Adam unit test
      
      Suggestion from @crcrpar.
      57f890a7
  8. 22 Jun, 2022 1 commit
  9. 14 Jun, 2022 2 commits
  10. 14 Dec, 2021 1 commit
  11. 09 Dec, 2021 2 commits
  12. 19 Oct, 2021 1 commit
  13. 15 Apr, 2021 1 commit
    • Sudhakar Singh's avatar
      Add unit tests for Fused NovoGrad (#1065) · 59d2f7ac
      Sudhakar Singh authored
      * Add unit tests for fused-novograd
      
      * Fix: tensors should reside on the same device
      
      * Fix: Cudastream should be called on the same device on which the tensors reside on. Found this during debugging fused novograd multi-device unit test
      
      * fixed issues mentioned in the comments
      59d2f7ac
  14. 21 Jan, 2021 1 commit
  15. 18 Jan, 2021 1 commit
  16. 31 Dec, 2020 2 commits
  17. 01 Dec, 2020 1 commit
  18. 05 Aug, 2020 1 commit
  19. 07 Jul, 2020 1 commit
  20. 23 Jun, 2020 3 commits
  21. 26 May, 2020 1 commit
  22. 14 May, 2020 1 commit
  23. 03 Sep, 2019 1 commit
    • Deyu Fu's avatar
      Fix issues in fused_dam (#469) · 7fa74925
      Deyu Fu authored
      * move import of amp_C to __init__()
      
      * make fp16/32 separate lists to support mixed param types, disable double test
      
      * make zero_grad consistent between adam/novograd/lamb
      7fa74925
  24. 17 Aug, 2019 1 commit
  25. 13 Aug, 2019 1 commit
    • Deyu Fu's avatar
      Reverse to Fused* naming, clean up accordingly: · 007c5947
      Deyu Fu authored
      FusedSGD now work as before
      FusedAdam now work with o1/o2, no longer fuse scaling and casting
      Removed special backend handling for FusedAdam
      Moved and updated test for FusedAdam into run_optimizers
      Removed legacy tests for optimizers.FP16_optimizer and FusedAdam in run_mixed_adam
      007c5947
  26. 12 Aug, 2019 1 commit