- 01 Dec, 2020 1 commit
Kexin Yu authored
DistributedFusedAdam Model Parallelism Support (Megatron)

Co-authored-by: Kexin Yu <kexiny@nvidia.com>
Co-authored-by: Kexin Yu <kexinznzn@gmail.com>
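The commit message shows no usage, so here is a minimal, hedged sketch of how DistributedFusedAdam is typically driven. The import path matches apex's contrib tree, but the constructor arguments are assumptions, and a real run needs an initialized `torch.distributed` process group (e.g. via `torchrun`):

```python
import torch
from apex.contrib.optimizers.distributed_fused_adam import DistributedFusedAdam

# Assumes dist.init_process_group("nccl") has already been called,
# e.g. by launching under torchrun.
model = torch.nn.Linear(1024, 1024).cuda()

# Hyperparameters here are placeholders; DistributedFusedAdam shards
# optimizer state across ranks, and this commit adds support for
# Megatron-style model-parallel process groups on top of that.
optimizer = DistributedFusedAdam(model.parameters(), lr=1e-4)

loss = model(torch.randn(8, 1024, device="cuda")).sum()
loss.backward()
optimizer.step()
```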
- 05 Aug, 2020 1 commit
ngimel authored
* add device guards to the optimizers
* add untracked file
* set deviceGuard in multi_tensor_apply
* address review comments; fix lamb
* indent
* typo
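A hedged sketch of the device-guard pattern the first bullet describes; this is illustrative, not apex's actual kernel-launch code. Switching the active CUDA device to each parameter's device before touching it keeps work from being launched on the wrong GPU when parameters span devices:

```python
import torch

def step_per_param(params_with_states, lr=0.01):
    """Illustrative pattern only: guard each update with the owning
    parameter's device, as the commit adds to apex's optimizers."""
    for p, state in params_with_states:
        # Without this guard, kernels could launch on the current device
        # even when p lives on a different GPU.
        with torch.cuda.device(p.device):
            p.data.add_(state["buf"], alpha=-lr)
```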
- 23 Jun, 2020 3 commits
- 14 May, 2020 1 commit
Andrew Tulloch authored
- 03 Sep, 2019 1 commit
Deyu Fu authored
* move import of amp_C to __init__()
* make fp16/32 separate lists to support mixed param types, disable double test
* make zero_grad consistent between adam/novograd/lamb
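A hedged sketch of the fp16/fp32 split described in the second bullet; `split_by_dtype` is a hypothetical helper, not apex's code. Bucketing parameters by dtype lets a fused multi-tensor kernel be launched once per homogeneous list, and rejecting float64 mirrors the disabled double test:

```python
import torch

def split_by_dtype(params):
    """Hypothetical helper: bucket parameters into fp16 and fp32 lists
    so fused kernels can be applied per dtype."""
    fp16, fp32 = [], []
    for p in params:
        if p.dtype == torch.float16:
            fp16.append(p)
        elif p.dtype == torch.float32:
            fp32.append(p)
        else:
            # double (float64) is not supported by the fused path
            raise RuntimeError(f"unsupported dtype: {p.dtype}")
    return fp16, fp32
```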
- 17 Aug, 2019 1 commit
Deyu Fu authored
- 13 Aug, 2019 1 commit
Deyu Fu authored
* FusedSGD now works as before
* FusedAdam now works with O1/O2; it no longer fuses scaling and casting
* Removed special backend handling for FusedAdam
* Moved and updated the test for FusedAdam into run_optimizers
* Removed legacy tests for optimizers.FP16_Optimizer and FusedAdam in run_mixed_adam
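A hedged usage sketch of FusedAdam under amp after this change. `amp.initialize`, `amp.scale_loss`, and `apex.optimizers.FusedAdam` are real apex entry points, but the model and hyperparameters are placeholders:

```python
import torch
from apex import amp
from apex.optimizers import FusedAdam

model = torch.nn.Linear(512, 512).cuda()
optimizer = FusedAdam(model.parameters(), lr=1e-3)

# After this commit, FusedAdam takes the standard amp path (scaling and
# casting are handled by amp rather than fused into the optimizer),
# so both O1 and O2 opt levels work:
model, optimizer = amp.initialize(model, optimizer, opt_level="O2")

loss = model(torch.randn(4, 512, device="cuda")).sum()
with amp.scale_loss(loss, optimizer) as scaled_loss:
    scaled_loss.backward()
optimizer.step()
```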
- 12 Aug, 2019 1 commit
Deyu Fu authored