Upgrade apex version, turn off legacy fusion (#205)
* update apex version to feb 5th commit
* use gradient clipping instead of max grad norm in tests
* add warning when user provides max_grad_norm
* update examples commit
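
As a rough illustration of the "add warning when user provides max_grad_norm" item, the check could look something like the sketch below. The function name `warn_if_max_grad_norm`, the config layout (`optimizer` / `params`), and the warning text are assumptions made for this example; they are not taken from the commit, and the actual change may be structured differently.

```python
import warnings


def warn_if_max_grad_norm(config):
    """Hypothetical helper: warn when max_grad_norm is set in optimizer params.

    The config layout (optimizer -> params -> max_grad_norm) is an assumption
    for illustration, not the confirmed DeepSpeed implementation.
    Returns True if a warning was issued, False otherwise.
    """
    params = config.get("optimizer", {}).get("params", {})
    if "max_grad_norm" in params:
        warnings.warn(
            "max_grad_norm was provided in the optimizer params; "
            "consider using gradient clipping in the config instead.",
            UserWarning,
        )
        return True
    return False


# Example: a config that still sets max_grad_norm triggers the warning.
legacy_config = {"optimizer": {"type": "Adam", "params": {"lr": 1e-3, "max_grad_norm": 1.0}}}
warn_if_max_grad_norm(legacy_config)
```

A config that relies only on a top-level clipping setting would pass through this check without a warning.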
deepspeed/pt/deepspeed_constants.py
100644 → 100755