"...git@developer.sourcefind.cn:chenpangpang/transformers.git" did not exist on "27b402cab0a27f2a57067ce8aa6b3e35fc48612e"
Unverified Commit 8581fbaa authored by Iulian Taiatu's avatar Iulian Taiatu Committed by GitHub
Browse files

changed "ot" to "to" (#21488)

parent fa0ae179
...@@ -176,7 +176,7 @@ class AdamWeightDecay(Adam): ...@@ -176,7 +176,7 @@ class AdamWeightDecay(Adam):
with the m and v parameters in strange ways as shown in [Decoupled Weight Decay with the m and v parameters in strange ways as shown in [Decoupled Weight Decay
Regularization](https://arxiv.org/abs/1711.05101). Regularization](https://arxiv.org/abs/1711.05101).
Instead we want ot decay the weights in a manner that doesn't interact with the m/v parameters. This is equivalent Instead we want to decay the weights in a manner that doesn't interact with the m/v parameters. This is equivalent
to adding the square of the weights to the loss with plain (non-momentum) SGD. to adding the square of the weights to the loss with plain (non-momentum) SGD.
Args: Args:
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment