fix for negative learning rate with warmup_linear in BertAdam (happens when...
fix for negative learning rate with warmup_linear in BertAdam (happens when t_total is specified incorrectly) + copied BERT optimization warmup functions to OpenAI optimization file + added comments
Showing
Please register or sign in to comment