• Ethan Perez's avatar
    Fixing unused weight_decay argument · 28ba345e
    Ethan Perez authored
    Currently the L2 regularization is hard-coded to "0.01", even though there is a --weight_decay flag implemented (that is unused). I'm making this flag control the weight decay used for fine-tuning in this script.
    28ba345e
run_openai_gpt.py 14 KB