    fix typo in gradient_checkpointing arg (#12855) · 5c673efa
    21jun authored
    The help text for `ModelArguments.gradient_checkpointing` should be
    "If True, use gradient checkpointing to save memory
    at the expense of slower backward pass."
    not "Whether to freeze the feature extractor layers of the model."
    (which was duplicated from the `freeze_feature_extractor` arg).
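
For illustration, the corrected fields might look roughly like the sketch below. Only the two argument names and the two help strings come from this commit message; the field types, the defaults, and the `duplicated_help` helper are assumptions made for the example and are not taken from `run_pretrain.py`.

```python
from dataclasses import dataclass, field, fields
from typing import Optional


@dataclass
class ModelArguments:
    # Types and defaults here are assumed for illustration only.
    freeze_feature_extractor: Optional[bool] = field(
        default=True,
        metadata={"help": "Whether to freeze the feature extractor layers of the model."},
    )
    gradient_checkpointing: Optional[bool] = field(
        default=False,
        metadata={
            "help": "If True, use gradient checkpointing to save memory "
            "at the expense of slower backward pass."
        },
    )


def duplicated_help(cls):
    """Hypothetical helper: return pairs of field names that share the same help text."""
    seen = {}
    duplicates = []
    for f in fields(cls):
        text = f.metadata.get("help")
        if text in seen:
            duplicates.append((seen[text], f.name))
        else:
            seen[text] = f.name
    return duplicates
```

A check like `duplicated_help(ModelArguments)` would have flagged the copy-pasted help string before this fix, and returns an empty list once each argument has its own description.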
run_pretrain.py 15.3 KB