• R茅mi Louf's avatar
    load the pretrained weights for encoder-decoder · 1c71ecc8
    R茅mi Louf authored
    We currently save the pretrained_weights of the encoder and decoder in
    two separate directories `encoder` and `decoder`. However, for the
    `from_pretrained` function to operate with automodels we need to
    specify the type of model in the path to the weights.
    
    The path to the encoder/decoder weights is handled by the
    `PreTrainedEncoderDecoder` class in the `save_pretrained` function. Sice
    there is no easy way to infer the type of model that was initialized for
    the encoder and decoder we add a parameter `model_type` to the function.
    This is not an ideal solution as it is error prone, and the model type
    should be carried by the Model classes somehow.
    
    This is a temporary fix that should be changed before merging.
    1c71ecc8
run_summarization_finetuning.py 15.8 KB