"docs/source/en/main_classes/trainer.md" did not exist on "d61d7476275a6f31fcf68df334c68ac6bf739166"
  • Daniel Stancl's avatar
    Add separated decoder_head_mask for T5 Models (#9634) · 2ebbbf55
    Daniel Stancl authored
    * Add decoder_head_mask for PyTorch T5 model
    
    * Add decoder_head_mask args into T5Model and T5ForConditionalGeneration
    
    * Slightly change the order of input args to be in accordance
    with the convention from BART-based models introduced within the PR #9569.
    
    * Make style for modeling_t5.py
    
    * Add decoder_head_mask for TF T5 models
    
    * Separate head_mask and decoder_head_mask args in TF T5 models
    
    * Slightly change the order of input args to follow convention
    of BART-based models updated in PR #9569
    
    * Update test_forward_signature tests/test_modeling_tf_common.py
    w.r.t. the changed order of input args
    
    * Add FutureWarnings for T5 and TFT5 models
    
    * Add FutureWarnings for T5 and TFT5 models warning a user that
    input argument `head_mask` was split into two arguments -
    `head_mask` and `decoder_head_mask`
    
    * Add default behaviour - `decoder_head_mask` is set to copy
    `head_mask`
    
    * Fix T5 modeling and FutureWarning
    
    * Make proper usage of head_mask and decoder_head_mask
    in cross_attention
    
    * Fix conditions for raising FutureWarning
    
    * Reformat FutureWarning in T5 modeling
    
    * Refactor the warning message
    2ebbbf55
test_modeling_tf_common.py 53.9 KB