    TF BART models - Add `cross_attentions` to model output and fix... · 38a716cd
    Daniel Stancl authored
    TF BART models - Add `cross_attentions` to model output and fix cross-attention head masking (#10699)
    
    * Add cross_attn_head_mask to BART
    
    * Fix cross_attentions in TFBart-like models
    
    * This commit enables returning of `cross_attentions`
    for TFBart-like models
    
    * It also fixes attention head masking in the cross-attention module
    
    * Update TF model templates
    
    * Fix missing , in TF model templates
    
    * Fix typo: congig -> config
test_modeling_tf_marian.py 17.2 KB