Fix computation of attention_probs when head_mask is provided. (#9853)
* Fix computation of attention_probs when head_mask is provided. Signed-off-by:Morgan Funtowicz <funtowiczmo@gmail.com> * Apply changes to the template Co-authored-by:
Lysandre <lysandre.debut@reseau.eseo.fr>
Showing
Please register or sign in to comment