BART - Fix attention mask device issue on copied models (#18540)
* attempt to fix attn mask device
* fix bart `_prepare_decoder_attention_mask`
  - add correct device
  - run `make fix-copies` to propagate the fix
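
For context, a minimal sketch of the kind of device mismatch this commit addresses. This is not the actual diff: the helper names `make_causal_mask`, `expand_padding_mask`, and `prepare_decoder_attention_mask` are hypothetical stand-ins, and the approach shown (building or moving every mask onto `inputs_embeds.device` before combining them) is an assumption about the shape of the fix based on the commit message.

```python
import torch


def make_causal_mask(tgt_len, dtype, device):
    # Hypothetical helper: additive causal mask of shape (1, 1, tgt_len, tgt_len).
    # Positions above the diagonal (future tokens) get a large negative value.
    mask = torch.full((tgt_len, tgt_len), torch.finfo(dtype).min, dtype=dtype, device=device)
    mask = torch.triu(mask, diagonal=1)
    return mask[None, None, :, :]


def expand_padding_mask(attention_mask, dtype, tgt_len):
    # Hypothetical helper: turn a (batch, src_len) mask with 1 = keep, 0 = pad
    # into an additive mask of shape (batch, 1, tgt_len, src_len).
    bsz, src_len = attention_mask.shape
    expanded = attention_mask[:, None, None, :].expand(bsz, 1, tgt_len, src_len).to(dtype)
    inverted = 1.0 - expanded
    return inverted.masked_fill(inverted.bool(), torch.finfo(dtype).min)


def prepare_decoder_attention_mask(attention_mask, inputs_embeds):
    # Assumption: the embeddings' device is the reference device, so both
    # masks are created on it or moved to it before they are added together.
    tgt_len = inputs_embeds.shape[1]
    device = inputs_embeds.device
    combined = make_causal_mask(tgt_len, inputs_embeds.dtype, device)
    if attention_mask is not None:
        expanded = expand_padding_mask(attention_mask, inputs_embeds.dtype, tgt_len).to(device)
        combined = expanded + combined
    return combined
```

Without the explicit device handling, a padding mask supplied on one device and embeddings living on another (for example when a model is split across several GPUs) would fail at the addition step. Since other models carry copies of the BART function, `make fix-copies` is what keeps those copies in sync with the corrected version.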