"vscode:/vscode.git/clone" did not exist on "1334f0eac2cfe84cd928b6dd1ff3ca53c7047588"
-
lexhuismans authored
* Add 3D attention mask to T5 model (#9643) Added code for 3D attention mask in T5 model. Similar to BERT model. * Add test for 3D attention mask Added test for 3D attention mask: test_decoder_model_past_with_3d_attn_mask() 3D attention mask of the shape [Batch_size, Seq_length, Seq_length] both for attention mask and decoder attention mask. Test is passing.
91cf2915