Commit 12a47a64 authored by alexeib's avatar alexeib Committed by Myle Ott
Browse files

fix flag copy paste (decoder-normalize-before)

parent f6a5a54e
...@@ -301,7 +301,7 @@ class TransformerDecoderLayer(nn.Module): ...@@ -301,7 +301,7 @@ class TransformerDecoderLayer(nn.Module):
) )
self.dropout = args.dropout self.dropout = args.dropout
self.relu_dropout = args.relu_dropout self.relu_dropout = args.relu_dropout
self.normalize_before = args.encoder_normalize_before self.normalize_before = args.decoder_normalize_before
self.encoder_attn = MultiheadAttention( self.encoder_attn = MultiheadAttention(
self.embed_dim, args.decoder_attention_heads, self.embed_dim, args.decoder_attention_heads,
dropout=args.attention_dropout, dropout=args.attention_dropout,
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment