Unverified commit 03fb8e79 authored by Daquan Lin, committed by GitHub

Update modeling_tf_longformer.py (#7359)

Correct a very small mistake in a shape comment.
parent 1ff5bd38
@@ -348,7 +348,7 @@ class TFLongformerSelfAttention(tf.keras.layers.Layer):
         # matrix multipication
         # bcxd: batch_size * num_heads x chunks x 2window_overlap x head_dim
         # bcyd: batch_size * num_heads x chunks x 2window_overlap x head_dim
-        # bcxy: batch_size * num_heads x chunks x 2window_overlap x window_overlap
+        # bcxy: batch_size * num_heads x chunks x 2window_overlap x 2window_overlap
         chunked_attention_scores = tf.einsum("bcxd,bcyd->bcxy", chunked_query, chunked_key)  # multiply
         # convert diagonals into columns
...
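The corrected comment follows directly from the einsum equation: "bcxd,bcyd->bcxy" contracts the head_dim axis (d) of two tensors whose third axis is 2*window_overlap, so both of the last two output axes (x and y) have size 2*window_overlap, not window_overlap. A minimal sketch illustrating this, with made-up sizes (not part of the patch):

# Assumed illustrative sizes; only the einsum equation comes from the patched code.
import tensorflow as tf

batch_times_heads, chunks, window_overlap, head_dim = 8, 4, 16, 64

chunked_query = tf.random.normal((batch_times_heads, chunks, 2 * window_overlap, head_dim))
chunked_key = tf.random.normal((batch_times_heads, chunks, 2 * window_overlap, head_dim))

# Contract over head_dim: output axes are (batch*heads, chunks, 2*window_overlap, 2*window_overlap).
chunked_attention_scores = tf.einsum("bcxd,bcyd->bcxy", chunked_query, chunked_key)

print(chunked_attention_scores.shape)  # (8, 4, 32, 32): both trailing axes are 2 * window_overlap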