Unverified Commit 0c9c0ba1 authored by Kirthi Shankar Sivamani, Committed by GitHub
Browse files

Enforce boolean attention mask type (#49)



* Enforce boolean attention mask type
Signed-off-by: Kirthi Shankar Sivamani <ksivamani@nvidia.com>

* fix tests
Signed-off-by: Kirthi Shankar Sivamani <ksivamani@nvidia.com>
Signed-off-by: Kirthi Shankar Sivamani <ksivamani@nvidia.com>
parent d6ff6f4d
......@@ -520,6 +520,11 @@ class MultiHeadAttention(torch.nn.Module):
"""MultiHeadAttention FWD"""
# hidden_states: [sq, b, h]
if attention_mask is not None:
assert (
attention_mask.dtype == torch.bool
), "Attention mask must be a boolean tensor"
# =================================================
# Pre-allocate memory for key-values for inference.
# =================================================
......@@ -1006,6 +1011,11 @@ class TransformerLayer(torch.nn.Module):
hidden_states = hidden_states.contiguous()
if attention_mask is not None:
assert (
attention_mask.dtype == torch.bool
), "Attention mask must be a boolean tensor"
# For AMP
if torch.is_autocast_enabled():
hidden_states = cast_if_needed(
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or sign in to comment