Commit cb23d1b2 authored by Francesco Cariaggi, committed by GitHub

Pass missing arguments to `SeamlessM4Tv2ConformerEncoderLayer.forward()` when gradient checkpointing is enabled (#31945)

* pass missing arguments when gradient checkpointing is enabled for SeamlessM4Tv2

* fix same bug in SeamlessM4Tv1

* pass args, not kwargs
parent bc36c26f
@@ -842,6 +842,8 @@ class SeamlessM4TConformerEncoder(nn.Module):
                         hidden_states,
                         attention_mask,
                         relative_position_embeddings,
+                        output_attentions,
+                        conv_attention_mask,
                     )
                 else:
                     layer_outputs = layer(
...
@@ -788,6 +788,8 @@ class SeamlessM4Tv2ConformerEncoder(nn.Module):
                         layer.__call__,
                         hidden_states,
                         attention_mask,
+                        output_attentions,
+                        conv_attention_mask,
                     )
                 else:
                     layer_outputs = layer(
...
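
The effect of both hunks can be illustrated with a minimal, dependency-free sketch. Everything in it is hypothetical: `toy_checkpoint` and `ToyEncoderLayer` are stand-ins, not the transformers or torch code, and the default values in the layer signature are assumed for illustration. The wrapper forwards positional arguments only, mirroring the "pass args, not kwargs" note; that restriction is an assumption about the real checkpointing function, not something the diff shows.

```python
# Toy stand-ins only: not the transformers or torch implementation.

def toy_checkpoint(fn, *args):
    """Hypothetical checkpoint wrapper that forwards positional arguments only."""
    return fn(*args)


class ToyEncoderLayer:
    """Hypothetical layer; parameter order follows the v2 call in the diff.

    The default values are assumptions made for this sketch.
    """

    def __call__(self, hidden_states, attention_mask=None,
                 output_attentions=False, conv_attention_mask=None):
        # Report which optional inputs actually reached the layer.
        return {
            "output_attentions": output_attentions,
            "conv_attention_mask": conv_attention_mask,
        }


layer = ToyEncoderLayer()
hidden_states, attention_mask, conv_mask = [0.0], [1], [1]

# Before the fix: the last two arguments are never forwarded,
# so the layer silently falls back to its defaults.
before = toy_checkpoint(layer.__call__, hidden_states, attention_mask)

# After the fix: every argument is forwarded positionally, in signature order.
after = toy_checkpoint(layer.__call__, hidden_states, attention_mask, True, conv_mask)

print(before)
print(after)
```

In this sketch the "before" call raises no error, which suggests how such an omission can go unnoticed: the checkpointed branch simply behaves differently from the plain `layer(...)` branch.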