@@ -139,7 +140,7 @@ class SwinMaskedImageModelingOutput(ModelOutput):
...
@@ -139,7 +140,7 @@ class SwinMaskedImageModelingOutput(ModelOutput):
Args:
Args:
loss (`torch.FloatTensor` of shape `(1,)`, *optional*, returned when `bool_masked_pos` is provided):
loss (`torch.FloatTensor` of shape `(1,)`, *optional*, returned when `bool_masked_pos` is provided):
Masked image modeling (MLM) loss.
Masked image modeling (MLM) loss.
logits (`torch.FloatTensor` of shape `(batch_size, num_channels, height, width)`):
reconstruction (`torch.FloatTensor` of shape `(batch_size, num_channels, height, width)`):
Reconstructed pixel values.
Reconstructed pixel values.
hidden_states (`tuple(torch.FloatTensor)`, *optional*, returned when `output_hidden_states=True` is passed or when `config.output_hidden_states=True`):
hidden_states (`tuple(torch.FloatTensor)`, *optional*, returned when `output_hidden_states=True` is passed or when `config.output_hidden_states=True`):
Tuple of `torch.FloatTensor` (one for the output of the embeddings + one for the output of each stage) of
Tuple of `torch.FloatTensor` (one for the output of the embeddings + one for the output of each stage) of
...
@@ -161,11 +162,20 @@ class SwinMaskedImageModelingOutput(ModelOutput):
...
@@ -161,11 +162,20 @@ class SwinMaskedImageModelingOutput(ModelOutput):
@@ -142,7 +143,7 @@ class Swinv2MaskedImageModelingOutput(ModelOutput):
...
@@ -142,7 +143,7 @@ class Swinv2MaskedImageModelingOutput(ModelOutput):
Args:
Args:
loss (`torch.FloatTensor` of shape `(1,)`, *optional*, returned when `bool_masked_pos` is provided):
loss (`torch.FloatTensor` of shape `(1,)`, *optional*, returned when `bool_masked_pos` is provided):
Masked image modeling (MLM) loss.
Masked image modeling (MLM) loss.
logits (`torch.FloatTensor` of shape `(batch_size, num_channels, height, width)`):
reconstruction (`torch.FloatTensor` of shape `(batch_size, num_channels, height, width)`):
Reconstructed pixel values.
Reconstructed pixel values.
hidden_states (`tuple(torch.FloatTensor)`, *optional*, returned when `output_hidden_states=True` is passed or when `config.output_hidden_states=True`):
hidden_states (`tuple(torch.FloatTensor)`, *optional*, returned when `output_hidden_states=True` is passed or when `config.output_hidden_states=True`):
Tuple of `torch.FloatTensor` (one for the output of the embeddings + one for the output of each stage) of
Tuple of `torch.FloatTensor` (one for the output of the embeddings + one for the output of each stage) of
...
@@ -164,11 +165,20 @@ class Swinv2MaskedImageModelingOutput(ModelOutput):
...
@@ -164,11 +165,20 @@ class Swinv2MaskedImageModelingOutput(ModelOutput):