Commit 52585019 authored by t11s, committed by GitHub

🚨 fix(SigLip): remove spurious exclusion of first vision output token (#30952)

fix(SigLip): remove spurious exclusion of first vision output token in classifier
parent 6a05f68f
@@ -1527,7 +1527,7 @@ class SiglipForImageClassification(SiglipPreTrainedModel):
         sequence_output = outputs[0]
         # average pool the patch tokens
-        sequence_output = torch.mean(sequence_output[:, 1:, :], dim=1)
+        sequence_output = torch.mean(sequence_output, dim=1)
         # apply classifier
         logits = self.classifier(sequence_output)
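To illustrate the effect of the change, here is a minimal sketch that compares the two pooling expressions on a toy tensor. It assumes, as the commit title implies, that the vision encoder output contains only patch tokens with no prepended class token, so the old `[:, 1:, :]` slice was dropping a real patch. The tensor sizes and the names `old_pooled` and `new_pooled` are hypothetical and are not part of the model code.

```python
import torch

# Hypothetical stand-in for outputs[0], shaped (batch, num_patches, hidden).
batch, num_patches, hidden = 2, 4, 3
sequence_output = torch.arange(
    batch * num_patches * hidden, dtype=torch.float32
).reshape(batch, num_patches, hidden)

# Before the fix: the first token is skipped, so only 3 of 4 patches are averaged.
old_pooled = torch.mean(sequence_output[:, 1:, :], dim=1)

# After the fix: every patch token contributes to the pooled representation.
new_pooled = torch.mean(sequence_output, dim=1)

# Both results have shape (batch, hidden); only the values differ.
print(old_pooled.shape, new_pooled.shape)
print(old_pooled[0], new_pooled[0])
```

The output shape is unchanged, so the classifier head still receives a `(batch, hidden)` tensor, but the pooled values shift. That is presumably why the commit carries the 🚨 marker: models fine-tuned with the old pooling may produce slightly different logits afterwards.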