[Model][Bugfix] Fix MiDashengLM audio encoder mask by removing incorrect `logical_not` (#25925)

Signed-off-by: zhoukz <me@zhoukz.com>

Unverified Commit 2e1b8bc2 authored Sep 30, 2025 by zhoukz Committed by GitHub Sep 30, 2025

Showing with 1 addition and 2 deletions

vllm/model_executor/models/midashenglm.py +1 -2

--- a/vllm/model_executor/models/midashenglm.py
+++ b/vllm/model_executor/models/midashenglm.py
@@ -426,8 +426,7 @@ class DashengAudioTransformer(nn.Module):
            assert x_length.ndim == 1, "Lengths are of size (B,)"
            scaled_lengths = (x_length / (self.hop_length * 4)).long()
            mask = self._to_mask(max_length=t, lengths=scaled_lengths)
-            split_masks = mask.logical_not().split(target_length_in_patches,
-                                                   dim=-1)
+            split_masks = mask.split(target_length_in_patches, dim=-1)
        else:
            mask = None
            split_masks = [None] * len(input_splits)
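
The effect of the removed `logical_not` can be illustrated with a small, dependency-free sketch. It is not the model's code: plain Python lists stand in for the boolean tensors, `to_mask` and `split_last_dim` are hypothetical stand-ins for `_to_mask` and `Tensor.split`, and the `hop_length`, `x_length`, and chunk values are made up. It assumes that `_to_mask` marks valid (non-padded) frames as `True` and that downstream code expects that convention, which is what the commit title suggests.

```python
# Dependency-free sketch: plain lists stand in for the boolean tensors in the diff.

def to_mask(max_length, lengths):
    """Hypothetical stand-in for _to_mask.

    Assumed convention: True marks a valid frame, False marks padding.
    """
    return [[i < n for i in range(max_length)] for n in lengths]


def split_last_dim(mask, chunk):
    """Mimic tensor.split(chunk, dim=-1): cut every row into chunk-sized pieces."""
    width = len(mask[0])
    return [[row[s:s + chunk] for row in mask] for s in range(0, width, chunk)]


hop_length = 160          # hypothetical value, for illustration only
x_length = [1920, 5120]   # hypothetical per-sample input lengths

# Same arithmetic as the diff; int() truncates like .long() does here.
scaled_lengths = [int(n / (hop_length * 4)) for n in x_length]

mask = to_mask(max_length=8, lengths=scaled_lengths)

# After the fix: the mask is split as-is, so True still means "valid frame".
split_masks = split_last_dim(mask, chunk=4)

# Before the fix: every chunk was negated, so True meant "padding" instead.
inverted = [[[not v for v in row] for row in part] for part in split_masks]

print(scaled_lengths)
print(split_masks[0][0], inverted[0][0])
```

Under these assumptions, the first sample keeps three valid frames in its first chunk after the fix, whereas the negated version flags only the padded frame, which is the opposite of what a consumer expecting a validity mask would need.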