seems that solution 1 is still the fatest

i.e., expand bias using torch.repeat_interleave directly

seems that solution 1 is still the fatest
i.e., expand bias using torch.repeat_interleave directly
1dc7e73e · Jiezhong Qiu · c65039da · 1dc7e73e
Commit 1dc7e73e authored Feb 26, 2021 by Jiezhong Qiu
Hide whitespace changes
Inline Side-by-side

Showing with 5 additions and 5 deletions

fmoe/layers.py fmoe/layers.py +5 -5

No files found.
--- a/fmoe/layers.py
+++ b/fmoe/layers.py
@@ -70,13 +70,13 @@ class FMoELinear(nn.Module):
            # like MOELinear.apply(x, weight, bias, count)
            # Solution 1
-            # bias = torch.repeat_interleave(self.bias,
+            bias = torch.repeat_interleave(self.bias,
-            #        fwd_expert_count.to(self.bias.device), dim=0)
+                fwd_expert_count.to(self.bias.device), dim=0)
            # Solution 2
-            bias_idx = torch.arange(self.num_expert)\
+            # bias_idx = torch.arange(self.num_expert)\
-                .repeat_interleave(fwd_expert_count)
+            #     .repeat_interleave(fwd_expert_count)
-            bias = self.bias[bias_idx]
+            # bias = self.bias[bias_idx]
            # Solution 3
            # bias = []