[GDN] Fuse b.sigmoid(), fused_gdn_gating and unsqueeze into one kernel: up to...
[GDN] Fuse b.sigmoid(), fused_gdn_gating and unsqueeze into one kernel: up to 0.85% e2e speedup (#12508)
Showing
Please register or sign in to comment
[GDN] Fuse b.sigmoid(), fused_gdn_gating and unsqueeze into one kernel: up to 0.85% e2e speedup (#12508)