Removed all stale backwards kernel code
Also match the memory layout of the gradient output to that of the input