Explicit broadcast in image normalization for better performance (#6551)
With trivial model, it improves the data input pipeline throughput from 12.5K to 15K on a DGX1 V100 machine.
Showing
Please register or sign in to comment
With trivial model, it improves the data input pipeline throughput from 12.5K to 15K on a DGX1 V100 machine.