DPP opts for wavefront 32 (#951)
Checks the wavefront size, then selects the implementation and number of threads for the DPP reduce accordingly
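
A minimal sketch of the kind of selection described above, not the code from this commit. The names `reduce_config` and `select_reduce_config`, and the policy of keeping the number of wavefronts per block fixed, are assumptions made for illustration. The wavefront size is passed in as a plain parameter so the sketch builds without a GPU toolchain; on a HIP host it could be read from the `warpSize` field filled in by `hipGetDeviceProperties`.

```cpp
#include <cstddef>

// Hypothetical configuration for a wavefront-level (DPP-style) reduction.
struct reduce_config
{
    std::size_t lanes_per_wave; // lanes combined by one cross-lane reduction
    std::size_t block_size;     // threads launched per block
    std::size_t partials;       // per-wavefront partial results left to combine
};

// Assumed policy: launch the same number of wavefronts per block on every
// device, so the thread count scales with the wavefront size (32 or 64).
constexpr reduce_config select_reduce_config(std::size_t wavefront_size,
                                             std::size_t waves_per_block = 4)
{
    return {wavefront_size, wavefront_size * waves_per_block, waves_per_block};
}

// Number of doubling steps needed to reduce one wavefront:
// 5 for 32 lanes, 6 for 64 lanes.
constexpr std::size_t reduce_steps(std::size_t wavefront_size)
{
    return wavefront_size <= 1 ? 0 : 1 + reduce_steps(wavefront_size / 2);
}
```

Under these assumptions a wavefront-32 device gets a 128-thread block and a five-step cross-lane reduction, while a wavefront-64 device gets a 256-thread block and six steps; the real kernel may choose different values.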
src/targets/gpu/device/layernorm.cpp
100755 → 100644