"...composable_kernel-1.git" did not exist on "fa2d894be1b3c0213da06d58af0df2de2c5308ad"
use ford/for instead of static_ford/static_for in threadwise copy, somehow...
use ford/for instead of static_ford/static_for in threadwise copy, somehow register spill is greatly reduced on AMD
Showing
Please register or sign in to comment