use ford/for instead of static_ford/static_for in threadwise copy, somehow...
use ford/for instead of static_ford/static_for in threadwise copy, somehow register spill is greatly reduced on AMD
Showing
Please register or sign in to comment
use ford/for instead of static_ford/static_for in threadwise copy, somehow register spill is greatly reduced on AMD