Chao Liu authored
use ford/for instead of static_ford/static_for in threadwise copy; this somehow greatly reduces register spilling on AMD
bc9ea646
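For readers unfamiliar with the distinction the commit title draws, the sketch below contrasts a compile-time-unrolled loop with an ordinary runtime loop for a small element-wise copy. It is only an illustration: `static_for_sketch`, `copy_unrolled`, and `copy_looped` are hypothetical names written for this example and are not the repository's `static_for`, `static_ford`, or `ford` helpers, whose actual definitions are not shown in the commit. The sketch assumes C++17 and demonstrates only that both forms produce the same result; it does not measure register usage, which depends on the compiler and target.

```cpp
#include <array>
#include <cstddef>
#include <type_traits>
#include <utility>

// Hypothetical stand-in for a static_for-style helper (not the library's own):
// the body is called once per index, and each index is a compile-time
// constant, so the loop is fully unrolled at compile time.
template <std::size_t... Is, typename F>
constexpr void static_for_sketch_impl(std::index_sequence<Is...>, F&& f)
{
    (f(std::integral_constant<std::size_t, Is>{}), ...);
}

template <std::size_t N, typename F>
constexpr void static_for_sketch(F&& f)
{
    static_for_sketch_impl(std::make_index_sequence<N>{}, std::forward<F>(f));
}

// Copy written with the compile-time loop: N separate assignments are emitted.
template <typename T, std::size_t N>
void copy_unrolled(const std::array<T, N>& src, std::array<T, N>& dst)
{
    static_for_sketch<N>([&](auto i) { dst[i] = src[i]; });
}

// Copy written with an ordinary runtime loop: the compiler decides whether
// and how far to unroll it.
template <typename T, std::size_t N>
void copy_looped(const std::array<T, N>& src, std::array<T, N>& dst)
{
    for(std::size_t i = 0; i < N; ++i)
    {
        dst[i] = src[i];
    }
}
```

Both functions copy the same elements; the difference is who controls unrolling. In the first form the source code forces one instantiation of the body per index, while in the second the choice is left to the optimizer.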