[Performance] Reduces OpenMP atomic additions with zero inputs (#1527)
* critical performance fix: reduce atomic additions
* training is up to 50% faster
* leave a TODO
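
A minimal sketch of the idea, assuming the hot loop is a scatter-add under `#pragma omp atomic`. The function name `ScatterAddSkipZero` and its parameters are hypothetical and are not the code changed in this commit. Adding zero leaves the destination unchanged, so the atomic update can be skipped for zero inputs:

```cpp
#include <cstddef>
#include <vector>

// Hypothetical helper (illustration only): accumulate src[i] into out[idx[i]].
// Several i may map to the same output slot, so the update must be atomic.
void ScatterAddSkipZero(const std::vector<float>& src,
                        const std::vector<std::size_t>& idx,
                        std::vector<float>& out) {
  const long n = static_cast<long>(src.size());
#pragma omp parallel for
  for (long i = 0; i < n; ++i) {
    const float v = src[i];
    // A zero contribution cannot change the sum, so skip the costly atomic.
    if (v == 0.0f) continue;
#pragma omp atomic
    out[idx[i]] += v;
  }
}
```

The saving depends on how many inputs are zero. When few are, the extra branch may cost more than it saves.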
Co-authored-by: Quan (Andy) Gan <coin2028@hotmail.com>