"tests/vscode:/vscode.git/clone" did not exist on "42b6322c7dddc89e5e2b479d7a499bb48ad0e358"
faster histogram sum up (#418)
* some refactor. * two stage sum up to reduce sum up error. * add more two-stage sumup. * some refactor. * add alignment. * change name to aligned_allocator. * remove some useless sumup. * fix a warning. * add -march=native . * remove the padding of gradients. * no alignment. * fix test. * change KNumSumupGroup to 32768. * change gcc flags.
Showing
Please register or sign in to comment