"...git@developer.sourcefind.cn:OpenDAS/lmdeploy.git" did not exist on "fe46dac2c2ea1a988929fba05e9d3d3c9b11dfd7"
[Feature] Add CUDA support for `min` and `max` reducer in heterogeneous API...
[Feature] Add CUDA support for `min` and `max` reducer in heterogeneous API for unary message functions (#3566) * CUDA support max/min reducer on forward pass * docstring * concised UpdateGradMinMax_hetero * reorganized UpdateGradMinMax_hetero * CUDA kernels for max/min reducer * variable name * lint check * changed CUDA 2D thread mapping to 1D * removed legacy cusparse for min/max reducer * git CI issue * restarting git CI * adding namespace std Co-authored-by:Israt Nisa <nisisrat@amazon.com> Co-authored-by:
Quan (Andy) Gan <coin2028@hotmail.com>
Showing
Please register or sign in to comment