"vscode:/vscode.git/clone" did not exist on "e9843e1574defa9fe0286f85077e8d719021d79e"
    BatchNorm backward implementation (#461) · 44789d99
    Qianfeng authored
    * Implemented batchnorm-backward Blockwise and Multiblock kernels
    
    * Add batchnorm-backward device op
    
    * Add batchnorm-backward host-reference op
    
    * Add batchnorm-backward example
    
    * Parameters renaming in batchnorm backward kernels and device op
    
    * Change the example to loosen the threshold for ScaleDiff checking
    
    * Add comments to explain the implementation of batchnorm-backward
    
    * Parameters renaming again in batchnorm backward kernels
    
    * Improve the expression calculation for performance
    
    * Add batchnorm backward to README
    
    * Add comments to explain inv-variance in batchnorm forward and backward
    
    * Renaming the batchnorm forward training and inference examples
    
    * Add/update the comments for batchnorm-backward kernels
    
    * Renaming again
    
    * Add block_sync_lds between two consecutive blockwise reductions
    
    * Move common expression 1/N out of the static_for loops
    
    * Add dy_elementwise_op
    
    * Renaming in backward example again
    
    * Add checking for reduceDims in reference_batchnorm_backward
    
    * Update comments and code format
    
    * Rename in the comments
    
    * Remove common expression out of the loop in reference_batchnorm_backward_nhwc_c
    
    * Add block_sync_lds() between blockwise reduction again
    
    * Fix comments again
    
    * Remove int8 from batchnorm-forward instances since it is not needed for forward training and could fail the test
README.md 2.63 KB