• rocking5566's avatar
    Improve normalization (#580) · 6a6163a3
    rocking5566 authored
    * Sync the order of type string with template parameter
    
    * Add more instances
    
    * Check the vector size and remove redundant var
    
    * Extract var to static, prepare to separate sweep once kernel
    
    * Separate sweeponce flow and optimize the flow
    
    * 1. Rename AccDatatype in normalization to computeData
    2. Rename AccElementwiseOperation to YElementwiseOperation in normalization
    
    * Remove useless code
    
    * Update naive variance kernel
    
    * Refine string
    
    * Fix typo
    
    * Support naive variance for device_normalization
    
    * Check the blocksize
    
    * Share the VGPR of x and y
    
    * Share the VGPR of gamma and beta
    
    * Add more instances
    
    * Support fp16 sqrt for experiment
    
    * Add CHANGELOG
    
    * Fix typo
    
    * clang-format
    6a6163a3
layernorm2d.cpp 6.59 KB