Optimize layer normalization for AMD GPUs (#66)
* Optimize fused layer normalization for MI100 * Optimize cuComputePartGradGammaBeta for AMD GPUs
Showing
Please register or sign in to comment
* Optimize fused layer normalization for MI100 * Optimize cuComputePartGradGammaBeta for AMD GPUs