`FastLayerNorm` compat with `autocast` (#1203)
* Persistent LayerNorm: Multi-CTA Rewrite
* autocast support
Co-authored-by: Young-Jun Ko <youngjun.ko@gmail.com>