1) multihead_attn 2) xentropy 3) fused_adam and distributed_fused_adam