Merge pull request #910 from szmigacz/smigacz/mha_xavier_init_gain_fix
Fixed weight init for fused weight matrices in fused MHA by adding correct gain factor
Showing
Please register or sign in to comment
Fixed weight init for fused weight matrices in fused MHA by adding correct gain factor