* fix attention mask * fix slow test * refactor attn masks * fix fp16 generate test
Attach a file by drag & drop or click to upload