-
Juan Acevedo authored
reverts accidental change that removes attn_mask in attn. Improves flux ptxla by using flash block sizes. Moves encoding outside the for loop. Co-authored-by:Juan Acevedo <jfacevedo@google.com>
6b9a3334
reverts accidental change that removes attn_mask in attn. Improves flux ptxla by using flash block sizes. Moves encoding outside the for loop.
Co-authored-by:
Juan Acevedo <jfacevedo@google.com>