Antoine Adam authored
According to the `setup.py` file, the only dependencies are torch and einops. However, the `bert_padding.py` file requires `numpy` solely to multiply the elements of a `torch.Size` object. This change allows FlashAttention to be used without numpy.
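A dependency-free replacement could look like the sketch below (an illustration of the idea, not necessarily the exact change made in `bert_padding.py`). Since `torch.Size` is a subclass of `tuple`, the standard-library `math.prod` computes the same product that `numpy.prod` would:

```python
import math

# torch.Size is a tuple subclass, so a plain tuple stands in for it here,
# e.g. the shape of torch.empty(2, 3, 4).
shape = (2, 3, 4)

# numpy-based version being removed:   np.prod(shape)
# dependency-free replacement:
numel = math.prod(shape)
print(numel)  # 24
```

`math.prod` is available from Python 3.8 onward; on older versions, `functools.reduce(operator.mul, shape, 1)` achieves the same result without numpy.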
4e38df05