[v1] Add real sliding window calculation to FlexAttention direct BlockMask building (#26015)
Signed-off-by:Isotr0py <mozf@mail2.sysu.edu.cn> Signed-off-by:
baonudesifeizhai <baonudesifeizhai@gmail.com> Co-authored-by:
baonudesifeizhai <baonudesifeizhai@gmail.com>
Showing
Please register or sign in to comment