"composable_kernel/include/utility/integral_constant.hpp" did not exist on "81497a93a0840d5a1b5e84c1e47a90ae39d0fee6"
[ft_attention] Fix for seqlen=8136 (#488)
When seqlen=8136, `smem_sz = 48840`, and apparently starting the kernel returns an `invalid argument` CUDA error. `48840 < 48 * 1024` but apparently it's still above the limit somehow..? Tested on A100
Showing
Please register or sign in to comment