    Tong WU authored
    [BugFix] Refactor attention kernel to handle OOB positions by filling with `-inf` instead of clearing accumulators. (#1222)
    
    * Refactor attention kernel to handle OOB positions by filling with `-inf` instead of clearing accumulators.
    
    * lint
    
    * pre-commit
    
    * Update imports in the flash attention test file to use the new backward and forward examples for better clarity and consistency.
    0af3fd7c
example_gqa_bwd_tma_reduce.py 25.5 KB
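
For context, the fix described above relies on the fact that `exp(-inf) = 0`: if out-of-bounds (OOB) key positions are filled with `-inf` before the softmax, they receive zero attention weight and drop out of the accumulation naturally, so the accumulators never need to be cleared afterwards. Below is a minimal PyTorch sketch of that masking idea; the function name `masked_attention`, the tensor shapes, and the `valid_len` parameter are illustrative assumptions, not code taken from `example_gqa_bwd_tma_reduce.py`.

```python
import torch

def masked_attention(q, k, v, valid_len):
    # q: (heads, d); k, v: (heads, seq, d). Positions >= valid_len are OOB padding.
    scores = torch.einsum("hd,hsd->hs", q, k) / q.shape[-1] ** 0.5
    # Fill OOB positions with -inf so softmax assigns them zero weight:
    # exp(-inf) = 0 removes them from both the numerator and the normalizing
    # sum, so the output accumulator never has to be cleared or patched up.
    oob = torch.arange(k.shape[1]) >= valid_len
    scores = scores.masked_fill(oob, float("-inf"))
    probs = torch.softmax(scores, dim=-1)
    return torch.einsum("hs,hsd->hd", probs, v)

# Usage: only the first 100 of 128 key positions are valid.
q = torch.randn(4, 64)
k = torch.randn(4, 128, 64)
v = torch.randn(4, 128, 64)
out = masked_attention(q, k, v, valid_len=100)
```

Compared with zeroing accumulators after the fact, masking the scores keeps valid partial results intact and composes cleanly with online-softmax-style kernels, which is presumably why the commit takes this approach.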