examples/flash_attention/example_gqa_fwd_bshd.py · c2b9b59d72693c286b0dc023e8058b8654356d3e · OpenDAS / tilelang

[Dev] Add new example for FlashAttention with pipelined execution (#200) · c2b9b59d

Yu Cheng authored Mar 13, 2025

- Introduce `example_gqa_fwd_bshd_wgmma_pipelined.py` demonstrating a pipelined implementation of FlashAttention.
- Update sequence length parameter in existing example to 8192 and adjust number of stages for improved performance.
- Enhance argument parsing to accommodate new configurations for batch size, heads, and groups.

c2b9b59d

example_gqa_fwd_bshd.py 10.1 KB

Replace example_gqa_fwd_bshd.py