[Dev] Add GQA backward example (#205)
- Introduce `example_gqa_bwd.py` demonstrating the backward pass of FlashAttention with pipelined execution.
- Implement forward and backward functions for FlashAttention, including preprocessing and postprocessing steps.
- Enhance argument parsing for batch size, heads, context size, and dimensions.
- Include a reference implementation for validation and performance benchmarking.
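
For readers unfamiliar with what such a reference implementation checks, here is a minimal NumPy sketch of non-causal grouped-query attention (GQA) with a hand-written backward pass. It is not the code from `example_gqa_bwd.py`: the function names `gqa_forward` and `gqa_backward`, the `[batch, seq, heads, dim]` layout, and the `groups` parameter are assumptions made for illustration only.

```python
import numpy as np


def gqa_forward(q, k, v, groups):
    """Hypothetical reference GQA forward.

    q: [B, S, H, D]; k, v: [B, S, H // groups, D] (assumed layout).
    Returns the attention output and the softmax probabilities.
    """
    # Each KV head is shared by `groups` consecutive query heads.
    k_rep = np.repeat(k, groups, axis=2)
    v_rep = np.repeat(v, groups, axis=2)
    scale = 1.0 / np.sqrt(q.shape[-1])
    scores = np.einsum("bqhd,bkhd->bhqk", q, k_rep) * scale
    # Subtract the row maximum for a numerically stable softmax.
    scores -= scores.max(axis=-1, keepdims=True)
    probs = np.exp(scores)
    probs /= probs.sum(axis=-1, keepdims=True)
    out = np.einsum("bhqk,bkhd->bqhd", probs, v_rep)
    return out, probs


def gqa_backward(q, k, v, groups, d_out):
    """Hypothetical reference GQA backward; returns (d_q, d_k, d_v)."""
    _, probs = gqa_forward(q, k, v, groups)
    k_rep = np.repeat(k, groups, axis=2)
    v_rep = np.repeat(v, groups, axis=2)
    scale = 1.0 / np.sqrt(q.shape[-1])

    d_v_rep = np.einsum("bhqk,bqhd->bkhd", probs, d_out)
    d_probs = np.einsum("bqhd,bkhd->bhqk", d_out, v_rep)
    # Softmax backward: dS = P * (dP - rowsum(dP * P)).
    delta = (d_probs * probs).sum(axis=-1, keepdims=True)
    d_scores = probs * (d_probs - delta) * scale

    d_q = np.einsum("bhqk,bkhd->bqhd", d_scores, k_rep)
    d_k_rep = np.einsum("bhqk,bqhd->bkhd", d_scores, q)

    # Sum gradients over the query heads that share each KV head.
    b, s, h, d = d_k_rep.shape
    d_k = d_k_rep.reshape(b, s, h // groups, groups, d).sum(axis=3)
    d_v = d_v_rep.reshape(b, s, h // groups, groups, d).sum(axis=3)
    return d_q, d_k, d_v
```

The row-wise `delta` term equals the row sum of `d_out * out`, which is the kind of quantity a FlashAttention-style backward typically computes in a separate preprocessing step. A kernel's `d_q`, `d_k`, and `d_v` could then be compared against this sketch with `np.allclose` under a tolerance suited to the kernel's precision.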