"docs/git@developer.sourcefind.cn:OpenDAS/nni.git" did not exist on "3d4f122a8991158e5a023b4f3b0bb948e3b5abc8"
[Dev] Add GQA backward example (#205)
- Introduce `example_gqa_bwd.py` demonstrating the backward pass of FlashAttention with pipelined execution. - Implement forward and backward functions for FlashAttention, including preprocessing and postprocessing steps. - Enhance argument parsing for batch size, heads, context size, and dimensions. - Include a reference implementation for validation and performance benchmarking.
Showing
Please register or sign in to comment