"ts/webui/vscode:/vscode.git/clone" did not exist on "5df75c33cb543cb55e7a1616bfa2a4a3416243b8"
-
Yu Cheng authored
- Introduce `example_gqa_bwd.py` demonstrating the backward pass of FlashAttention with pipelined execution. - Implement forward and backward functions for FlashAttention, including preprocessing and postprocessing steps. - Enhance argument parsing for batch size, heads, context size, and dimensions. - Include a reference implementation for validation and performance benchmarking.
a55f3686