"ts/webui/vscode:/vscode.git/clone" did not exist on "5df75c33cb543cb55e7a1616bfa2a4a3416243b8"
  • Yu Cheng's avatar
    [Dev] Add GQA backward example (#205) · a55f3686
    Yu Cheng authored
    - Introduce `example_gqa_bwd.py` demonstrating the backward pass of FlashAttention with pipelined execution.
    - Implement forward and backward functions for FlashAttention, including preprocessing and postprocessing steps.
    - Enhance argument parsing for batch size, heads, context size, and dimensions.
    - Include a reference implementation for validation and performance benchmarking.
    a55f3686
example_gqa_bwd.py 15.7 KB