Commit 792e5d5b authored by Zhengju Tang, committed by GitHub

[Feature] Add GQA backward kernel with varlen input (#1082)

* [Feature] Add GQA backward kernel with varlen input

* [Lint]

* [BugFix] Freeze the memory order of all atomic_add operations

* [Lint]

* [Lint]

* [BugFix] Use release order to boost performance
parent bb8b3cd7
@@ -8,8 +8,6 @@ from einops import rearrange, repeat
from tilelang.profiler import do_bench
from varlen_utils import generate_random_padding_mask, generate_qkv
tilelang.disable_cache()
def attention_ref(
q,
......