Commit 5ea71ff4 (unverified) authored by Giancarlo Delfin, committed by GitHub

[V1] reduce block size for tree attention correctness test to fix 'ou… (#22207)


Signed-off-by: Giancarlo Delfin <gdelfin@meta.com>
parent 71758176
@@ -155,7 +155,7 @@ def test_tree_attn_correctness() -> None:
     dim_per_head = 128
     num_kv_heads = 2
-    block_size = 128
+    block_size = 32
     max_sequence_length = 8192
     randomize_blocks = True
     for batch_size in [1, 16, 32]:
...
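
For reviewers who want the arithmetic behind the one-line change, here is a minimal sketch. It assumes a conventional paged KV-cache layout in which a sequence occupies ceil(seq_len / block_size) blocks and each block stores keys and values for every KV head. The helper names `blocks_per_sequence` and `elements_per_block` are hypothetical and do not come from the test file. Only the constants are taken from the diff.

```python
# Hypothetical helpers (not part of the test file) illustrating how
# block_size interacts with the other constants in the hunk above.

def blocks_per_sequence(seq_len: int, block_size: int) -> int:
    """Blocks needed to hold seq_len tokens (integer ceiling division)."""
    return -(-seq_len // block_size)


def elements_per_block(block_size: int, num_kv_heads: int, dim_per_head: int) -> int:
    """Scalar entries in one block, assuming it stores both K and V."""
    return 2 * block_size * num_kv_heads * dim_per_head


if __name__ == "__main__":
    # Constants copied from the diff.
    max_sequence_length = 8192
    num_kv_heads = 2
    dim_per_head = 128

    for block_size in (128, 32):  # old value, new value
        n_blocks = blocks_per_sequence(max_sequence_length, block_size)
        per_block = elements_per_block(block_size, num_kv_heads, dim_per_head)
        print(f"block_size={block_size}: {n_blocks} blocks/sequence, "
              f"{per_block} elements/block")
```

Under these assumptions, a full-length sequence goes from 64 blocks to 256 blocks, while each block shrinks to a quarter of its former size. How this relates to the failure named in the truncated commit title is not stated in the source.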