Unverified Commit e6e06030 authored by Charlene Yang, committed by GitHub

[PyTorch] Lower atol/rtol for F16 attention tests (#1157)



* reduce atol/rtol for F16 tests
Signed-off-by: Charlene Yang <8636796+cyanguwa@users.noreply.github.com>

* relax the tols for Ampere
Signed-off-by: Charlene Yang <8636796+cyanguwa@users.noreply.github.com>

---------
Signed-off-by: Charlene Yang <8636796+cyanguwa@users.noreply.github.com>
parent 2d57db8b
@@ -233,9 +233,9 @@ def test_dot_product_attention(
     """Test DotProductAttention module"""
     # Get configs
-    tols = dict(atol=5e-3, rtol=5e-3)
+    tols = dict(atol=1e-3, rtol=1e-3)
     if dtype == torch.bfloat16:
-        tols = dict(atol=2.5e-2, rtol=2.5e-2)
+        tols = dict(atol=1.5e-2, rtol=1.5e-2)
     config = model_configs[model]
     is_mla = config.head_dim_qk != config.head_dim_v
     if qkv_layout is None:
@@ -1035,7 +1035,7 @@ def test_transformer_layer(
     # Get configs
     config = model_configs[model]
-    tols = dict(atol=5e-1, rtol=5e-2)
+    tols = dict(atol=5e-2, rtol=5e-2)
     workspace_opt = True
     # Test backend availability
...
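For context, tolerance dicts like these are typically unpacked into a torch.testing.assert_close call when comparing a test output against a reference. Below is a minimal sketch of that pattern; the out/out_ref tensors and their shapes are assumptions for illustration, not taken from this commit, which only changes the tolerance values.

import torch

# Pick tolerances per dtype, mirroring the new values in this diff.
# bfloat16 stores only 7 mantissa bits (vs. 10 for float16), so its
# comparisons need looser tolerances than the fp16/default case.
dtype = torch.bfloat16
tols = dict(atol=1e-3, rtol=1e-3)
if dtype == torch.bfloat16:
    tols = dict(atol=1.5e-2, rtol=1.5e-2)

# Hypothetical stand-ins for a fused output and its reference implementation.
out = torch.randn(2, 8, 64, dtype=dtype)
out_ref = out + torch.randn_like(out) * 1e-4

# Raises an AssertionError if the tensors differ beyond atol/rtol.
torch.testing.assert_close(out, out_ref, **tols)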