Unverified Commit 2ec7d3f7 authored by Tri Dao, committed by GitHub

Merge pull request #105 from jamaliki/patch-1

Change default dropout value in documentation
parents ef085cfc 41cb9097
@@ -16,7 +16,7 @@ class FlashAttention(nn.Module):
(default: 1/sqrt(d_keys) where d_keys is computed at
runtime)
attention_dropout: The dropout rate to apply to the attention
- (default: 0.1)
+ (default: 0.0)
"""
def __init__(self, softmax_scale=None, attention_dropout=0.0, device=None, dtype=None):
super().__init__()
......
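The docstring in this hunk describes two defaults: a softmax scale of 1/sqrt(d_keys) computed at runtime, and an attention dropout rate that the signature sets to 0.0. The sketch below illustrates only that default-resolution behavior. `AttentionDefaultsSketch` and `resolve_scale` are hypothetical names introduced for illustration; they are not part of the FlashAttention code, which is an `nn.Module` whose body is not shown in the diff.

```python
import math
from dataclasses import dataclass
from typing import Optional


@dataclass
class AttentionDefaultsSketch:
    """Hypothetical stand-in mirroring only the two documented arguments."""

    softmax_scale: Optional[float] = None  # None means "compute at runtime"
    attention_dropout: float = 0.0  # same default as the signature in the diff

    def resolve_scale(self, d_keys: int) -> float:
        # Fall back to 1/sqrt(d_keys) when no explicit scale was given,
        # as the docstring describes.
        if self.softmax_scale is None:
            return 1.0 / math.sqrt(d_keys)
        return self.softmax_scale


defaults = AttentionDefaultsSketch()
scale_for_64 = defaults.resolve_scale(d_keys=64)
explicit_scale = AttentionDefaultsSketch(softmax_scale=0.5).resolve_scale(d_keys=64)
```

With a dropout rate of 0.0, no attention weights are dropped unless the caller opts in, which is why the documentation change matters: a reader trusting the old text would have assumed dropout was active by default.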