".github/workflows/windows_release_package.yml" did not exist on "17636d083fd0d5fcc4a8566be4296ee3040ca4cb"
Unverified Commit 2ec7d3f7 authored by Tri Dao's avatar Tri Dao Committed by GitHub
Browse files

Merge pull request #105 from jamaliki/patch-1

Change default dropout value in documentation
parents ef085cfc 41cb9097
...@@ -16,7 +16,7 @@ class FlashAttention(nn.Module): ...@@ -16,7 +16,7 @@ class FlashAttention(nn.Module):
(default: 1/sqrt(d_keys) where d_keys is computed at (default: 1/sqrt(d_keys) where d_keys is computed at
runtime) runtime)
attention_dropout: The dropout rate to apply to the attention attention_dropout: The dropout rate to apply to the attention
(default: 0.1) (default: 0.0)
""" """
def __init__(self, softmax_scale=None, attention_dropout=0.0, device=None, dtype=None): def __init__(self, softmax_scale=None, attention_dropout=0.0, device=None, dtype=None):
super().__init__() super().__init__()
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment