Commit 450b64fe authored by Tri Dao

Add README section on issues

parent c0daa62e
...@@ -104,6 +104,16 @@ T4 GPUs are commonly used for inference, so we also measure speedup on the forwa
We see speedups between 2.5x-4.5x on the forward pass.
## When you encounter issues
This alpha release of FlashAttention contains code written for a research
project to validate ideas on speeding up attention.
We have tested it on several models (BERT, GPT2, ViT).
However, there might still be bugs in the implementation that we hope to iron
out in the next few months.
If you encounter any of these bugs, please open a GitHub Issue!
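
When reporting a bug, a minimal reproduction script that compares FlashAttention's output against a plain PyTorch attention reference is the most useful thing to attach. Below is a hedged sketch of such a script; it assumes the `flash_attn_func` entry point exposed by later releases of the package (the alpha release may expose a different interface) and uses made-up tensor shapes purely for illustration.

```python
# Hypothetical minimal reproduction: compare FlashAttention against a plain
# PyTorch attention reference. Assumes the `flash_attn_func` entry point from
# later releases; the alpha release may expose a different interface.
import torch
from flash_attn import flash_attn_func

batch, seqlen, nheads, headdim = 2, 512, 8, 64
q, k, v = (torch.randn(batch, seqlen, nheads, headdim,
                       device="cuda", dtype=torch.float16) for _ in range(3))

out_flash = flash_attn_func(q, k, v, causal=False)

# Reference attention in fp32, using the (batch, nheads, seqlen, headdim) layout
qf, kf, vf = (t.transpose(1, 2).float() for t in (q, k, v))
scores = qf @ kf.transpose(-2, -1) / headdim ** 0.5
out_ref = (scores.softmax(dim=-1) @ vf).transpose(1, 2).half()

# The maximum absolute difference is a useful number to include in the issue
print((out_flash - out_ref).abs().max().item())
```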
## Acknowledgments
Our implementation uses Apex's
[FMHA](https://github.com/NVIDIA/apex/tree/master/apex/contrib/csrc/fmha) code
...