"git@developer.sourcefind.cn:OpenDAS/tilelang.git" did not exist on "e2bc1cb6257d78a90fa93c2e4703f060fc22cfea"
[Dev][Doc] Enhance Flash Attention Implementation in GQA Decoding Example and Fix Typo (#139)
- Add non-split flash attention macro for more flexible kernel generation - Implement `main_no_split` function to handle single-split scenarios - Modify kernel selection logic to dynamically choose between split and non-split implementations
Showing
Please register or sign in to comment