"scripts/git@developer.sourcefind.cn:Wenxuan/LightX2V.git" did not exist on "ec79c1453d11515aa89c92303f9b48ea1a65de24"
[Dev][Doc] Enhance Flash Attention Implementation in GQA Decoding Example and Fix Typo (#139)
- Add non-split flash attention macro for more flexible kernel generation - Implement `main_no_split` function to handle single-split scenarios - Modify kernel selection logic to dynamically choose between split and non-split implementations
Showing
Please register or sign in to comment