[Fix] Set max dynamic smem size for decoder MHA to support context length > 8k (#377)
* Fix crash when context window size is large by setting max dynamic smem size * fix linting
Showing
Please register or sign in to comment
* Fix crash when context window size is large by setting max dynamic smem size * fix linting