fused qknorm+rope kernel optimization for SM9.0 (#37376)
Signed-off-by: EricccYang <yangyang4991@gmail.com>
Signed-off-by: Kaicheng Yang <53411596+EricccYang@users.noreply.github.com>
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
csrc/async_util.cuh
0 → 100644