Fix deprecated calls in multihead_attn and ninja build failure (#746)
* disable ninja for multihead_attn
* fix getCurrentStream in multihead_attn
Co-authored-by: pbialecki <pbialecki@nvidia.com>