contrib/fmha: Add option to zero out tensors before math (#1322)

* extend api to allow forced memory zeroing (empty() does not do it)
* typo fix
* ctx change
* move zeroing flag to ctx
* update test

Co-authored-by: mchochowski <mchochowski@nvidia.com>
Co-authored-by: Masaki Kozuki <mkozuki@nvidia.com>
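
For context on the first bullet, here is a minimal sketch of why an explicit zeroing option is useful: `torch.empty()` only reserves memory and leaves its contents undefined, so a buffer that the kernel does not fully overwrite can carry stale values. The helper `allocate_output` and its `zero_tensors` parameter are hypothetical names used for illustration; they are not taken from the fmha code in this commit.

```python
import torch


def allocate_output(shape, zero_tensors=False, device="cpu"):
    """Hypothetical helper: allocate an output buffer, optionally zero-filled."""
    # torch.empty() reserves memory without initializing it, so the
    # contents are whatever happened to be in that memory before.
    out = torch.empty(shape, dtype=torch.float32, device=device)
    if zero_tensors:
        # Overwrite the buffer in place so no stale values can leak
        # into positions the later computation never writes to.
        out.zero_()
    return out


# With the flag set, every element is guaranteed to be zero.
buf = allocate_output((2, 4), zero_tensors=True)
print(bool((buf == 0).all()))
```

Leaving the flag off by default, as assumed here, keeps the cheaper uninitialized allocation for callers that overwrite the whole buffer anyway.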