[contrib] Fix the reference implementation of multihead_attn (#1423)
* follow the current signature Signed-off-by:Masaki Kozuki <mkozuki@nvidia.com> * call .backward on outputs Signed-off-by:
Masaki Kozuki <mkozuki@nvidia.com> * update the other caller of _softmax_backward_data Signed-off-by:
Masaki Kozuki <mkozuki@nvidia.com>
Showing
Please register or sign in to comment