Move `rp_bucket` to the device of `relative_attention_bias`
Otherwise, `rp_bucket` is always created on the CPU, and the lookup fails when `self.relative_attention_bias` is on a CUDA device.
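A minimal sketch of the idea, not the actual patch: the class name `RelativeBias`, its constructor arguments, and the clamp-based bucketing are hypothetical simplifications chosen for illustration. Only `rp_bucket` and `self.relative_attention_bias` come from the commit message. It assumes `relative_attention_bias` is an `nn.Embedding`, so the index tensor has to live on the same device as the embedding weight.

```python
import torch
from torch import nn


class RelativeBias(nn.Module):
    """Hypothetical, simplified module illustrating the device fix."""

    def __init__(self, num_buckets: int = 32, num_heads: int = 4):
        super().__init__()
        self.num_buckets = num_buckets
        # Assumption: the bias table is an embedding indexed by bucket id.
        self.relative_attention_bias = nn.Embedding(num_buckets, num_heads)

    def forward(self, qlen: int, klen: int) -> torch.Tensor:
        # torch.arange without a device argument allocates on the CPU.
        context_position = torch.arange(qlen, dtype=torch.long)[:, None]
        memory_position = torch.arange(klen, dtype=torch.long)[None, :]
        relative_position = memory_position - context_position

        # Simplified bucketing (assumption): clamp offsets into [0, num_buckets).
        half = self.num_buckets // 2
        rp_bucket = relative_position.clamp(-half, half - 1) + half

        # The fix: move the indices to wherever the embedding weights live.
        rp_bucket = rp_bucket.to(self.relative_attention_bias.weight.device)

        values = self.relative_attention_bias(rp_bucket)  # (qlen, klen, num_heads)
        return values.permute(2, 0, 1).unsqueeze(0)  # (1, num_heads, qlen, klen)


module = RelativeBias()
if torch.cuda.is_available():
    module = module.cuda()
bias = module(3, 5)
```

Without the `.to(...)` call, the lookup mixes a CPU index tensor with CUDA weights once the module has been moved to a GPU. On a CPU-only machine the call is a no-op.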