Change how `take_along_axis` is computed in DeBERTa to stop confusing XLA (#18256)
* Change how `take_along_axis` is computed in DeBERTa to stop confusing XLA * Greatly simplify take_along_axis() since the code wasn't using most of it
Showing
Please register or sign in to comment