[PyTorch] Fix bug in micro batched inference with rotary embeddings (#536)
[fix] fixed micro batched inference with RoPE

Signed-off-by: Fabian Joswig <fabian.joswig@deepl.com>
Co-authored-by: cyanguwa <8636796+cyanguwa@users.noreply.github.com>