Prevent Dynamo graph fragmentation in GPTNeoX with torch.baddbmm fix (#24941)
* Pass a Python scalar for alpha in torch.baddbmm
* fixup
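
A minimal sketch of the idea, not the actual GPTNeoX code: the helper name `attention_scores` and the tensor shapes are assumptions made for illustration. `torch.baddbmm` takes `alpha` as a number, so a 0-d tensor passed there has to be converted to a scalar first, which is presumably where Dynamo splits the graph. Computing the scale as a plain Python float avoids that conversion.

```python
import torch


def attention_scores(query, key, head_size):
    """Hypothetical helper: scaled Q @ K^T via torch.baddbmm.

    query: (batch * heads, q_len, head_size)
    key:   (batch * heads, k_len, head_size)
    """
    # Plain Python float, not a tensor, so no tensor-to-scalar
    # conversion is needed when it is passed as alpha.
    scale = float(head_size) ** -0.5

    # Zero bias with the shape of the output; beta=1.0 adds it unchanged.
    bias = torch.zeros(
        query.size(0), query.size(1), key.size(1),
        dtype=query.dtype, device=query.device,
    )
    return torch.baddbmm(bias, query, key.transpose(1, 2), beta=1.0, alpha=scale)


q = torch.randn(2, 3, 4)
k = torch.randn(2, 5, 4)
scores = attention_scores(q, k, head_size=4)
```

The result should match `torch.bmm(q, k.transpose(1, 2)) * scale`; only the type of `alpha` differs from a tensor-based variant.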
---------
Co-authored-by: Arthur Zucker <arthur.zucker@gmail.com>