Make handling of FP8 scales more consistent (#2666)
Change `fp8_quantize` so that we can pass around reciprocal scales everywhere, which means scales are always passed around in the checkpoint format.

I also noticed that when fbgemm is available, we ignore any input scale that we might have. Skip the fbgemm path if we already have a scale.
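The convention described above can be illustrated with a minimal pure-Python sketch. This is not the code from this commit: `quantize_sketch` and `dequantize_sketch` are hypothetical names, the sketch assumes that "checkpoint format" means the scale you multiply by to dequantize (the reciprocal of the quantization multiplier), and it only scales and clamps to the E4M3 range without rounding to real FP8 values.

```python
from typing import List, Optional, Tuple

# Largest finite magnitude of the float8 E4M3FN format.
FP8_E4M3_MAX = 448.0


def quantize_sketch(
    values: List[float], scale: Optional[float] = None
) -> Tuple[List[float], float]:
    """Scale and clamp values into the E4M3 range.

    `scale` is assumed to be in checkpoint format: original ~= quantized * scale.
    If a scale is supplied it is used as-is; otherwise one is derived from
    the data. No rounding to actual FP8 values is performed here.
    """
    if scale is None:
        # Derive the scale only when the caller did not provide one.
        amax = max(max((abs(v) for v in values), default=0.0), 1e-12)
        scale = amax / FP8_E4M3_MAX  # reciprocal of the quantization multiplier
    quantized = [
        min(max(v / scale, -FP8_E4M3_MAX), FP8_E4M3_MAX) for v in values
    ]
    return quantized, scale


def dequantize_sketch(quantized: List[float], scale: float) -> List[float]:
    """Recover approximate original values by multiplying with the scale."""
    return [q * scale for q in quantized]
```

Because the returned scale uses the same convention as an input scale, the result of one call can be stored or passed to another call without inverting it along the way.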