fix(l4): fix fp8 logic on l4 (#2277)
* fix(l4): fix fp8 logic on l4
* also quant weights with single scale
* use marlin even on 89