• Umang Yadav's avatar
    disable quant_dot test for CPU backend, it requires removing downcast converts... · b50d8b24
    Umang Yadav authored
    disable quant_dot test for CPU backend, it requires removing downcast converts in "ref"implementation. Inside the "ref" impelmentation, it downcasting one of the inputs of the GEMM from float to fp8. Which is a lossy conversion and while computing gemm, it upcasts it back to  float.  CPU backend removes converts completely since it is going from "float->fp8->float". due to lossy cast, results are coming out different.
    b50d8b24
main.cpp 3.14 KB