- 07 Dec, 2023 4 commits
-
-
Umang Yadav authored
-
Umang Yadav authored
-
Umang Yadav authored
-
Umang Yadav authored
-
- 06 Dec, 2023 19 commits
-
-
Umang Yadav authored
-
Umang Yadav authored
-
Umang Yadav authored
-
Umang Yadav authored
-
Umang Yadav authored
-
Umang Yadav authored
-
Umang Yadav authored
-
Paul Fultz II authored
-
Paul Fultz II authored
-
Umang Yadav authored
-
Paul Fultz II authored
-
turneram authored
-
Umang Yadav authored
-
Umang Yadav authored
-
Umang Yadav authored
-
Umang Yadav authored
-
Umang Yadav authored
-
Umang Yadav authored
-
Umang Yadav authored
-
- 05 Dec, 2023 17 commits
-
-
Umang Yadav authored
-
Umang Yadav authored
-
Umang Yadav authored
-
Umang Yadav authored
-
Umang Yadav authored
-
Umang Yadav authored
-
Umang Yadav authored
-
Umang Yadav authored
-
Umang Yadav authored
-
Umang Yadav authored
-
Umang Yadav authored
Disable the quant_dot test for the CPU backend; it requires removing the downcast converts in the "ref" implementation. The "ref" implementation downcasts one of the GEMM inputs from float to fp8, which is a lossy conversion, and upcasts it back to float when computing the GEMM. The CPU backend removes the converts completely, since the chain is "float->fp8->float". Because of the lossy cast, the results come out different.
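The mismatch described in this commit message can be reproduced in miniature. The sketch below is only an illustration, not the project's code: `round_to_mantissa_bits` is a hypothetical helper that approximates an fp8 cast by keeping 3 explicit mantissa bits (an assumption; it ignores fp8 exponent range, saturation, and special values), and `dot` stands in for the GEMM.

```python
import math

def round_to_mantissa_bits(x: float, bits: int = 3) -> float:
    """Hypothetical stand-in for a float->fp8->float round trip:
    keep `bits` explicit mantissa bits and ignore exponent limits."""
    if x == 0.0 or not math.isfinite(x):
        return x
    m, e = math.frexp(x)           # x == m * 2**e with 0.5 <= |m| < 1
    scale = 2 ** (bits + 1)        # implicit leading bit plus explicit bits
    return math.ldexp(round(m * scale) / scale, e)

def dot(a, b):
    """Plain float dot product, standing in for the GEMM."""
    return sum(x * y for x, y in zip(a, b))

a = [0.1, 0.2, 0.3, 0.4]
b = [1.0, 2.0, 3.0, 4.0]

# Path that keeps the converts: one input is downcast (lossy) and upcast again.
with_converts = dot([round_to_mantissa_bits(x) for x in a], b)

# Path where the float->fp8->float converts were removed entirely.
without_converts = dot(a, b)

print(with_converts, without_converts)
```

The two results differ because the round trip is not an identity on float values, so a backend that drops the converts is no longer computing the same function as one that keeps them.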
-
Umang Yadav authored
-
Umang Yadav authored
-
Umang Yadav authored
-
Umang Yadav authored
-
Umang Yadav authored
-
Umang Yadav authored
-