"src/include/blockwise_4d_tensor_op.cuh" did not exist on "adf4b173b30f463d56d111c42116e1d20e194cf4"
CGEMM examples bf16, fp32, int8 (#332)
* Add int8 specialization for elementwise Add and Subtract.
* CGEMM examples bf16, fp32, int8
* Add convert reference output to CDataType.
* Skip BF16 data type during testing.
* Lower K value to get rid of accumulation error.
* Fix merge artifact.
* Fix changed function name: GetElementSpaceSize()
* Fix merge artifact.
Co-authored-by:
Adam Osewski <aosewski@amd.com>
Showing
Please register or sign in to comment