"...composable_kernel_onnx.git" did not exist on "ad41aa0e7a0a3c3a5aeafb376518910310eccc57"
refactored deviceBatchedGemm; removed GridwiseBatchedGemm; added fp32 and int8 to profiler (#120)
changed long_index_t to index_t when computing memory offset uncomment other ops in profiler added test for batched_gemm
Showing
This diff is collapsed.
This diff is collapsed.
Please register or sign in to comment