refactored deviceBatchedGemm; removed GridwiseBatchedGemm; added fp32 and int8 to profiler (#120)
changed long_index_t to index_t when computing memory offset
uncomment other ops in profiler
added test for batched_gemm