Batched gemm and reduction (#156)
* adding batched_gemm_and_reduction
* batched_gemm_reduce works with batch_count=1
* fix a bug in grid_size; batched_gemm_reduce works for batch_count > 1
* adding profiler for batched_gemm_fp16
* fixed a bug in declaration of d1 and d0; both example and profiler work
* clang-format
* cleanup
* batched_gemm_reduce: add test
* minor change
* fixed some typos in function names