-
Phuong Nguyen authored
* added alignment requirements for CuBLAS heuristics Signed-off-by:
Phuong Nguyen <phuonguyen@nvidia.com> * minor rewords Signed-off-by:
Phuong Nguyen <phuonguyen@nvidia.com> * added unit test for gemm with unaligned inputs Signed-off-by:
Phuong Nguyen <phuonguyen@nvidia.com> * added pytest skip if fp8 is not available Signed-off-by:
Phuong Nguyen <phuonguyen@nvidia.com> * changed offset so that it has alignment with 128 Signed-off-by:
Phuong Nguyen <phuonguyen@nvidia.com> --------- Signed-off-by:
Phuong Nguyen <phuonguyen@nvidia.com>
01801633