Make `CanonicalizeGemmInput()` support non-TN layout FP8 GEMM on Blackwell with column-wise/transposed data (#2233)
Modified the `CanonicalizeGemmInput()` logic to fall back to column-wise data for FP8 GEMM on Blackwell when row-wise data is not available.
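
For illustration only, the fallback described above can be sketched in Python. This is not the code from this change: `GemmOperand` and `canonicalize_operand` are hypothetical names, and the sketch assumes that the column-wise buffer holds the transpose of the row-wise data, so choosing it means flipping the operand's transpose flag.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

Matrix = List[List[float]]


@dataclass
class GemmOperand:
    """Hypothetical stand-in for a quantized tensor with two possible layouts."""
    rowwise: Optional[Matrix] = None
    # Assumption: when present, this buffer stores the transpose of the row-wise data.
    columnwise: Optional[Matrix] = None


def canonicalize_operand(op: GemmOperand, transposed: bool) -> Tuple[Matrix, bool]:
    """Pick the buffer to hand to the GEMM and the transpose flag that goes with it."""
    if op.rowwise is not None:
        # Preferred path: use row-wise data with the requested flag unchanged.
        return op.rowwise, transposed
    if op.columnwise is not None:
        # Fallback path: the buffer is already transposed, so invert the flag.
        return op.columnwise, not transposed
    raise ValueError("operand has neither row-wise nor column-wise data")


# Usage: only column-wise (transposed) data is available for this operand.
only_columnwise = GemmOperand(columnwise=[[1.0, 3.0], [2.0, 4.0]])
data, flag = canonicalize_operand(only_columnwise, transposed=False)
```

In this sketch the caller asked for a non-transposed operand, and the fallback returns the transposed buffer together with `flag == True`, which describes the same logical matrix.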
Signed-off-by: Alp Dener <adener@nvidia.com>