Add support for wNa16 int 2:4 compressed-tensors checkpoints (#2758)
This change adds support for wNa16 int checkpoints with 2:4 sparsity using Marlin 2:4 kernels.
Showing
Please register or sign in to comment
This change adds support for wNa16 int checkpoints with 2:4 sparsity using Marlin 2:4 kernels.