You need to sign in or sign up before continuing.
spconv: filter DTK MaskImplicitGemm descriptors
Limit the DTK SIMT MaskImplicitGemm forward descriptor set for the MV2DFusion VVM/SubM fp32 shape family on sm_93. For kv=27, input channels=128, and output channels=64, keep only the SIMT descriptor validated by dense-oracle replay. Apply the same selection rule in both the Python tuner path and generated C++ tuner path to keep build-time and runtime behavior aligned.
Showing
Please register or sign in to comment