tests/pytorch/test_fusible_ops.py · 41909dc8695ba22ba8b04836b7cc0d086136dd06 · OpenDAS / TransformerEngine

[PyTorch] Linear op avoids saving input tensor if weight grad is not needed (#1817) · 41909dc8

Tim Moon authored May 28, 2025



* Linear op avoids saving input tensor if weight grad is not needed
Signed-off-by: Tim Moon <tmoon@nvidia.com>

* Linear op forward avoids producing quantized tensors with unnecessary usages
Signed-off-by: Tim Moon <tmoon@nvidia.com>

* Fix linter warnings
Signed-off-by: Tim Moon <tmoon@nvidia.com>

* Avoid unnecessary usages in fused linear ops
Signed-off-by: Tim Moon <tmoon@nvidia.com>

---------
Signed-off-by: Tim Moon <tmoon@nvidia.com>

41909dc8

test_fusible_ops.py 71 KB

Replace test_fusible_ops.py