Adding remove_caches API to Float8Tensor class (#1425)

* add remove_caches api

Signed-off-by: Youngeun Kwon <youngeunk@nvidia.com>

* Update transformer_engine/pytorch/tensor/float8_tensor.py

Co-authored-by: Tim Moon <4406448+timmoon10@users.noreply.github.com>
Signed-off-by: Youngeun Kwon <youngeunk@nvidia.com>

* explicit delete

Signed-off-by: Youngeun Kwon <youngeunk@nvidia.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: Youngeun Kwon <youngeunk@nvidia.com>
Co-authored-by: Tim Moon <4406448+timmoon10@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

Commit 94c92919, authored Feb 25, 2025 by Youngeun Kwon; committed by GitHub Feb 25, 2025

Showing 1 changed file with 8 additions and 0 deletions

transformer_engine/pytorch/tensor/float8_tensor.py +8 -0

--- a/transformer_engine/pytorch/tensor/float8_tensor.py
+++ b/transformer_engine/pytorch/tensor/float8_tensor.py
@@ -334,6 +334,14 @@ class Float8Tensor(Float8TensorBase, QuantizedTensor):
        """
        self._transpose_invalid = True

+    def remove_caches(self) -> None:
+        """
+        Remove transpose cache and mark it as invalid.
+        """
+        self._transpose_invalid = True
+        del self._transpose  # explicitly deletes the data for safety
+        self._transpose = None
+
    def clear(self):
        """Deallocate this tensor's memory. Typically not needed and must be used carefully."""
        self._data = torch.Tensor() if self._data is not None else None
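
Note: the pattern added by this commit (invalidate the flag, drop the cached object, reset the attribute to None) can be illustrated with a small pure-Python sketch. This is not the real Float8Tensor: `CachedTransposeSketch`, its `transpose` method, and the list-of-lists data are hypothetical stand-ins. Only the body of `remove_caches` mirrors the diff above. How the real class rebuilds its transpose cache is not shown in this commit, so the lazy rebuild here is an assumption.

```python
class CachedTransposeSketch:
    """Hypothetical stand-in for a tensor that lazily caches its transpose."""

    def __init__(self, rows):
        self._data = rows
        self._transpose = None          # cached transpose, built on demand
        self._transpose_invalid = True  # True means the cache must be rebuilt

    def transpose(self):
        # Assumed lazy rebuild: recompute only when the cache is missing or stale.
        if self._transpose_invalid or self._transpose is None:
            self._transpose = [list(col) for col in zip(*self._data)]
            self._transpose_invalid = False
        return self._transpose

    def remove_caches(self) -> None:
        # Same three steps as the method added in the diff.
        self._transpose_invalid = True
        del self._transpose      # drop this object's reference to the cached data
        self._transpose = None   # restore the attribute so later reads do not fail


t = CachedTransposeSketch([[1, 2, 3], [4, 5, 6]])
cached = t.transpose()           # builds and caches the transpose
t.remove_caches()                # cache is released and marked invalid
state_after_remove = (t._transpose, t._transpose_invalid)
rebuilt = t.transpose()          # rebuilt on the next request
```

Reassigning `self._transpose = None` after the `del` matters: without it, a later read of the attribute would raise `AttributeError`. Note also that `del` only removes this object's reference; the cached data is freed only once no other reference to it remains (in the sketch, `cached` still holds one).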