"vscode:/vscode.git/clone" did not exist on "39a3c77e0d4a22de189b02398cf2d003d299b4ae"
Commit 30ac5858 authored by Matthew Yu, committed by Facebook GitHub Bot

set cache in recorded layers

Summary:
Pull Request resolved: https://github.com/facebookresearch/d2go/pull/433

Distillation uses a module called `CachedLayer` to record the outputs of a layer into an internal dict. This dict is typically initialized by the object itself, and its values are overwritten every time the model runs.

However, sometimes we need outputs from more than one run of the layer (e.g., domain adaptation: we run the model on real data, then on synthetic data, and need both sets of outputs).

This diff adds a helper to externally set the cache dict of a model. In other words, we can call `set_cache_dict` on a model to change the dict used by every `CachedLayer` in it. This lets us run the model and record some outputs, then swap the cache dict and rerun the model to save a different set of outputs.
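A minimal sketch of that flow, assuming `record_layers` wraps the named submodules in `CachedLayer` (as the test below exercises). `Toy` and the random inputs are invented for illustration; only `record_layers` and `set_cache_dict` are the real d2go helpers:

```python
import torch
import torch.nn as nn
from d2go.modeling.distillation import record_layers, set_cache_dict

# Toy model purely for illustration; any nn.Module works.
class Toy(nn.Module):
    def __init__(self):
        super().__init__()
        self.layer0 = nn.Linear(4, 4)

    def forward(self, x):
        return self.layer0(x)

model = Toy()

# Wrap "layer0" for recording; its outputs land in the returned dict.
real_cache = record_layers(model, {"layer0"})
model(torch.randn(1, 4))  # pass 1: e.g. real data

# Swap in a fresh dict so the second pass does not overwrite the first.
synthetic_cache = {}
set_cache_dict(model, synthetic_cache)
model(torch.randn(1, 4))  # pass 2: e.g. synthetic data

# real_cache["layer0"] and synthetic_cache["layer0"] now hold outputs
# from the two separate runs.
```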

Differential Revision: D40970577

fbshipit-source-id: 49cb851af49ae193d0c8ac9218e02fdaf4e6587b
parent 19c5392d
@@ -566,6 +566,13 @@ class CachedLayer(nn.Module):
         return output
 
 
+def set_cache_dict(model: nn.Module, cache: Dict) -> None:
+    """Sets the cache in all CachedLayers to input cache"""
+    for module in model.modules():
+        if isinstance(module, CachedLayer):
+            module.cache = cache
+
+
 def record_layers(model: nn.Module, layer_names: Set[str]) -> Dict[str, torch.Tensor]:
     """Save the outputs of layer_names in model
...
@@ -28,6 +28,7 @@ from d2go.modeling.distillation import (
     PseudoLabeler,
     record_layers,
     RelabelTargetInBatch,
+    set_cache_dict,
     unrecord_layers,
 )
 from d2go.registry.builtin import (
@@ -365,6 +366,20 @@ class TestDistillation(unittest.TestCase):
         self.assertEqual(output["add"], layer0_cache["l00"] + layer1_cache["l10"])
         self.assertEqual(output["div"], layer0_cache["l01"] / layer1_cache["l11"])
 
+    def test_set_cache_dict(self):
+        """Check we can swap the cache dict used when recording layers"""
+        model = AddLayers()
+        cache = record_layers(model, ["", "layer0", "layer1", "layer2"])
+        new_cache = {}
+        set_cache_dict(model, new_cache)
+        input = torch.Tensor([0])
+        output = model(input)
+        self.assertEqual(cache, {})
+        torch.testing.assert_close(new_cache["layer0"], torch.Tensor([1]))
+        torch.testing.assert_close(new_cache["layer1"], torch.Tensor([2]))
+        torch.testing.assert_close(new_cache["layer2"], torch.Tensor([3]))
+        torch.testing.assert_close(new_cache[""], output)
+
 class TestPseudoLabeler(unittest.TestCase):
     def test_noop(self):
...