Unverified Commit 53cd7f86 authored by Zhengxu Chen's avatar Zhengxu Chen Committed by GitHub
Browse files

[compile] Recompile graph module during Dynamo cache loading. (#30743)


Signed-off-by: default avatarZhengxu Chen <zhxchen17@fb.com>
parent 7b966ae2
......@@ -104,6 +104,7 @@ class VllmSerializableFunction(SerializableCallable):
state = pickle.loads(data)
fake_mode = FakeTensorMode(shape_env=ShapeEnv())
state["graph_module"] = GraphPickler.loads(state["graph_module"], fake_mode)
state["graph_module"].recompile()
state["example_inputs"] = GraphPickler.loads(state["example_inputs"], fake_mode)
vllm_backend = VllmBackend(get_current_vllm_config(), state["prefix"])
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment