Unverified Commit c4336b2b authored by Lianmin Zheng's avatar Lianmin Zheng Committed by GitHub
Browse files

Use custom allreduce w/ torch.compile (#2185)

parent 4d62bca5
......@@ -65,7 +65,8 @@ def patch_model(
_to_torch(model)
monkey_patch_vllm_all_gather()
backup_ca_comm = tp_group.ca_comm
tp_group.ca_comm = None
# Use custom-allreduce here
# tp_group.ca_comm = None
yield torch.compile(
torch.no_grad()(model.forward), mode="max-autotune-no-cudagraphs"
)
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment