"vllm/vscode:/vscode.git/clone" did not exist on "005ba458b52fe2cf9837201d05644eadcdf10ca0"
Fix cuda graph capture for grouped gemm (#1345)
* retain_graph=True for grouped gemm Signed-off-by:Xiaowei Ren <xren@nvidia.com> * remove an unnecessary retain_graph=True Signed-off-by:
Xiaowei Ren <xren@nvidia.com> * make retain_graph in graph capture configurable Signed-off-by:
Xiaowei Ren <xren@nvidia.com> * typo fix Signed-off-by:
Xiaowei Ren <xren@nvidia.com> --------- Signed-off-by:
Xiaowei Ren <xren@nvidia.com>
Showing
Please register or sign in to comment