"...git@developer.sourcefind.cn:kecinstone/2024-pra-vllm.git" did not exist on "0a11a2e5ca764af37254fc962e5e6d35295d499b"
Fix cuda graph capture for grouped gemm (#1345)
* retain_graph=True for grouped gemm Signed-off-by:Xiaowei Ren <xren@nvidia.com> * remove an unnecessary retain_graph=True Signed-off-by:
Xiaowei Ren <xren@nvidia.com> * make retain_graph in graph capture configurable Signed-off-by:
Xiaowei Ren <xren@nvidia.com> * typo fix Signed-off-by:
Xiaowei Ren <xren@nvidia.com> --------- Signed-off-by:
Xiaowei Ren <xren@nvidia.com>
Showing
Please register or sign in to comment