Unverified Commit 811a9381 authored by drbh's avatar drbh Committed by GitHub
Browse files

feat: sort cuda graphs in descending order (#2104)

parent 197c47a3
......@@ -15,6 +15,13 @@ if cuda_graphs is not None:
else:
cuda_graphs = None
# sorting the cuda graphs in descending order helps reduce the
# memory impact and results in less memory usage
if cuda_graphs is not None:
cuda_graphs.sort(reverse=True)
CUDA_GRAPHS = cuda_graphs
# This is overridden at model loading.
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment