Restruct gpu_memory_settings in a unify function and relax max_cuda_graph_bs (#10372)
Co-authored-by:Lianmin Zheng <lianminzheng@gmail.com> Co-authored-by:
sglang-bot <sglangbot@gmail.com>
Showing
Please register or sign in to comment
Co-authored-by:Lianmin Zheng <lianminzheng@gmail.com> Co-authored-by:
sglang-bot <sglangbot@gmail.com>