[Fix] Adjust default chunked prefill size and cuda graph max bs according to...
[Fix] Adjust default chunked prefill size and cuda graph max bs according to GPU memory capacity (#2044)
Showing
Please register or sign in to comment
[Fix] Adjust default chunked prefill size and cuda graph max bs according to GPU memory capacity (#2044)