"vscode:/vscode.git/clone" did not exist on "023ce416c3a2f5dee9746421b0d6e152d00a3ee6"
Optimize model execution with CUDA graph (#1926)
Co-authored-by:Chen Shen <scv119@gmail.com> Co-authored-by:
Antoni Baum <antoni.baum@protonmail.com>
Showing
| ... | ... | @@ -12,3 +12,4 @@ fastapi |
| uvicorn[standard] | ||
| pydantic == 1.10.13 # Required for OpenAI server. | ||
| aioprometheus[starlette] | ||
| cupy-cuda12x # Required for CUDA graphs. CUDA 11.8 users should install cupy-cuda11x instead. # FIXME: Fix this in setup.py. |
Please register or sign in to comment