Merge branch 'v0.7.2-dev-wm' into 'v0.7.2-dev'
[feat]添加VLLM_ENFORCE_EAGER_BS_THRESHOLD环境变量,支持cudagraph模式下,当bs大于阈值时,强制切换为eager模式,对大bs有效果 See merge request dcutoolkit/deeplearing/vllm!95
Showing
Please register or sign in to comment
[feat]添加VLLM_ENFORCE_EAGER_BS_THRESHOLD环境变量,支持cudagraph模式下,当bs大于阈值时,强制切换为eager模式,对大bs有效果 See merge request dcutoolkit/deeplearing/vllm!95