Merge branch 'v0.9.2-dev-fth' into 'v0.9.2-dev'
接入新的concat算子,包含decode和prefill,并根据size的不同进行选择 See merge request dcutoolkit/deeplearing/vllm!207
Showing
Please register or sign in to comment
接入新的concat算子,包含decode和prefill,并根据size的不同进行选择 See merge request dcutoolkit/deeplearing/vllm!207