Merge branch 'v0.15.1-dev_yql_3.18' into 'v0.15.1-dev'
x接入mla_cat算子仅在nmz和kvcache-fp8情况下生效,默认关闭,开启需要export VLLM_USE_CAT_MLA=1 See merge request dcutoolkit/deeplearing/vllm!513
Showing
Please register or sign in to comment
x接入mla_cat算子仅在nmz和kvcache-fp8情况下生效,默认关闭,开启需要export VLLM_USE_CAT_MLA=1 See merge request dcutoolkit/deeplearing/vllm!513