vllm: export VLLM_CUSTOM_CACHE=1
dtk: export HIP_KERNEL_EVENT_SYSTENFENCE=1
2. KV cache supports fp8