The gfx928 architecture force to set VLLM_W8A8_BACKEND == 1 See merge request dcutoolkit/deeplearing/vllm!533
Attach a file by drag & drop or click to upload