Error when running vllm serve
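Following the documentation at https://sourcefind.cn/#/model-zoo/1885899210378838017, I pulled the image with docker and ran inference directly. The model I downloaded is the Qwen-14B variant (DeepSeek-R1-Distill-Qwen-14B).
Running vllm serve /home/DeepSeek-R1-Distill-Qwen-14B/ --tensor-parallel-size 2 --max-model-len 32768 --enforce-eager fails with the following error: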
RuntimeError: HIP error: invalid device function
HIP kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.
For debugging consider passing AMD_SERIALIZE_KERNEL=3.
Compile with TORCH_USE_HIP_DSA to enable device-side assertions.
How can I resolve this? Thanks. The hardware is a DCU Z100.
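
The error text itself suggests passing AMD_SERIALIZE_KERNEL=3 for debugging. A minimal sketch of how the same command could be re-run with that variable set, so the stack trace is more likely to point at the kernel that actually fails. The helper name build_debug_invocation is hypothetical and used only for illustration. This is not expected to fix the error, only to make the report more precise.

```python
import os
import shlex

# The exact command from the report, split into arguments.
VLLM_CMD = [
    "vllm", "serve", "/home/DeepSeek-R1-Distill-Qwen-14B/",
    "--tensor-parallel-size", "2",
    "--max-model-len", "32768",
    "--enforce-eager",
]


def build_debug_invocation(base_env=None):
    """Return (command, environment) with AMD_SERIALIZE_KERNEL=3 set.

    Hypothetical helper: it only prepares the invocation and does not
    launch anything by itself.
    """
    env = dict(os.environ if base_env is None else base_env)
    # Debug setting recommended by the HIP error message.
    env["AMD_SERIALIZE_KERNEL"] = "3"
    return list(VLLM_CMD), env


cmd, env = build_debug_invocation()
# Print the equivalent shell line so it can be copied into a terminal.
print("AMD_SERIALIZE_KERNEL=" + env["AMD_SERIALIZE_KERNEL"], shlex.join(cmd))

# To launch it from Python instead (inside the container from the docs):
#   import subprocess
#   subprocess.run(cmd, env=env, check=False)
```

Attaching the full traceback from a run like this, together with the image tag used, would probably make the problem easier to diagnose.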