Commit fe7b314d authored by raojy's avatar raojy 💬
Browse files

Update README.md

parent d34e005b
......@@ -260,9 +260,8 @@ vllm serve Qwen/Qwen2.5-VL-72B-Instruct \
--served-model-name "qwen-vl" \
--tensor-parallel-size 4 \
--gpu-memory-utilization 0.95 \
--max-model-len 4096 \
--max-model-len 32768 \
--dtype bfloat16 \
--enforce-eager \
--trust-remote-code \
--port 8000
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment