vllm_serve.sh 116 Bytes
Newer Older
zzg_666's avatar
zzg_666 committed
1
vllm serve inference-net/Schematron-3B  --trust-remote-code --dtype bfloat16 -tp 1 --max-model-len 32768 --port 8010