Unverified Commit a55c9509 authored by Huy Do's avatar Huy Do Committed by GitHub
Browse files

Correctly kill vLLM processes after finishing serving benchmarks (#21641)


Signed-off-by: default avatarHuy Do <huydhn@gmail.com>
parent 97349fe2
...@@ -95,12 +95,14 @@ json2args() { ...@@ -95,12 +95,14 @@ json2args() {
} }
kill_gpu_processes() { kill_gpu_processes() {
pkill -f python pkill -f '[p]ython'
pkill -f python3 pkill -f '[p]ython3'
pkill -f tritonserver pkill -f '[t]ritonserver'
pkill -f pt_main_thread pkill -f '[p]t_main_thread'
pkill -f text-generation pkill -f '[t]ext-generation'
pkill -f lmdeploy pkill -f '[l]mdeploy'
# vLLM now names the process with VLLM prefix after https://github.com/vllm-project/vllm/pull/21445
pkill -f '[V]LLM'
while [ "$(nvidia-smi --query-gpu=memory.used --format=csv,noheader,nounits | head -n 1)" -ge 1000 ]; do while [ "$(nvidia-smi --query-gpu=memory.used --format=csv,noheader,nounits | head -n 1)" -ge 1000 ]; do
sleep 1 sleep 1
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment