Correctly kill vLLM processes after finishing serving benchmarks (#21641)

Signed-off-by: Huy Do <huydhn@gmail.com>

Correctly kill vLLM processes after finishing serving benchmarks (#21641)
Signed-off-by: Huy Do <huydhn@gmail.com>
a55c9509 · Huy Do · GitHub · 97349fe2 · a55c9509
Unverified Commit a55c9509 authored Jul 25, 2025 by Huy Do Committed by GitHub Jul 25, 2025
Hide whitespace changes
Inline Side-by-side

Showing with 8 additions and 6 deletions

.buildkite/nightly-benchmarks/scripts/run-nightly-benchmarks.sh ...kite/nightly-benchmarks/scripts/run-nightly-benchmarks.sh +8 -6

No files found.
--- a/.buildkite/nightly-benchmarks/scripts/run-nightly-benchmarks.sh
+++ b/.buildkite/nightly-benchmarks/scripts/run-nightly-benchmarks.sh
@@ -95,12 +95,14 @@ json2args() {
 }
 kill_gpu_processes() {
-  pkill -f python
+  pkill -f '[p]ython'
-  pkill -f python3
+  pkill -f '[p]ython3'
-  pkill -f tritonserver
+  pkill -f '[t]ritonserver'
-  pkill -f pt_main_thread
+  pkill -f '[p]t_main_thread'
-  pkill -f text-generation
+  pkill -f '[t]ext-generation'
-  pkill -f lmdeploy
+  pkill -f '[l]mdeploy'
+  # vLLM now names the process with VLLM prefix after https://github.com/vllm-project/vllm/pull/21445
+  pkill -f '[V]LLM'
  while [ "$(nvidia-smi --query-gpu=memory.used --format=csv,noheader,nounits | head -n 1)" -ge 1000 ]; do
    sleep 1