Document multi-proc method selection for profiling (#23802)

Signed-off-by: jdebache <jdebache@nvidia.com>

Document multi-proc method selection for profiling (#23802)
Signed-off-by: jdebache <jdebache@nvidia.com>
41c80698 · Julien Debache · GitHub · 7c8271cd · 41c80698
Unverified Commit 41c80698 authored Sep 01, 2025 by Julien Debache Committed by GitHub Sep 01, 2025
Show whitespace changes
Inline Side-by-side

Showing with 2 additions and 0 deletions

docs/contributing/profiling.md docs/contributing/profiling.md +2 -0

No files found.
--- a/docs/contributing/profiling.md
+++ b/docs/contributing/profiling.md
@@ -73,6 +73,8 @@ apt install nsight-systems-cli
 ### Example commands and usage
+When profiling with `nsys`, it is advisable to set the environment variable `VLLM_WORKER_MULTIPROC_METHOD=spawn`. The default is to use the `fork` method instead of `spawn`. More information on the topic can be found in the [Nsight Systems release notes](https://docs.nvidia.com/nsight-systems/ReleaseNotes/index.html#general-issues).
 #### Offline Inference
 For basic usage, you can just append `nsys profile -o report.nsys-rep --trace-fork-before-exec=true --cuda-graph-trace=node` before any existing script you would run for offline inference.