Unverified Commit 41c80698 authored by Julien Debache's avatar Julien Debache Committed by GitHub
Browse files

Document multi-proc method selection for profiling (#23802)


Signed-off-by: default avatarjdebache <jdebache@nvidia.com>
parent 7c8271cd
...@@ -73,6 +73,8 @@ apt install nsight-systems-cli ...@@ -73,6 +73,8 @@ apt install nsight-systems-cli
### Example commands and usage ### Example commands and usage
When profiling with `nsys`, it is advisable to set the environment variable `VLLM_WORKER_MULTIPROC_METHOD=spawn`. The default is to use the `fork` method instead of `spawn`. More information on the topic can be found in the [Nsight Systems release notes](https://docs.nvidia.com/nsight-systems/ReleaseNotes/index.html#general-issues).
#### Offline Inference #### Offline Inference
For basic usage, you can just append `nsys profile -o report.nsys-rep --trace-fork-before-exec=true --cuda-graph-trace=node` before any existing script you would run for offline inference. For basic usage, you can just append `nsys profile -o report.nsys-rep --trace-fork-before-exec=true --cuda-graph-trace=node` before any existing script you would run for offline inference.
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment