docs: fix the README link to the perf.sh file (#1501)

0bba09a4 · richardhuo-nv · GitHub · 8585c300 · 0bba09a4 · 0bba09a4
Unverified Commit 0bba09a4 authored Jun 12, 2025 by richardhuo-nv Committed by GitHub Jun 12, 2025
Hide whitespace changes
Inline Side-by-side

Showing with 2 additions and 2 deletions

examples/llm/benchmarks/README.md examples/llm/benchmarks/README.md +1 -1

examples/tensorrt_llm/README.md examples/tensorrt_llm/README.md +1 -1

No files found.
--- a/examples/llm/benchmarks/README.md
+++ b/examples/llm/benchmarks/README.md
@@ -266,7 +266,7 @@ For more information see [Collecting Performance Numbers](#collecting-performanc

 ## Collecting Performance Numbers

-Currently, there is no consistent way of obtaining the configuration of deployment service. Hence, we need to provide this information to the script in form of command line arguments. The benchmarking script `/workspace/examples/llm/benchmarks/perf.sh` uses GenAI-Perf tool to collect the performance numbers at various different request concurrencies. The perf.sh script can be run multiple times to collect numbers for various different deployments. Each script execution will create a new artifacts directory in `artifacts_root` and dump these numbers in it. See [Plotting Pareto Graphs](#plotting-pareto-graphs) to learn how to convert the data from this `artifacts_root` to generate pareto graphs for the performance.
+Currently, there is no consistent way of obtaining the configuration of deployment service. Hence, we need to provide this information to the script in form of command line arguments. The benchmarking script `/workspace/benchmarks/llm/perf.sh` uses GenAI-Perf tool to collect the performance numbers at various different request concurrencies. The perf.sh script can be run multiple times to collect numbers for various different deployments. Each script execution will create a new artifacts directory in `artifacts_root` and dump these numbers in it. See [Plotting Pareto Graphs](#plotting-pareto-graphs) to learn how to convert the data from this `artifacts_root` to generate pareto graphs for the performance.

 Note: As each `perf.sh` adds a new artifacts directory in the `artifacts_root` always, proper care should be taken that we are starting experiment with clean `artifacts_root` so we include only results from runs that we want to compare.


--- a/examples/tensorrt_llm/README.md
+++ b/examples/tensorrt_llm/README.md
@@ -286,7 +286,7 @@ See [close deployment](../../docs/guides/dynamo_serve.md#close-deployment) secti
 ### Benchmarking

 To benchmark your deployment with GenAI-Perf, see this utility script, configuring the
-`model` name and `host` based on your deployment: [perf.sh](../llm/benchmarks/perf.sh)
+`model` name and `host` based on your deployment: [perf.sh](../../benchmarks/llm/perf.sh)

 ### Future Work