docs: Fix KV cache transfer UCX configuration instructions (#5247)

Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>

docs: Fix KV cache transfer UCX configuration instructions (#5247)
Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>
52ce68e6 · Neelay Shah · GitHub · 12f6e6a8 · 52ce68e6
Unverified Commit 52ce68e6 authored Jan 12, 2026 by Neelay Shah Committed by GitHub Jan 12, 2026
Show whitespace changes
Inline Side-by-side

Showing with 3 additions and 5 deletions

docs/backends/trtllm/kv-cache-transfer.md docs/backends/trtllm/kv-cache-transfer.md +3 -5

No files found.
--- a/docs/backends/trtllm/kv-cache-transfer.md
+++ b/docs/backends/trtllm/kv-cache-transfer.md
@@ -34,9 +34,7 @@ TODO: Add instructions for how to specify different backends for NIXL.

 ## Alternative Method: UCX

-TensorRT-LLM can also leverage **UCX** (Unified Communication X) directly for KV cache transfer between prefill and decode workers. There are two ways to enable UCX as the KV cache transfer backend:
+TensorRT-LLM can also leverage **UCX** (Unified Communication X) directly for KV cache transfer between prefill and decode workers. To enable UCX as the KV cache transfer backend, set `cache_transceiver_config.backend: UCX` in your engine configuration YAML file.

-1. **Recommended:** Set `cache_transceiver_config.backend: UCX` in your engine configuration YAML file.
-2. Alternatively, set the environment variable `TRTLLM_USE_UCX_KV_CACHE=1` and configure `cache_transceiver_config.backend: DEFAULT` in the engine configuration YAML.
-
-This flexibility allows users to choose the most suitable method for their deployment and compatibility requirements.
+> [!Note]
+> The environment variable `TRTLLM_USE_UCX_KV_CACHE=1` with `cache_transceiver_config.backend: DEFAULT` does not enable UCX. You must explicitly set `backend: UCX` in the configuration.