docs: Add note for LMCache ARM support (#2535)

Co-authored-by: Dmitry Tokarev <dtokarev@nvidia.com>

Unverified commit b9cbee0b, authored Aug 19, 2025 by ZichengMa, committed by GitHub Aug 19, 2025

Showing 1 changed file with 4 additions and 0 deletions

components/backends/vllm/LMCache_Integration.md +4 -0

--- a/components/backends/vllm/LMCache_Integration.md
+++ b/components/backends/vllm/LMCache_Integration.md
@@ -11,6 +11,10 @@ This document describes how LMCache is integrated into Dynamo's vLLM backend to
 - **Memory Offloading**: Intelligent KV cache placement across CPU/GPU/storage tiers
 - **Improved Throughput**: Reduced GPU memory pressure enables higher batch sizes
+## Platform Support
+**Important Note**: LMCache integration currently only supports x86 architecture. ARM64 is not supported at this time.
 ## Aggregated Serving
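
The platform note added in this change could be enforced at launch time with a small guard. The following is a minimal sketch, not part of Dynamo or LMCache: the helper name `lmcache_supported` and the set `X86_MACHINES` are hypothetical, and it assumes that "x86" in the note means the 64-bit machine names reported by Python's `platform.machine()` (`x86_64` on Linux and macOS, `AMD64` on Windows).

```python
import platform
from typing import Optional

# Hypothetical allow-list (assumption): 64-bit x86 names that
# platform.machine() commonly reports, compared in lowercase.
X86_MACHINES = {"x86_64", "amd64"}


def lmcache_supported(machine: Optional[str] = None) -> bool:
    """Return True if the machine name looks like 64-bit x86.

    `machine` defaults to the current host's platform.machine() value.
    ARM64 hosts typically report "aarch64" (Linux) or "arm64" (macOS),
    so they are rejected by this check.
    """
    name = (machine or platform.machine()).lower()
    return name in X86_MACHINES


if __name__ == "__main__":
    if lmcache_supported():
        print("x86 host detected: LMCache integration may be enabled.")
    else:
        print("Non-x86 host detected: skip LMCache integration.")
```

A launcher script could call a check like this before turning on the integration, so that an ARM64 host falls back to running without LMCache instead of failing later.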