Unverified Commit 2f2c1d73 authored by Russell Bryant's avatar Russell Bryant Committed by GitHub
Browse files

[Docs] Upgrade dynamic LoRA warning to admonition block (#35218)


Signed-off-by: default avatarRussell Bryant <rbryant@redhat.com>
parent fb3e78ab
......@@ -106,7 +106,8 @@ curl http://localhost:8000/v1/completions \
In addition to serving LoRA adapters at server startup, the vLLM server supports dynamically configuring LoRA adapters at runtime through dedicated API endpoints and plugins. This feature can be particularly useful when the flexibility to change models on-the-fly is needed.
Note: Enabling this feature in production environments is risky as users may participate in model adapter management.
!!! warning
This feature comes with security risks. It should not be used in production unless it is an isolated, fully trusted environment.
To enable dynamic LoRA configuration, ensure that the environment variable `VLLM_ALLOW_RUNTIME_LORA_UPDATING`
is set to `True`.
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment