docs: update LoRA model name in docs (#5210)

975870d5 · Biswa Panda · GitHub · f7f2fb26 · 975870d5
Unverified Commit 975870d5 authored Jan 06, 2026 by Biswa Panda Committed by GitHub Jan 06, 2026
Hide whitespace changes
Inline Side-by-side

Showing with 8 additions and 8 deletions

examples/backends/vllm/launch/lora/README.md examples/backends/vllm/launch/lora/README.md +8 -8

No files found.
--- a/examples/backends/vllm/launch/lora/README.md
+++ b/examples/backends/vllm/launch/lora/README.md
@@ -39,8 +39,8 @@ Run the setup script to start MinIO and download/upload a LoRA adapter from Hugg
 This script will:
 - Start MinIO in a Docker container
- Download a LoRA adapter from Hugging Face Hub (default: `Neural-Hacker/Qwen3-Math-Reasoning-LoRA`)
+- Download a LoRA adapter from Hugging Face Hub (default: `codelion/Qwen3-0.6B-accuracy-recovery-lora`)
- Upload the LoRA to MinIO at `s3://my-loras/Neural-Hacker/Qwen3-Math-Reasoning-LoRA`
+- Upload the LoRA to MinIO at `s3://my-loras/codelion/Qwen3-0.6B-accuracy-recovery-lora`
 #### Script Options
@@ -103,9 +103,9 @@ Load a LoRA from S3-compatible storage backend (e.g. MinIO):
 curl -X POST http://localhost:8081/v1/loras \
  -H "Content-Type: application/json" \
  -d '{
-    "lora_name": "Neural-Hacker/Qwen3-Math-Reasoning-LoRA",
+    "lora_name": "codelion/Qwen3-0.6B-accuracy-recovery-lora",
    "source": {
-      "uri": "s3://my-loras/Neural-Hacker/Qwen3-Math-Reasoning-LoRA"
+      "uri": "s3://my-loras/codelion/Qwen3-0.6B-accuracy-recovery-lora"
    }
  }' | jq .
 ```
@@ -114,8 +114,8 @@ Expected response:
 ```json
 {
  "status": "success",
-  "message": "LoRA adapter 'Neural-Hacker/Qwen3-Math-Reasoning-LoRA' loaded successfully",
+  "message": "LoRA adapter 'codelion/Qwen3-0.6B-accuracy-recovery-lora' loaded successfully",
-  "lora_name": "Neural-Hacker/Qwen3-Math-Reasoning-LoRA",
+  "lora_name": "codelion/Qwen3-0.6B-accuracy-recovery-lora",
  "lora_id": 1207343256
 }
 ```
@@ -146,7 +146,7 @@ You should see both the base model and the LoRA adapter listed.
 curl -X POST http://localhost:8000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
-    "model": "Neural-Hacker/Qwen3-Math-Reasoning-LoRA",
+    "model": "codelion/Qwen3-0.6B-accuracy-recovery-lora",
    "messages": [{
      "role": "user",
      "content": "What is good low risk investment strategy?"
@@ -176,7 +176,7 @@ curl -X POST http://localhost:8000/v1/chat/completions \
 When you no longer need a LoRA, unload it to free up resources:
 ```bash
-curl -X DELETE http://localhost:8081/v1/loras/Neural-Hacker/Qwen3-Math-Reasoning-LoRA | jq .
+curl -X DELETE http://localhost:8081/v1/loras/codelion/Qwen3-0.6B-accuracy-recovery-lora | jq .
 ```
 Expected response: