Unverified Commit 975870d5 authored by Biswa Panda's avatar Biswa Panda Committed by GitHub
Browse files

docs: update LoRA model name in docs (#5210)

parent f7f2fb26
...@@ -39,8 +39,8 @@ Run the setup script to start MinIO and download/upload a LoRA adapter from Hugg ...@@ -39,8 +39,8 @@ Run the setup script to start MinIO and download/upload a LoRA adapter from Hugg
This script will: This script will:
- Start MinIO in a Docker container - Start MinIO in a Docker container
- Download a LoRA adapter from Hugging Face Hub (default: `Neural-Hacker/Qwen3-Math-Reasoning-LoRA`) - Download a LoRA adapter from Hugging Face Hub (default: `codelion/Qwen3-0.6B-accuracy-recovery-lora`)
- Upload the LoRA to MinIO at `s3://my-loras/Neural-Hacker/Qwen3-Math-Reasoning-LoRA` - Upload the LoRA to MinIO at `s3://my-loras/codelion/Qwen3-0.6B-accuracy-recovery-lora`
#### Script Options #### Script Options
...@@ -103,9 +103,9 @@ Load a LoRA from S3-compatible storage backend (e.g. MinIO): ...@@ -103,9 +103,9 @@ Load a LoRA from S3-compatible storage backend (e.g. MinIO):
curl -X POST http://localhost:8081/v1/loras \ curl -X POST http://localhost:8081/v1/loras \
-H "Content-Type: application/json" \ -H "Content-Type: application/json" \
-d '{ -d '{
"lora_name": "Neural-Hacker/Qwen3-Math-Reasoning-LoRA", "lora_name": "codelion/Qwen3-0.6B-accuracy-recovery-lora",
"source": { "source": {
"uri": "s3://my-loras/Neural-Hacker/Qwen3-Math-Reasoning-LoRA" "uri": "s3://my-loras/codelion/Qwen3-0.6B-accuracy-recovery-lora"
} }
}' | jq . }' | jq .
``` ```
...@@ -114,8 +114,8 @@ Expected response: ...@@ -114,8 +114,8 @@ Expected response:
```json ```json
{ {
"status": "success", "status": "success",
"message": "LoRA adapter 'Neural-Hacker/Qwen3-Math-Reasoning-LoRA' loaded successfully", "message": "LoRA adapter 'codelion/Qwen3-0.6B-accuracy-recovery-lora' loaded successfully",
"lora_name": "Neural-Hacker/Qwen3-Math-Reasoning-LoRA", "lora_name": "codelion/Qwen3-0.6B-accuracy-recovery-lora",
"lora_id": 1207343256 "lora_id": 1207343256
} }
``` ```
...@@ -146,7 +146,7 @@ You should see both the base model and the LoRA adapter listed. ...@@ -146,7 +146,7 @@ You should see both the base model and the LoRA adapter listed.
curl -X POST http://localhost:8000/v1/chat/completions \ curl -X POST http://localhost:8000/v1/chat/completions \
-H "Content-Type: application/json" \ -H "Content-Type: application/json" \
-d '{ -d '{
"model": "Neural-Hacker/Qwen3-Math-Reasoning-LoRA", "model": "codelion/Qwen3-0.6B-accuracy-recovery-lora",
"messages": [{ "messages": [{
"role": "user", "role": "user",
"content": "What is good low risk investment strategy?" "content": "What is good low risk investment strategy?"
...@@ -176,7 +176,7 @@ curl -X POST http://localhost:8000/v1/chat/completions \ ...@@ -176,7 +176,7 @@ curl -X POST http://localhost:8000/v1/chat/completions \
When you no longer need a LoRA, unload it to free up resources: When you no longer need a LoRA, unload it to free up resources:
```bash ```bash
curl -X DELETE http://localhost:8081/v1/loras/Neural-Hacker/Qwen3-Math-Reasoning-LoRA | jq . curl -X DELETE http://localhost:8081/v1/loras/codelion/Qwen3-0.6B-accuracy-recovery-lora | jq .
``` ```
Expected response: Expected response:
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment