Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
zhaoyu6
sglang
Commits
cd8d4b9d
"scripts/vscode:/vscode.git/clone" did not exist on "d645ae90a3f538c72baec8bbf312388c22e25eae"
Unverified
Commit
cd8d4b9d
authored
May 15, 2025
by
Qiaolin Yu
Committed by
GitHub
May 15, 2025
Browse files
Fix lora bench (#6302)
parent
f194e14f
Changes
2
Hide whitespace changes
Inline
Side-by-side
Showing
2 changed files
with
3 additions
and
3 deletions
+3
-3
benchmark/lora/lora_bench.py
benchmark/lora/lora_bench.py
+2
-0
python/sglang/srt/lora/lora_manager.py
python/sglang/srt/lora/lora_manager.py
+1
-3
No files found.
benchmark/lora/lora_bench.py
View file @
cd8d4b9d
...
...
@@ -170,6 +170,7 @@ async def benchmark(
prompt_len
=
test_prompt_len
,
output_len
=
test_output_len
,
lora_name
=
"dummy"
,
# the lora_name argument will not be used
image_data
=
None
,
extra_request_body
=
extra_request_body
,
)
test_output
=
await
request_func
(
request_func_input
=
test_input
)
...
...
@@ -194,6 +195,7 @@ async def benchmark(
prompt_len
=
prompt_len
,
output_len
=
output_len
,
lora_name
=
"dummy"
,
image_data
=
None
,
extra_request_body
=
extra_request_body
,
)
tasks
.
append
(
...
...
python/sglang/srt/lora/lora_manager.py
View file @
cd8d4b9d
...
...
@@ -170,9 +170,7 @@ class LoRAManager:
dim
=
0
,
out
=
self
.
cuda_graph_batch_info
.
seg_indptr
[
1
:
bs
+
1
],
)
self
.
cuda_graph_batch_info
.
max_len
=
int
(
torch
.
max
(
self
.
cuda_graph_batch_info
.
seg_lens
[:
bs
])
)
self
.
cuda_graph_batch_info
.
max_len
=
1
for
i
,
lora_path
in
enumerate
(
forward_batch
.
lora_paths
):
self
.
cuda_graph_batch_info
.
weight_indices
[
i
]
=
(
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment