Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
change
sglang
Commits
a5a892ff
Unverified
Commit
a5a892ff
authored
Mar 13, 2025
by
Lianmin Zheng
Committed by
GitHub
Mar 13, 2025
Browse files
Fix auto merge & add back get_flat_data_by_layer (#4393)
parent
8e66fbec
Changes
3
Show whitespace changes
Inline
Side-by-side
Showing
3 changed files
with
19 additions
and
2 deletions
+19
-2
.github/workflows/pr-test.yml
.github/workflows/pr-test.yml
+12
-2
python/sglang/srt/mem_cache/memory_pool.py
python/sglang/srt/mem_cache/memory_pool.py
+3
-0
test/srt/test_vision_openai_server.py
test/srt/test_vision_openai_server.py
+4
-0
No files found.
.github/workflows/pr-test.yml
View file @
a5a892ff
...
@@ -337,6 +337,7 @@ jobs:
...
@@ -337,6 +337,7 @@ jobs:
python3 test_moe_eval_accuracy_large.py
python3 test_moe_eval_accuracy_large.py
finish
:
finish
:
if
:
always()
needs
:
[
needs
:
[
unit-test-frontend
,
unit-test-backend-1-gpu
,
unit-test-backend-2-gpu
,
unit-test-frontend
,
unit-test-backend-1-gpu
,
unit-test-backend-2-gpu
,
performance-test-1-gpu-part-1
,
performance-test-1-gpu-part-2
,
performance-test-2-gpu
,
performance-test-1-gpu-part-1
,
performance-test-1-gpu-part-2
,
performance-test-2-gpu
,
...
@@ -344,5 +345,14 @@ jobs:
...
@@ -344,5 +345,14 @@ jobs:
]
]
runs-on
:
ubuntu-latest
runs-on
:
ubuntu-latest
steps
:
steps
:
-
name
:
Finish
-
name
:
Check all dependent job statuses
run
:
echo "This is an empty step to ensure that all jobs are completed."
run
:
|
results=(${{ join(needs.*.result, ' ') }})
for result in "${results[@]}"; do
if [ "$result" = "failure" ] || [ "$result" = "cancelled" ]; then
echo "Job failed with result: $result"
exit 1
fi
done
echo "All jobs completed successfully"
exit 0
python/sglang/srt/mem_cache/memory_pool.py
View file @
a5a892ff
...
@@ -591,6 +591,9 @@ class MHATokenToKVPoolHost:
...
@@ -591,6 +591,9 @@ class MHATokenToKVPoolHost:
def
get_flat_data
(
self
,
indices
):
def
get_flat_data
(
self
,
indices
):
return
self
.
kv_buffer
[:,
:,
indices
]
return
self
.
kv_buffer
[:,
:,
indices
]
def
get_flat_data_by_layer
(
self
,
indices
,
layer_id
):
return
self
.
kv_buffer
[:,
layer_id
,
indices
]
def
assign_flat_data
(
self
,
indices
,
flat_data
):
def
assign_flat_data
(
self
,
indices
,
flat_data
):
self
.
kv_buffer
[:,
:,
indices
]
=
flat_data
self
.
kv_buffer
[:,
:,
indices
]
=
flat_data
...
...
test/srt/test_vision_openai_server.py
View file @
a5a892ff
...
@@ -535,6 +535,10 @@ class TestJanusProServer(TestOpenAIVisionServer):
...
@@ -535,6 +535,10 @@ class TestJanusProServer(TestOpenAIVisionServer):
def
test_video_chat_completion
(
self
):
def
test_video_chat_completion
(
self
):
pass
pass
def
test_single_image_chat_completion
(
self
):
# Skip this test because it is flaky
pass
if
__name__
==
"__main__"
:
if
__name__
==
"__main__"
:
unittest
.
main
()
unittest
.
main
()
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment