OpenDAS / vllm_cscc
Commit 8b624950 (Unverified)
Authored Oct 29, 2025 by Zhewen Li
Committed by GitHub, Oct 29, 2025
[Bugfix] Fix non-contiguous tensor error in `rocm_unquantized_gemm_impl` (#27605)
Signed-off-by: zhewenli <zhewenli@meta.com>
parent 83fd49b1
Showing 2 changed files with 4 additions and 4 deletions
.buildkite/test-amd.yaml  +1 -1
vllm/model_executor/layers/utils.py  +3 -3
.buildkite/test-amd.yaml
@@ -286,7 +286,7 @@ steps:
 - label: Engine Test # 25min
   timeout_in_minutes: 40
-  mirror_hardwares: [amdexperimental]
+  mirror_hardwares: [amdexperimental, amdproduction]
   agent_pool: mi325_1
   #grade: Blocking
   source_file_dependencies:
...
vllm/model_executor/layers/utils.py
@@ -119,17 +119,17 @@ def rocm_unquantized_gemm_impl(
     if use_skinny is not True:
         return torch.nn.functional.linear(x, weight, bias)
-    x_view = x.view(-1, x.size(-1))
+    x_view = x.reshape(-1, x.size(-1))
     n = x_view.shape[0]
     m = weight.shape[0]
     cu_count = current_platform.get_cu_count()
     if m > 8 and 0 < n <= 4:
         out = ops.wvSplitK(weight, x_view, cu_count, bias)
-        return out.view(*x.shape[:-1], weight.shape[0])
+        return out.reshape(*x.shape[:-1], weight.shape[0])
     elif m % 4 == 0 and n == 1 and k <= 8192 and bias is None:
         out = ops.LLMM1(weight, x_view, 4)
-        return out.view(*x.shape[:-1], weight.shape[0])
+        return out.reshape(*x.shape[:-1], weight.shape[0])
     return torch.nn.functional.linear(x, weight, bias)
...
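The switch from `view` to `reshape` matters because `view` never copies: it only succeeds when the dimensions being merged are laid out compatibly in memory, whereas `reshape` falls back to a copy when they are not. The sketch below models that stride rule in plain Python, without torch, so the failure condition is easy to see. The helper names `row_major_strides` and `can_merge_leading_dims` are hypothetical and not part of vLLM or PyTorch; the check follows the merge condition documented for `torch.Tensor.view` and, as a simplification, does not handle empty tensors.

```python
from typing import Sequence


def row_major_strides(shape: Sequence[int]) -> tuple:
    """Strides (in elements) of a freshly allocated, contiguous tensor."""
    strides = []
    step = 1
    for size in reversed(shape):
        strides.append(step)
        step *= size
    return tuple(reversed(strides))


def can_merge_leading_dims(shape: Sequence[int], strides: Sequence[int]) -> bool:
    """True if every dim except the last can be merged without a copy.

    This models a call like x.view(-1, x.size(-1)). Two adjacent dims i and
    i+1 can be merged only if stride[i] == stride[i+1] * size[i+1].
    Size-1 dims are skipped because their stride is irrelevant.
    """
    leading = [(s, st) for s, st in zip(shape[:-1], strides[:-1]) if s != 1]
    for (_, outer_stride), (inner_size, inner_stride) in zip(leading, leading[1:]):
        if outer_stride != inner_stride * inner_size:
            return False
    return True


# A contiguous (2, 3, 4) tensor: leading dims merge cleanly.
contiguous_ok = can_merge_leading_dims((2, 3, 4), row_major_strides((2, 3, 4)))

# The same storage after swapping the first two dims: shape (3, 2, 4),
# strides (4, 12, 1). The leading dims can no longer be merged in place,
# which is the situation where view raises and reshape copies instead.
transposed_ok = can_merge_leading_dims((3, 2, 4), (4, 12, 1))

print(contiguous_ok, transposed_ok)
```

With a transposed or otherwise permuted input, the flattening step in `rocm_unquantized_gemm_impl` falls into the second case, which is presumably the error the commit title refers to.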
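To see which input shapes reach the two custom kernels, and therefore which calls went through the flattening step changed above, the branch conditions from the hunk can be restated as a small standalone function. This is an illustrative sketch only: `pick_gemm_path` is a hypothetical name, it returns a label instead of calling any kernel, and it assumes `k` is the shared inner dimension, which the hunk uses but does not define.

```python
def pick_gemm_path(n: int, m: int, k: int, has_bias: bool,
                   use_skinny: bool = True) -> str:
    """Return the name of the branch the hunk's conditions would select.

    n: rows of the flattened input (x_view.shape[0])
    m: output features (weight.shape[0])
    k: inner dimension (assumption: not defined in the shown hunk)
    """
    if use_skinny is not True:
        return "linear"
    if m > 8 and 0 < n <= 4:
        return "wvSplitK"
    if m % 4 == 0 and n == 1 and k <= 8192 and not has_bias:
        return "LLMM1"
    return "linear"


print(pick_gemm_path(n=1, m=4096, k=4096, has_bias=False))   # wvSplitK
print(pick_gemm_path(n=1, m=8, k=4096, has_bias=False))      # LLMM1
print(pick_gemm_path(n=16, m=4096, k=4096, has_bias=False))  # linear
```

Only the `wvSplitK` and `LLMM1` branches use `x_view` and reshape the output, so small-batch inputs (`n` of at most 4) are the ones affected by the fix; every other case goes straight to `torch.nn.functional.linear` with the original `x`.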