OpenDAS / vllm_cscc
Commit c5d55356 (unverified)
Authored Sep 27, 2024 by Lucas Wilkinson; committed by GitHub, Sep 27, 2024

[Bugfix] fix for deepseek w4a16 (#8906)

Co-authored-by: mgoin <michael@neuralmagic.com>

Parent: 172d1cd2
Showing 1 changed file with 5 additions and 4 deletions

vllm/model_executor/layers/quantization/kernels/marlin.py (+5 -4)
...
@@ -38,10 +38,11 @@ class MarlinLinearKernel(MPLinearKernel):
                          "Marlin, supported group sizes are: "\
                          f"{MARLIN_SUPPORTED_GROUP_SIZES}"
-        return check_marlin_supports_shape(c.partition_weight_shape[0],
-                                           c.partition_weight_shape[1],
-                                           c.full_weight_shape[1],
-                                           c.group_size)
+        return check_marlin_supports_shape(
+            c.partition_weight_shape[1],  # out_features
+            c.partition_weight_shape[0],  # in_features
+            c.full_weight_shape[0],  # in_features
+            c.group_size)
         # note assumes that
         #  `weight_packed` is: {input_dim = 0, output_dim = 1, packed_dim = 0}
...
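
The sketch below illustrates why the argument order matters for a non-square weight. It is not vLLM's implementation: `supports_shape`, `MIN_THREAD_N`, and `MIN_THREAD_K` are hypothetical stand-ins, and the divisibility rules are assumed purely for illustration. The only thing taken from the diff is the layout, where index 0 of each shape is in_features and index 1 is out_features.

```python
from typing import Optional, Tuple

# Hypothetical tile constraints, assumed for illustration only.
MIN_THREAD_N = 64   # assumed divisor required of out_features per partition
MIN_THREAD_K = 128  # assumed divisor required of in_features per partition


def supports_shape(out_per_part: int, in_per_part: int, in_full: int,
                   group_size: int) -> Tuple[bool, Optional[str]]:
    """Hypothetical stand-in for a Marlin-style shape check."""
    if out_per_part % MIN_THREAD_N != 0:
        return False, f"out_features {out_per_part} not divisible by {MIN_THREAD_N}"
    if in_per_part % MIN_THREAD_K != 0:
        return False, f"in_features {in_per_part} not divisible by {MIN_THREAD_K}"
    if group_size < in_full and in_per_part % group_size != 0:
        return False, f"in_features {in_per_part} not divisible by group size {group_size}"
    return True, None


# Shapes follow the layout noted in the diff: (in_features, out_features).
partition_weight_shape = (128, 192)
full_weight_shape = (128, 192)
group_size = 128

# Argument order before the fix: in/out dimensions swapped.
old_call = supports_shape(partition_weight_shape[0],
                          partition_weight_shape[1],
                          full_weight_shape[1],
                          group_size)

# Argument order after the fix.
new_call = supports_shape(partition_weight_shape[1],  # out_features
                          partition_weight_shape[0],  # in_features
                          full_weight_shape[0],       # in_features
                          group_size)

print("old order:", old_call)
print("new order:", new_call)
```

Under these assumed constraints, the swapped order rejects a shape that the corrected order accepts. For square weights both orders agree, which may be why a bug of this kind can go unnoticed until a model with non-square layers is loaded.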