Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
9fdc0f3a
Commit
9fdc0f3a
authored
Mar 26, 2026
by
khluu
Browse files
merge
Signed-off-by:
khluu
<
khluu000@gmail.com
>
parent
05d96d79
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
0 additions
and
12 deletions
+0
-12
vllm/model_executor/layers/quantization/fp8.py
vllm/model_executor/layers/quantization/fp8.py
+0
-12
No files found.
vllm/model_executor/layers/quantization/fp8.py
View file @
9fdc0f3a
...
@@ -437,19 +437,7 @@ class Fp8LinearMethod(LinearMethodBase):
...
@@ -437,19 +437,7 @@ class Fp8LinearMethod(LinearMethodBase):
else
:
else
:
layer
.
input_scale
=
None
layer
.
input_scale
=
None
<<<<<<<
HEAD
if
self
.
use_marlin
:
prepare_fp8_layer_for_marlin
(
layer
,
size_k_first
,
input_dtype
=
self
.
marlin_input_dtype
)
# Activations not quantized for marlin.
del
layer
.
input_scale
return
if
self
.
block_quant
:
=======
if
self
.
block_quant
and
self
.
use_deep_gemm
:
if
self
.
block_quant
and
self
.
use_deep_gemm
:
>>>>>>>
52069012
f
([
Bugfix
]
Fix
DeepGemm
E8M0
accuracy
degradation
for
Qwen3
.
5
FP8
on
Blackwell
(
#38083))
maybe_post_process_fp8_weight_block
(
layer
)
maybe_post_process_fp8_weight_block
(
layer
)
def
apply
(
def
apply
(
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment