Commit 3e34e900 (unverified)
Authored Jun 30, 2025 by narutolhy
Committed by GitHub on Jun 30, 2025
Fix: sync prepare_fp8_layer_for_marlin with latest vllm changes (#7648)
Parent: 7349717e
Showing 1 changed file with 1 addition and 1 deletion (+1, -1)
python/sglang/srt/layers/quantization/compressed_tensors/schemes/compressed_tensors_w8a16_fp8.py
@@ -76,7 +76,7 @@ class CompressedTensorsW8A16Fp8(CompressedTensorsScheme):
             layer.input_scale = torch.nn.Parameter(
                 layer.input_scale.data, requires_grad=False
             )
-        prepare_fp8_layer_for_marlin(layer, strategy="channel")
+        prepare_fp8_layer_for_marlin(layer, size_k_first=True)
 
     def create_weights(
         self,
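The one-line change above replaces the `strategy="channel"` keyword with `size_k_first=True` at the call site. A minimal sketch of why such a sync is likely needed, assuming the upstream helper no longer accepts a `strategy` keyword: the function `prepare_fp8_layer_for_marlin_stub` below is a hypothetical stand-in written for illustration, not the real vLLM or sglang helper, and its signature is inferred only from the new call in the diff.

```python
def prepare_fp8_layer_for_marlin_stub(layer, size_k_first=True):
    """Hypothetical stand-in for the real helper (signature is an assumption).

    It only records its arguments; the real helper repacks FP8 weights.
    """
    return {"layer": layer, "size_k_first": size_k_first}


# Old call style: the keyword is not in the assumed signature, so Python
# raises TypeError before the function body ever runs.
try:
    prepare_fp8_layer_for_marlin_stub("layer", strategy="channel")
    old_call_ok = True
except TypeError:
    old_call_ok = False

# New call style, matching the added line in the diff.
new_result = prepare_fp8_layer_for_marlin_stub("layer", size_k_first=True)

print(old_call_ok)                 # False
print(new_result["size_k_first"])  # True
```

Under that assumption, the old call would fail with a `TypeError` when the weights are post-processed after loading, which is consistent with the commit being labeled a fix.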