Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
9d44744c
Commit
9d44744c
authored
Feb 24, 2026
by
SAC_fanth
Browse files
修复channel-wise 用triton报错的bug
parent
d146a231
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
2 additions
and
3 deletions
+2
-3
vllm/model_executor/layers/quantization/compressed_tensors/schemes/compressed_tensors_w8a8_fp8.py
...compressed_tensors/schemes/compressed_tensors_w8a8_fp8.py
+2
-3
No files found.
vllm/model_executor/layers/quantization/compressed_tensors/schemes/compressed_tensors_w8a8_fp8.py
View file @
9d44744c
...
@@ -161,9 +161,8 @@ class CompressedTensorsW8A8Fp8(CompressedTensorsScheme):
...
@@ -161,9 +161,8 @@ class CompressedTensorsW8A8Fp8(CompressedTensorsScheme):
)
)
if
envs
.
VLLM_W8A8_BACKEND
==
3
:
if
envs
.
VLLM_W8A8_BACKEND
==
3
:
weight
=
weight
.
t
().
contiguous
()
weight
=
weight
.
t
().
contiguous
()
# triton不用转置,torch需要
else
:
# else:
weight
=
weight
.
t
()
# weight = weight.t()
elif
self
.
strategy
==
QuantizationStrategy
.
BLOCK
:
elif
self
.
strategy
==
QuantizationStrategy
.
BLOCK
:
assert
self
.
is_static_input_scheme
is
False
assert
self
.
is_static_input_scheme
is
False
weight
,
weight_scale
=
process_fp8_weight_block_strategy
(
weight
,
weight_scale
=
process_fp8_weight_block_strategy
(
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment