Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
change
sglang
Commits
aaf0ad8c
Unverified
Commit
aaf0ad8c
authored
Aug 07, 2025
by
Stefan He
Committed by
GitHub
Aug 07, 2025
Browse files
remove vllm fp8quant from fp8.py (#8937)
parent
361379b5
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
0 additions
and
3 deletions
+0
-3
python/sglang/srt/layers/quantization/fp8.py
python/sglang/srt/layers/quantization/fp8.py
+0
-3
No files found.
python/sglang/srt/layers/quantization/fp8.py
View file @
aaf0ad8c
...
@@ -98,9 +98,6 @@ if _is_hip and (_use_aiter or _use_hip_int4):
...
@@ -98,9 +98,6 @@ if _is_hip and (_use_aiter or _use_hip_int4):
from
aiter.fused_moe
import
fused_moe
from
aiter.fused_moe
import
fused_moe
from
aiter.ops.shuffle
import
shuffle_weight
from
aiter.ops.shuffle
import
shuffle_weight
if
not
(
_is_cuda
or
_is_npu
or
(
_is_cpu
and
_is_cpu_amx_available
)
or
_is_hip
):
from
vllm._custom_ops
import
scaled_fp8_quant
ACTIVATION_SCHEMES
=
[
"static"
,
"dynamic"
]
ACTIVATION_SCHEMES
=
[
"static"
,
"dynamic"
]
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment