Commit 72031173, authored Mar 28, 2025 by Jiaqi. Committed by GitHub, Mar 27, 2025.
fix: fix typo of comments in w8a8_fp8.py (#4843)
Parent: 9fdc6d6a
Showing 1 changed file with 1 addition and 1 deletion
python/sglang/srt/layers/quantization/w8a8_fp8.py
@@ -37,7 +37,7 @@ class W8A8Fp8Config(QuantizationConfig):
     Note:
     - For models without offline quantization, weights will be quantized during model loading
     - If CUTLASS is supported: Per-channel weight quantization is used
-    - If CUTLASS is not supported: Falls back to per-token weight quantization
+    - If CUTLASS is not supported: Falls back to per-tensor weight quantization
     """

     def __init__(self, is_checkpoint_fp8_serialized: bool = False):
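The corrected comment distinguishes per-channel from per-tensor weight quantization. As a rough illustration of that difference, the sketch below computes FP8 scales both ways in plain Python. It is not the sglang implementation: the helper names `per_tensor_scale` and `per_channel_scales` are hypothetical, and it assumes the FP8 E4M3 format (largest finite value 448) with one output channel per row of the weight matrix.

```python
# Illustrative sketch only; helper names are hypothetical, not sglang APIs.
FP8_E4M3_MAX = 448.0  # largest finite value of FP8 E4M3 (assumed format)


def per_tensor_scale(weight):
    """Return a single scale shared by every element of the weight matrix."""
    amax = max(abs(x) for row in weight for x in row)
    return amax / FP8_E4M3_MAX


def per_channel_scales(weight):
    """Return one scale per output channel (assumed to be one row each)."""
    return [max(abs(x) for x in row) / FP8_E4M3_MAX for row in weight]


weight = [
    [0.5, -1.0, 0.25],      # small-magnitude channel
    [112.0, -448.0, 56.0],  # large-magnitude channel
]

tensor_scale = per_tensor_scale(weight)      # one value for the whole matrix
channel_scales = per_channel_scales(weight)  # one value per row
```

With a per-tensor scale, the large-magnitude row sets the scale for the whole matrix, so the small-magnitude row is quantized more coarsely than it would be with its own per-channel scale.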