OpenDAS / vllm_cscc
Commit 9482b0b0 (Unverified)
Authored Mar 18, 2026 by Michael Goin
Committed by GitHub, Mar 18, 2026
[Bugfix] Remove assertion for NVFP4 scale dynamic range (#37465)
Signed-off-by: Michael Goin <mgoin64@gmail.com>
parent 5bc1da14
Showing 1 changed file with 0 additions and 3 deletions
vllm/model_executor/layers/quantization/utils/marlin_utils_fp4.py
@@ -37,9 +37,6 @@ def _nvfp4_compute_scale_factor(marlin_scales: torch.Tensor) -> float:
     min_val = ws_float[nonzero_mask].min()
     if min_val < 2:
         sf = (2 / min_val).log2().ceil().exp2()
-        assert (ws_float[nonzero_mask] * sf <= 448 * (2**7)).all(), (
-            "NVFP4 scale dynamic range too large for rescaling"
-        )
         return sf.item()
     return 1.0
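
The following is a minimal pure-Python sketch of the arithmetic in this hunk, not the vLLM implementation. It shows how the power-of-two factor is derived from the smallest nonzero scale, and what the removed assertion compared against. The names `compute_scale_factor` and `exceeds_removed_bound` are hypothetical, plain lists stand in for `torch.Tensor`, and treating "nonzero" as "strictly positive" is an assumption, since the definitions of `ws_float` and `nonzero_mask` fall outside the hunk.

```python
import math

# Bound used by the removed assertion. 448 is the largest finite value of
# float8_e4m3fn; reading the constant that way is an assumption here.
REMOVED_BOUND = 448 * (2**7)


def compute_scale_factor(scales):
    """Return a power-of-two factor lifting the smallest positive scale to >= 2."""
    positive = [s for s in scales if s > 0]  # assumption: stands in for nonzero_mask
    if not positive:
        return 1.0
    min_val = min(positive)
    if min_val < 2:
        # Mirrors (2 / min_val).log2().ceil().exp2() from the diff.
        return 2.0 ** math.ceil(math.log2(2 / min_val))
    return 1.0


def exceeds_removed_bound(scales, sf):
    """True if any rescaled positive value is above the bound the old assert enforced."""
    return any(s * sf > REMOVED_BOUND for s in scales if s > 0)


if __name__ == "__main__":
    narrow = [0.0, 0.5, 4.0]
    wide = [2.0**-10, 448.0]
    for name, scales in (("narrow", narrow), ("wide", wide)):
        sf = compute_scale_factor(scales)
        print(name, sf, exceeds_removed_bound(scales, sf))
```

In this sketch the `wide` input is the kind of case that would have tripped the old assertion. After this commit, the function shown in the diff returns `sf.item()` for such inputs instead of raising.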