Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
1bd3ae33
Commit
1bd3ae33
authored
Oct 11, 2025
by
zhuwenwen
Browse files
skip silu_mul_fp8_quant_deep_gemm_cuda and remove zero_overhead
parent
9bf1b213
Changes
22
Expand all
Hide whitespace changes
Inline
Side-by-side
Showing
2 changed files
with
0 additions
and
759 deletions
+0
-759
vllm/zero_overhead/v1/gpu_model_runner.py
vllm/zero_overhead/v1/gpu_model_runner.py
+0
-747
vllm/zero_overhead/v1/outputs.py
vllm/zero_overhead/v1/outputs.py
+0
-12
No files found.
vllm/zero_overhead/v1/gpu_model_runner.py
deleted
100644 → 0
View file @
9bf1b213
This diff is collapsed.
Click to expand it.
vllm/zero_overhead/v1/outputs.py
deleted
100644 → 0
View file @
9bf1b213
from
dataclasses
import
dataclass
from
vllm.v1.outputs
import
ModelRunnerOutput
@
dataclass
class
ZeroV1ModelRunnerOutput
(
ModelRunnerOutput
):
# [num_reqs]
fix_req_ids
:
list
[
str
]
=
None
fix_sampled_token_ids
:
list
[
list
[
int
]]
=
None
fix_draft_req_ids
:
list
[
str
]
=
None
fix_draft_tokens_ids
:
list
[
list
[
int
]]
=
None
is_output_valid
:
bool
=
True
\ No newline at end of file
Prev
1
2
Next
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment