Unverified Commit e1ce44cd authored by Swipe4057's avatar Swipe4057 Committed by GitHub
Browse files

Disabling mixed chunked prefill when eagle is enabled (#6874)

parent f1114e7f
......@@ -425,6 +425,12 @@ class ServerArgs:
"Overlap scheduler is disabled because of using "
"eagle speculative decoding."
)
if self.enable_mixed_chunk:
self.enable_mixed_chunk = False
logger.warning(
"Mixed chunked prefill is disabled because of using "
"eagle speculative decoding."
)
model_arch = get_model_arch(self)
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment