Unverified Commit 8425f785 authored by Varun Sundar Rabindranath's avatar Varun Sundar Rabindranath Committed by GitHub
Browse files

[Misc] DeepEPHighThroughtput - Enable Inductor pass (#21311)


Signed-off-by: default avatarVarun Sundar Rabindranath <vsundarr@redhat.com>
Co-authored-by: default avatarVarun Sundar Rabindranath <vsundarr@redhat.com>
parent c17231e8
...@@ -182,9 +182,6 @@ class CudaPlatformBase(Platform): ...@@ -182,9 +182,6 @@ class CudaPlatformBase(Platform):
compilation_config.use_cudagraph = False compilation_config.use_cudagraph = False
if model_config is not None: if model_config is not None:
model_config.enforce_eager = True model_config.enforce_eager = True
# TODO (varun): Turning this ON gives incorrect results for the
# Deepseek-V2-lite model.
vllm_config.compilation_config.use_inductor = False
@classmethod @classmethod
def get_current_memory_usage(cls, def get_current_memory_usage(cls,
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment