Unverified Commit 89fca671 authored by Simon Mo's avatar Simon Mo Committed by GitHub
Browse files

[V1] Default MLA to V1 (#14921)


Signed-off-by: default avatarsimon-mo <simon.mo@hey.com>
parent d20b0c13
...@@ -1191,7 +1191,7 @@ class EngineArgs: ...@@ -1191,7 +1191,7 @@ class EngineArgs:
NOTE: for autoselection of V0 vs V1 engine, we need to NOTE: for autoselection of V0 vs V1 engine, we need to
create the ModelConfig first, since ModelConfig's attrs create the ModelConfig first, since ModelConfig's attrs
(e.g. the model arch) are needed to make the decision. (e.g. the model arch) are needed to make the decision.
This function set VLLM_USE_V1=X if VLLM_USE_V1 is This function set VLLM_USE_V1=X if VLLM_USE_V1 is
unspecified by the user. unspecified by the user.
...@@ -1576,10 +1576,6 @@ class EngineArgs: ...@@ -1576,10 +1576,6 @@ class EngineArgs:
############################################################# #############################################################
# Experimental Features - allow users to opt in. # Experimental Features - allow users to opt in.
# MLA is is supported on V1, but off by default for now.
if model_config.use_mla and _warn_or_fallback("MLA"):
return False
# LoRA is supported on V1, but off by default for now. # LoRA is supported on V1, but off by default for now.
if self.enable_lora and _warn_or_fallback("LORA"): if self.enable_lora and _warn_or_fallback("LORA"):
return False return False
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment