Unverified Commit 4fb8142a authored by Woosuk Kwon's avatar Woosuk Kwon Committed by GitHub
Browse files

[V1][PP] Enable true PP with Ray executor (#13472)


Signed-off-by: default avatarWoosuk Kwon <woosuk.kwon@berkeley.edu>
parent a02c86b4
...@@ -32,7 +32,7 @@ class RayDistributedExecutor(RayDistributedExecutorV0, Executor): ...@@ -32,7 +32,7 @@ class RayDistributedExecutor(RayDistributedExecutorV0, Executor):
"""Ray distributed executor supports pipeline parallelism, """Ray distributed executor supports pipeline parallelism,
meaning that it allows PP size batches to be executed concurrently. meaning that it allows PP size batches to be executed concurrently.
""" """
return 1 #self.vllm_config.parallel_config.pipeline_parallel_size return self.parallel_config.pipeline_parallel_size
def execute_model( def execute_model(
self, self,
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment