[doc] update pipeline parallel in readme (#6347)

2d23b42d · youkaichao · GitHub · 1df43de9 · 2d23b42d · 2d23b42d
Unverified Commit 2d23b42d authored Jul 11, 2024 by youkaichao Committed by GitHub Jul 11, 2024
Show whitespace changes
Inline Side-by-side

Showing with 2 additions and 2 deletions

README.md README.md +1 -1

docs/source/index.rst docs/source/index.rst +1 -1

No files found.
--- a/README.md
+++ b/README.md
@@ -56,7 +56,7 @@ vLLM is flexible and easy to use with:
 - Seamless integration with popular Hugging Face models
 - High-throughput serving with various decoding algorithms, including *parallel sampling*, *beam search*, and more
- Tensor parallelism support for distributed inference
+- Tensor parallelism and pipieline parallelism support for distributed inference
 - Streaming outputs
 - OpenAI-compatible API server
 - Support NVIDIA GPUs, AMD CPUs and GPUs, Intel CPUs and GPUs, PowerPC CPUs

--- a/docs/source/index.rst
+++ b/docs/source/index.rst
@@ -38,7 +38,7 @@ vLLM is flexible and easy to use with:
 * Seamless integration with popular HuggingFace models
 * High-throughput serving with various decoding algorithms, including *parallel sampling*, *beam search*, and more
-* Tensor parallelism support for distributed inference
+* Tensor parallelism and pipieline parallelism support for distributed inference
 * Streaming outputs
 * OpenAI-compatible API server
 * Support NVIDIA GPUs and AMD GPUs