Unverified Commit 439c8458 authored by Jie Fu (傅杰)'s avatar Jie Fu (傅杰) Committed by GitHub
Browse files

[Doc] Update description of vLLM support for CPUs (#6003)

parent 99ded1e1
...@@ -59,7 +59,7 @@ vLLM is flexible and easy to use with: ...@@ -59,7 +59,7 @@ vLLM is flexible and easy to use with:
- Tensor parallelism support for distributed inference - Tensor parallelism support for distributed inference
- Streaming outputs - Streaming outputs
- OpenAI-compatible API server - OpenAI-compatible API server
- Support NVIDIA GPUs, AMD GPUs, Intel CPUs and GPUs - Support NVIDIA GPUs, AMD CPUs and GPUs, Intel CPUs and GPUs, PowerPC CPUs
- (Experimental) Prefix caching support - (Experimental) Prefix caching support
- (Experimental) Multi-lora support - (Experimental) Multi-lora support
......
...@@ -20,7 +20,7 @@ Requirements ...@@ -20,7 +20,7 @@ Requirements
* OS: Linux * OS: Linux
* Compiler: gcc/g++>=12.3.0 (optional, recommended) * Compiler: gcc/g++>=12.3.0 (optional, recommended)
* Instruction set architecture (ISA) requirement: AVX512 is required. * Instruction set architecture (ISA) requirement: AVX512 (optional, recommended)
.. _cpu_backend_quick_start_dockerfile: .. _cpu_backend_quick_start_dockerfile:
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment