[Doc] Update description of vLLM support for CPUs (#6003)
commit 439c84581a
parent 99ded1e1c4
@@ -59,7 +59,7 @@ vLLM is flexible and easy to use with:
 - Tensor parallelism support for distributed inference
 - Streaming outputs
 - OpenAI-compatible API server
-- Support NVIDIA GPUs, AMD GPUs, Intel CPUs and GPUs
+- Support NVIDIA GPUs, AMD CPUs and GPUs, Intel CPUs and GPUs, PowerPC CPUs
 - (Experimental) Prefix caching support
 - (Experimental) Multi-lora support
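For context on the "OpenAI-compatible API server" bullet in the hunk above, here is a minimal sketch of querying a locally running vLLM server with the official openai Python client. The server address, the "EMPTY" API key placeholder, and the model name are assumptions for illustration; a server must already be running (e.g. started with python -m vllm.entrypoints.openai.api_server --model <model>).

    from openai import OpenAI

    # Point the client at the local vLLM server (port 8000 is vLLM's
    # default); vLLM accepts a placeholder key unless one is configured.
    client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

    # The model name must match whatever the server was launched with;
    # "facebook/opt-125m" here is purely illustrative.
    completion = client.completions.create(
        model="facebook/opt-125m",
        prompt="vLLM supports",
        max_tokens=16,
    )
    print(completion.choices[0].text)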
@@ -20,7 +20,7 @@ Requirements

 * OS: Linux
 * Compiler: gcc/g++>=12.3.0 (optional, recommended)
-* Instruction set architecture (ISA) requirement: AVX512 is required.
+* Instruction set architecture (ISA) requirement: AVX512 (optional, recommended)

 .. _cpu_backend_quick_start_dockerfile:
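Since this hunk makes AVX512 optional rather than required, it can help to check whether the host CPU actually exposes it before expecting the optimized path. A minimal sketch, assuming a Linux host (matching the OS requirement above), where the kernel lists per-core feature flags in /proc/cpuinfo; avx512f is the "foundation" flag that the other AVX512 subsets build on.

    def has_avx512() -> bool:
        # Scan the kernel's CPU feature flags for the AVX512 foundation
        # flag; avx512f is a standard x86 CPUID feature name.
        try:
            with open("/proc/cpuinfo") as f:
                return "avx512f" in f.read()
        except FileNotFoundError:
            # No /proc/cpuinfo (not Linux): report AVX512 as unavailable.
            return False

    if __name__ == "__main__":
        print("AVX512 available:", has_avx512())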