Harry Mellor bc2d4473bf
[Docs] Make installation URLs nicer (#14556)
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
2025-03-10 10:43:08 -07:00

Installation

vLLM has been adapted to work on ARM64 CPUs with NEON support, leveraging the CPU backend initially developed for the x86 platform.

The ARM CPU backend currently supports the Float32, FP16, and BFloat16 data types.

:::{attention}
There are no pre-built wheels or images for this device, so you must build vLLM from source.
:::

Requirements

  • OS: Linux
  • Compiler: gcc/g++ >= 12.3.0 (optional, recommended)
  • Instruction Set Architecture (ISA): NEON support is required
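
The requirements above can be checked before starting a build. The following is a minimal sketch, not part of vLLM: the helper names (`has_neon`, `parse_version`, `gcc_is_recent_enough`, `check_host`) are hypothetical. It assumes a Linux host where `/proc/cpuinfo` is readable and where AArch64 kernels report NEON (Advanced SIMD) as the `asimd` feature flag, and it assumes `gcc` is the compiler on `PATH`.

```python
import platform
import re
import shutil
import subprocess

# Recommended minimum compiler version, taken from the requirements list.
MIN_GCC = (12, 3, 0)


def has_neon(cpuinfo_text: str) -> bool:
    """Return True if a 'Features' line lists NEON.

    Assumption: AArch64 Linux reports NEON as 'asimd'; 32-bit ARM uses 'neon'.
    """
    for line in cpuinfo_text.splitlines():
        key, _, value = line.partition(":")
        if key.strip().lower() == "features":
            flags = value.split()
            if "asimd" in flags or "neon" in flags:
                return True
    return False


def parse_version(text: str) -> tuple[int, ...]:
    """Extract the first 'major.minor[.patch]' version found in text."""
    match = re.search(r"(\d+)\.(\d+)(?:\.(\d+))?", text)
    if match is None:
        raise ValueError(f"no version number found in {text!r}")
    # A missing patch component is treated as 0.
    return tuple(int(part) for part in match.groups(default="0"))


def gcc_is_recent_enough(version_text: str, minimum=MIN_GCC) -> bool:
    """Compare a gcc version string against the recommended minimum."""
    return parse_version(version_text) >= minimum


def check_host() -> dict:
    """Collect a best-effort report for the current machine."""
    report = {"linux": platform.system() == "Linux", "neon": False, "gcc": False}
    try:
        with open("/proc/cpuinfo") as fh:
            report["neon"] = has_neon(fh.read())
    except OSError:
        pass  # /proc/cpuinfo is unavailable on non-Linux hosts
    gcc = shutil.which("gcc")
    if gcc:
        result = subprocess.run(
            [gcc, "-dumpfullversion"], capture_output=True, text=True
        )
        try:
            report["gcc"] = gcc_is_recent_enough(result.stdout)
        except ValueError:
            pass  # unexpected compiler output; leave as False
    return report


if __name__ == "__main__":
    print(check_host())
```

Because the compiler version is listed as optional but recommended, a `False` value for `gcc` in this report is a warning rather than a hard failure; a `False` value for `neon` means the ISA requirement is not met.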

Set up using Python

Pre-built wheels

Build wheel from source

:::{include} cpu/build.inc.md
:::

Compatibility testing has been conducted on AWS Graviton3 instances.

Set up using Docker

Pre-built images

Build image from source

Extra information