vllm/docs/source/getting_started/installation/cpu/arm.inc.md

# Installation

vLLM has been adapted to work on ARM64 CPUs with NEON support, leveraging the CPU backend initially developed for the x86 platform.

ARM CPU backend currently supports Float32, FP16 and BFloat16 datatypes.

## Requirements

- OS: Linux
- Compiler: `gcc/g++ >= 12.3.0` (optional, recommended)
- Instruction Set Architecture (ISA): NEON support is required

## Set up using Python

### Pre-built wheels

### Build wheel from source

:::{include} build.inc.md
:::

Testing has been conducted on AWS Graviton3 instances for compatibility.

## Set up using Docker

### Pre-built images

### Build image from source

## Extra information
[Doc] Organise installation documentation into categories and tabs (#11935) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> 2025-01-13 12:27:36 +00:00			`# Installation`

			`vLLM has been adapted to work on ARM64 CPUs with NEON support, leveraging the CPU backend initially developed for the x86 platform.`

			`ARM CPU backend currently supports Float32, FP16 and BFloat16 datatypes.`

			`## Requirements`

			`- OS: Linux`
			- Compiler: `gcc/g++ >= 12.3.0` (optional, recommended)
			`- Instruction Set Architecture (ISA): NEON support is required`

			`## Set up using Python`

			`### Pre-built wheels`

			`### Build wheel from source`

			`:::{include} build.inc.md`
			`:::`

			`Testing has been conducted on AWS Graviton3 instances for compatibility.`

			`## Set up using Docker`

			`### Pre-built images`

			`### Build image from source`

			`## Extra information`