# syntax=docker/dockerfile:1
# oneapi 2025.0.2 docker base image uses the Rolling 2448 driver packages
# (https://dgpu-docs.intel.com/releases/packages.html?release=Rolling+2448.13&os=Ubuntu+22.04),
# so we don't need to install the GPU driver manually.
FROM intel/deep-learning-essentials:2025.0.2-0-devel-ubuntu22.04 AS vllm-base

# Remove the intel-graphics apt repo entry shipped with the base image
# (presumably stale/conflicting with the bundled driver packages — see note above).
RUN rm /etc/apt/sources.list.d/intel-graphics.list

# OS-level dependencies (media libs for multimodal inputs, python toolchain).
# Packages sorted alphabetically; apt lists removed in the same layer so the
# cache never ships in the image.
RUN apt-get update -y && \
    apt-get install -y --no-install-recommends --fix-missing \
        curl \
        ffmpeg \
        git \
        libgl1 \
        libsm6 \
        libsndfile1 \
        libxext6 \
        lsb-release \
        numactl \
        python3 \
        python3-dev \
        python3-pip \
        wget && \
    rm -rf /var/lib/apt/lists/*
WORKDIR /workspace/vllm

# Copy only the requirement manifests first so the pip layer below stays
# cached until the manifests themselves change.
COPY requirements/xpu.txt /workspace/vllm/requirements/xpu.txt
COPY requirements/common.txt /workspace/vllm/requirements/common.txt

# Fixed mangled BuildKit mount syntax (no spaces allowed around '=').
# The cache mount persists pip's download cache on the build host, so
# --no-cache-dir (which would defeat the mount) is intentionally not used.
RUN --mount=type=cache,target=/root/.cache/pip \
    pip install \
        -r requirements/xpu.txt
# key=value form is required; spaces around '=' are invalid in ENV/ARG.
ENV LD_LIBRARY_PATH="$LD_LIBRARY_PATH:/usr/local/lib/"

COPY . .

# Set via --build-arg to a non-zero value to run the repository sanity check.
ARG GIT_REPO_CHECK=0
# Bind-mount .git (excluded from the build context by .dockerignore in most
# setups) so tools/check_repo.sh can inspect repository state.
RUN --mount=type=bind,source=.git,target=.git \
    if [ "$GIT_REPO_CHECK" != 0 ]; then bash tools/check_repo.sh; fi

# Select the XPU backend for the vLLM build below.
ENV VLLM_TARGET_DEVICE=xpu
# Build and install vLLM for the XPU target. .git is bind-mounted so the
# build can read git metadata (NOTE(review): presumably for version
# detection — confirm); pip downloads are cached across builds.
RUN --mount=type=cache,target=/root/.cache/pip \
    --mount=type=bind,source=.git,target=.git \
    python3 setup.py install

# Please refer to the xpu doc: we need to manually install
# intel-extension-for-pytorch 2.6.10+xpu because it has conflicting
# dependencies with torch 2.6.0+xpu.
# FIXME: This will be fixed in ipex 2.7. Just leave this here for awareness.
RUN --mount=type=cache,target=/root/.cache/pip \
    pip install intel-extension-for-pytorch==2.6.10+xpu \
        --extra-index-url=https://pytorch-extension.intel.com/release-whl/stable/xpu/us/

CMD ["/bin/bash"]
FROM vllm-base AS vllm-openai

# install additional dependencies for openai api server
RUN --mount=type=cache,target=/root/.cache/pip \
    pip install accelerate hf_transfer 'modelscope!=1.15.0'

# key=value form: the legacy space-separated ENV syntax sets only one
# variable per instruction (the continuation would have folded both names
# into a single value) and is deprecated.
ENV VLLM_USAGE_SOURCE=production-docker-image \
    TRITON_XPU_PROFILE=1

# install development dependencies (for testing)
RUN python3 -m pip install -e tests/vllm_test_utils

ENTRYPOINT ["python3", "-m", "vllm.entrypoints.openai.api_server"]