(compatibility-matrix)=
# Compatibility Matrix
The tables below show mutually exclusive features and the support on some hardware.
:::{note}
Check the '✗' with links to see tracking issue for unsupported feature/hardware combination.
:::
## Feature x Feature
:::{raw} html
:::
:::{list-table}
:header-rows: 1
:stub-columns: 1
:widths: auto
- * Feature
* [CP](#chunked-prefill)
* [APC](#automatic-prefix-caching)
* [LoRA](#lora-adapter)
* prmpt adptr
* [SD](#spec_decode)
* CUDA graph
* pooling
* enc-dec
* logP
* prmpt logP
* async output
* multi-step
* mm
* best-of
* beam-search
* guided dec
- * [CP](#chunked-prefill)
*
*
*
*
*
*
*
*
*
*
*
*
*
*
*
*
- * [APC](#automatic-prefix-caching)
* ✅
*
*
*
*
*
*
*
*
*
*
*
*
*
*
*
- * [LoRA](#lora-adapter)
* [✗](gh-pr:9057)
* ✅
*
*
*
*
*
*
*
*
*
*
*
*
*
*
- * prmpt adptr
* ✅
* ✅
* ✅
*
*
*
*
*
*
*
*
*
*
*
*
*
- * [SD](#spec_decode)
* ✅
* ✅
* ✗
* ✅
*
*
*
*
*
*
*
*
*
*
*
*
- * CUDA graph
* ✅
* ✅
* ✅
* ✅
* ✅
*
*
*
*
*
*
*
*
*
*
*
- * pooling
* ✗
* ✗
* ✗
* ✗
* ✗
* ✗
*
*
*
*
*
*
*
*
*
*
- * enc-dec
* ✗
* [✗](gh-issue:7366)
* ✗
* ✗
* [✗](gh-issue:7366)
* ✅
* ✅
*
*
*
*
*
*
*
*
*
- * logP
* ✅
* ✅
* ✅
* ✅
* ✅
* ✅
* ✗
* ✅
*
*
*
*
*
*
*
*
- * prmpt logP
* ✅
* ✅
* ✅
* ✅
* [✗](gh-pr:8199)
* ✅
* ✗
* ✅
* ✅
*
*
*
*
*
*
*
- * async output
* ✅
* ✅
* ✅
* ✅
* ✗
* ✅
* ✗
* ✗
* ✅
* ✅
*
*
*
*
*
*
- * multi-step
* ✗
* ✅
* ✗
* ✅
* ✗
* ✅
* ✗
* ✗
* ✅
* [✗](gh-issue:8198)
* ✅
*
*
*
*
*
- * mm
* ✅
* [✗](gh-pr:8348)
* [✗](gh-pr:7199)
* ?
* ?
* ✅
* ✅
* ✅
* ✅
* ✅
* ✅
* ?
*
*
*
*
- * best-of
* ✅
* ✅
* ✅
* ✅
* [✗](gh-issue:6137)
* ✅
* ✗
* ✅
* ✅
* ✅
* ?
* [✗](gh-issue:7968)
* ✅
*
*
*
- * beam-search
* ✅
* ✅
* ✅
* ✅
* [✗](gh-issue:6137)
* ✅
* ✗
* ✅
* ✅
* ✅
* ?
* [✗](gh-issue:7968)
* ?
* ✅
*
*
- * guided dec
* ✅
* ✅
* ?
* ?
* [✗](gh-issue:11484)
* ✅
* ✗
* ?
* ✅
* ✅
* ✅
* [✗](gh-issue:9893)
* ?
* ✅
* ✅
*
:::
(feature-x-hardware)=
## Feature x Hardware
:::{list-table}
:header-rows: 1
:stub-columns: 1
:widths: auto
- * Feature
* Volta
* Turing
* Ampere
* Ada
* Hopper
* CPU
* AMD
- * [CP](#chunked-prefill)
* [✗](gh-issue:2729)
* ✅
* ✅
* ✅
* ✅
* ✅
* ✅
- * [APC](#automatic-prefix-caching)
* [✗](gh-issue:3687)
* ✅
* ✅
* ✅
* ✅
* ✅
* ✅
- * [LoRA](#lora-adapter)
* ✅
* ✅
* ✅
* ✅
* ✅
* ✅
* ✅
- * prmpt adptr
* ✅
* ✅
* ✅
* ✅
* ✅
* [✗](gh-issue:8475)
* ✅
- * [SD](#spec_decode)
* ✅
* ✅
* ✅
* ✅
* ✅
* ✅
* ✅
- * CUDA graph
* ✅
* ✅
* ✅
* ✅
* ✅
* ✗
* ✅
- * pooling
* ✅
* ✅
* ✅
* ✅
* ✅
* ✅
* ?
- * enc-dec
* ✅
* ✅
* ✅
* ✅
* ✅
* ✅
* ✗
- * mm
* ✅
* ✅
* ✅
* ✅
* ✅
* ✅
* ✅
- * logP
* ✅
* ✅
* ✅
* ✅
* ✅
* ✅
* ✅
- * prmpt logP
* ✅
* ✅
* ✅
* ✅
* ✅
* ✅
* ✅
- * async output
* ✅
* ✅
* ✅
* ✅
* ✅
* ✗
* ✗
- * multi-step
* ✅
* ✅
* ✅
* ✅
* ✅
* [✗](gh-issue:8477)
* ✅
- * best-of
* ✅
* ✅
* ✅
* ✅
* ✅
* ✅
* ✅
- * beam-search
* ✅
* ✅
* ✅
* ✅
* ✅
* ✅
* ✅
- * guided dec
* ✅
* ✅
* ✅
* ✅
* ✅
* ✅
* ✅
:::