(compatibility-matrix)= # Compatibility Matrix The tables below show mutually exclusive features and the support on some hardware. :::{note} Check the '✗' with links to see tracking issue for unsupported feature/hardware combination. ::: ## Feature x Feature :::{raw} html ::: :::{list-table} :header-rows: 1 :stub-columns: 1 :widths: auto - * Feature * [CP](#chunked-prefill) * [APC](#automatic-prefix-caching) * [LoRA](#lora-adapter) * prmpt adptr * [SD](#spec_decode) * CUDA graph * pooling * enc-dec * logP * prmpt logP * async output * multi-step * mm * best-of * beam-search * guided dec - * [CP](#chunked-prefill) * * * * * * * * * * * * * * * * - * [APC](#automatic-prefix-caching) * ✅ * * * * * * * * * * * * * * * - * [LoRA](#lora-adapter) * [✗](gh-pr:9057) * ✅ * * * * * * * * * * * * * * - * prmpt adptr * ✅ * ✅ * ✅ * * * * * * * * * * * * * - * [SD](#spec_decode) * ✅ * ✅ * ✗ * ✅ * * * * * * * * * * * * - * CUDA graph * ✅ * ✅ * ✅ * ✅ * ✅ * * * * * * * * * * * - * pooling * ✗ * ✗ * ✗ * ✗ * ✗ * ✗ * * * * * * * * * * - * enc-dec * ✗ * [✗](gh-issue:7366) * ✗ * ✗ * [✗](gh-issue:7366) * ✅ * ✅ * * * * * * * * * - * logP * ✅ * ✅ * ✅ * ✅ * ✅ * ✅ * ✗ * ✅ * * * * * * * * - * prmpt logP * ✅ * ✅ * ✅ * ✅ * [✗](gh-pr:8199) * ✅ * ✗ * ✅ * ✅ * * * * * * * - * async output * ✅ * ✅ * ✅ * ✅ * ✗ * ✅ * ✗ * ✗ * ✅ * ✅ * * * * * * - * multi-step * ✗ * ✅ * ✗ * ✅ * ✗ * ✅ * ✗ * ✗ * ✅ * [✗](gh-issue:8198) * ✅ * * * * * - * mm * ✅ * [✗](gh-pr:8348) * [✗](gh-pr:7199) * ? * ? * ✅ * ✅ * ✅ * ✅ * ✅ * ✅ * ? * * * * - * best-of * ✅ * ✅ * ✅ * ✅ * [✗](gh-issue:6137) * ✅ * ✗ * ✅ * ✅ * ✅ * ? * [✗](gh-issue:7968) * ✅ * * * - * beam-search * ✅ * ✅ * ✅ * ✅ * [✗](gh-issue:6137) * ✅ * ✗ * ✅ * ✅ * ✅ * ? * [✗](gh-issue:7968>) * ? * ✅ * * - * guided dec * ✅ * ✅ * ? * ? * [✗](gh-issue:11484) * ✅ * ✗ * ? * ✅ * ✅ * ✅ * [✗](gh-issue:9893) * ? * ✅ * ✅ * ::: (feature-x-hardware)= ## Feature x Hardware :::{list-table} :header-rows: 1 :stub-columns: 1 :widths: auto - * Feature * Volta * Turing * Ampere * Ada * Hopper * CPU * AMD - * [CP](#chunked-prefill) * [✗](gh-issue:2729) * ✅ * ✅ * ✅ * ✅ * ✅ * ✅ - * [APC](#automatic-prefix-caching) * [✗](gh-issue:3687) * ✅ * ✅ * ✅ * ✅ * ✅ * ✅ - * [LoRA](#lora-adapter) * ✅ * ✅ * ✅ * ✅ * ✅ * ✅ * ✅ - * prmpt adptr * ✅ * ✅ * ✅ * ✅ * ✅ * [✗](gh-issue:8475) * ✅ - * [SD](#spec_decode) * ✅ * ✅ * ✅ * ✅ * ✅ * ✅ * ✅ - * CUDA graph * ✅ * ✅ * ✅ * ✅ * ✅ * ✗ * ✅ - * pooling * ✅ * ✅ * ✅ * ✅ * ✅ * ✅ * ? - * enc-dec * ✅ * ✅ * ✅ * ✅ * ✅ * ✅ * ✗ - * mm * ✅ * ✅ * ✅ * ✅ * ✅ * ✅ * ✅ - * logP * ✅ * ✅ * ✅ * ✅ * ✅ * ✅ * ✅ - * prmpt logP * ✅ * ✅ * ✅ * ✅ * ✅ * ✅ * ✅ - * async output * ✅ * ✅ * ✅ * ✅ * ✅ * ✗ * ✗ - * multi-step * ✅ * ✅ * ✅ * ✅ * ✅ * [✗](gh-issue:8477) * ✅ - * best-of * ✅ * ✅ * ✅ * ✅ * ✅ * ✅ * ✅ - * beam-search * ✅ * ✅ * ✅ * ✅ * ✅ * ✅ * ✅ - * guided dec * ✅ * ✅ * ✅ * ✅ * ✅ * ✅ * ✅ :::