(compatibility-matrix)= # Compatibility Matrix The tables below show mutually exclusive features and the support on some hardware. The symbols used have the following meanings: - ✅ = Full compatibility - 🟠 = Partial compatibility - ❌ = No compatibility :::{note} Check the ❌ or 🟠 with links to see tracking issue for unsupported feature/hardware combination. ::: ## Feature x Feature :::{raw} html ::: :::{list-table} :header-rows: 1 :stub-columns: 1 :widths: auto :class: vertical-table-header - * Feature * [CP](#chunked-prefill) * [APC](#automatic-prefix-caching) * [LoRA](#lora-adapter) * prmpt adptr * [SD](#spec_decode) * CUDA graph * pooling * enc-dec * logP * prmpt logP * async output * multi-step * mm * best-of * beam-search * guided dec - * [CP](#chunked-prefill) * ✅ * * * * * * * * * * * * * * * - * [APC](#automatic-prefix-caching) * ✅ * ✅ * * * * * * * * * * * * * * - * [LoRA](#lora-adapter) * ✅ * ✅ * ✅ * * * * * * * * * * * * * - * prmpt adptr * ✅ * ✅ * ✅ * ✅ * * * * * * * * * * * * - * [SD](#spec_decode) * ✅ * ✅ * ❌ * ✅ * ✅ * * * * * * * * * * * - * CUDA graph * ✅ * ✅ * ✅ * ✅ * ✅ * ✅ * * * * * * * * * * - * pooling * ❌ * ❌ * ❌ * ❌ * ❌ * ❌ * ✅ * * * * * * * * * - * enc-dec * ❌ * [❌](gh-issue:7366) * ❌ * ❌ * [❌](gh-issue:7366) * ✅ * ✅ * ✅ * * * * * * * * - * logP * ✅ * ✅ * ✅ * ✅ * ✅ * ✅ * ❌ * ✅ * ✅ * * * * * * * - * prmpt logP * ✅ * ✅ * ✅ * ✅ * ✅ * ✅ * ❌ * ✅ * ✅ * ✅ * * * * * * - * async output * ✅ * ✅ * ✅ * ✅ * ❌ * ✅ * ❌ * ❌ * ✅ * ✅ * ✅ * * * * * - * multi-step * ❌ * ✅ * ❌ * ✅ * ❌ * ✅ * ❌ * ❌ * ✅ * ✅ * ✅ * ✅ * * * * - * mm * ✅ * [🟠](gh-pr:8348) * [🟠](gh-pr:4194) * ❔ * ❔ * ✅ * ✅ * ✅ * ✅ * ✅ * ✅ * ❔ * ✅ * * * - * best-of * ✅ * ✅ * ✅ * ✅ * [❌](gh-issue:6137) * ✅ * ❌ * ✅ * ✅ * ✅ * ❔ * [❌](gh-issue:7968) * ✅ * ✅ * * - * beam-search * ✅ * ✅ * ✅ * ✅ * [❌](gh-issue:6137) * ✅ * ❌ * ✅ * ✅ * ✅ * ❔ * [❌](gh-issue:7968) * ❔ * ✅ * ✅ * - * guided dec * ✅ * ✅ * ❔ * ❔ * [❌](gh-issue:11484) * ✅ * ❌ * ❔ * ✅ * ✅ * ✅ * [❌](gh-issue:9893) * ❔ * ✅ * ✅ * ✅ ::: (feature-x-hardware)= ## Feature x Hardware :::{list-table} :header-rows: 1 :stub-columns: 1 :widths: auto - * Feature * Volta * Turing * Ampere * Ada * Hopper * CPU * AMD - * [CP](#chunked-prefill) * [❌](gh-issue:2729) * ✅ * ✅ * ✅ * ✅ * ✅ * ✅ - * [APC](#automatic-prefix-caching) * [❌](gh-issue:3687) * ✅ * ✅ * ✅ * ✅ * ✅ * ✅ - * [LoRA](#lora-adapter) * ✅ * ✅ * ✅ * ✅ * ✅ * ✅ * ✅ - * prmpt adptr * ✅ * ✅ * ✅ * ✅ * ✅ * [❌](gh-issue:8475) * ✅ - * [SD](#spec_decode) * ✅ * ✅ * ✅ * ✅ * ✅ * ✅ * ✅ - * CUDA graph * ✅ * ✅ * ✅ * ✅ * ✅ * ❌ * ✅ - * pooling * ✅ * ✅ * ✅ * ✅ * ✅ * ✅ * ❔ - * enc-dec * ✅ * ✅ * ✅ * ✅ * ✅ * ✅ * ❌ - * mm * ✅ * ✅ * ✅ * ✅ * ✅ * ✅ * ✅ - * logP * ✅ * ✅ * ✅ * ✅ * ✅ * ✅ * ✅ - * prmpt logP * ✅ * ✅ * ✅ * ✅ * ✅ * ✅ * ✅ - * async output * ✅ * ✅ * ✅ * ✅ * ✅ * ❌ * ❌ - * multi-step * ✅ * ✅ * ✅ * ✅ * ✅ * [❌](gh-issue:8477) * ✅ - * best-of * ✅ * ✅ * ✅ * ✅ * ✅ * ✅ * ✅ - * beam-search * ✅ * ✅ * ✅ * ✅ * ✅ * ✅ * ✅ - * guided dec * ✅ * ✅ * ✅ * ✅ * ✅ * ✅ * ✅ :::