Doc: Improve benchmark documentation (#9927)
Signed-off-by: Rafael Vasquez <rafvasq21@gmail.com>
This commit is contained in:
parent
104d729656
commit
d7263a1bb8
@ -1,5 +1,6 @@
|
||||
==============
|
||||
Profiling vLLM
|
||||
=================================
|
||||
==============
|
||||
|
||||
We support tracing vLLM workers using the ``torch.profiler`` module. You can enable tracing by setting the ``VLLM_TORCH_PROFILER_DIR`` environment variable to the directory where you want to save the traces: ``VLLM_TORCH_PROFILER_DIR=/mnt/traces/``
|
||||
|
||||
|
@ -126,9 +126,9 @@ Documentation
|
||||
|
||||
.. toctree::
|
||||
:maxdepth: 1
|
||||
:caption: Performance benchmarks
|
||||
:caption: Performance
|
||||
|
||||
performance_benchmark/benchmarks
|
||||
performance/benchmarks
|
||||
|
||||
.. toctree::
|
||||
:maxdepth: 2
|
||||
|
33
docs/source/performance/benchmarks.rst
Normal file
33
docs/source/performance/benchmarks.rst
Normal file
@ -0,0 +1,33 @@
|
||||
.. _benchmarks:
|
||||
|
||||
================
|
||||
Benchmark Suites
|
||||
================
|
||||
|
||||
vLLM contains two sets of benchmarks:
|
||||
|
||||
+ :ref:`Performance benchmarks <performance_benchmarks>`
|
||||
+ :ref:`Nightly benchmarks <nightly_benchmarks>`
|
||||
|
||||
|
||||
.. _performance_benchmarks:
|
||||
|
||||
Performance Benchmarks
|
||||
----------------------
|
||||
|
||||
The performance benchmarks are used for development to confirm whether new changes improve performance under various workloads. They are triggered on every commit with both the ``perf-benchmarks`` and ``ready`` labels, and when a PR is merged into vLLM.
|
||||
|
||||
The latest performance results are hosted on the public `vLLM Performance Dashboard <https://perf.vllm.ai>`_.
|
||||
|
||||
More information on the performance benchmarks and their parameters can be found `here <https://github.com/vllm-project/vllm/blob/main/.buildkite/nightly-benchmarks/performance-benchmarks-descriptions.md>`__.
|
||||
|
||||
.. _nightly_benchmarks:
|
||||
|
||||
Nightly Benchmarks
|
||||
------------------
|
||||
|
||||
These compare vLLM's performance against alternatives (``tgi``, ``trt-llm``, and ``lmdeploy``) when there are major updates of vLLM (e.g., bumping up to a new version). They are primarily intended for consumers to evaluate when to choose vLLM over other options and are triggered on every commit with both the ``perf-benchmarks`` and ``nightly-benchmarks`` labels.
|
||||
|
||||
The latest nightly benchmark results are shared in major release blog posts such as `vLLM v0.6.0 <https://blog.vllm.ai/2024/09/05/perf-update.html>`_.
|
||||
|
||||
More information on the nightly benchmarks and their parameters can be found `here <https://github.com/vllm-project/vllm/blob/main/.buildkite/nightly-benchmarks/nightly-descriptions.md>`__.
|
@ -1,23 +0,0 @@
|
||||
.. _benchmarks:
|
||||
|
||||
Benchmark suites of vLLM
|
||||
========================
|
||||
|
||||
|
||||
|
||||
vLLM contains two sets of benchmarks:
|
||||
|
||||
+ **Performance benchmarks**: benchmark vLLM's performance under various workloads at a high frequency (when a pull request (PR for short) of vLLM is being merged). See `vLLM performance dashboard <https://perf.vllm.ai>`_ for the latest performance results.
|
||||
|
||||
+ **Nightly benchmarks**: compare vLLM's performance against alternatives (tgi, trt-llm, and lmdeploy) when there are major updates of vLLM (e.g., bumping up to a new version). The latest results are available in the `vLLM GitHub README <https://github.com/vllm-project/vllm/blob/main/README.md>`_.
|
||||
|
||||
|
||||
Trigger a benchmark
|
||||
-------------------
|
||||
|
||||
The performance benchmarks and nightly benchmarks can be triggered by submitting a PR to vLLM, and label the PR with `perf-benchmarks` and `nightly-benchmarks`.
|
||||
|
||||
|
||||
.. note::
|
||||
|
||||
Please refer to `vLLM performance benchmark descriptions <https://github.com/vllm-project/vllm/blob/main/.buildkite/nightly-benchmarks/performance-benchmarks-descriptions.md>`_ and `vLLM nightly benchmark descriptions <https://github.com/vllm-project/vllm/blob/main/.buildkite/nightly-benchmarks/nightly-descriptions.md>`_ for detailed descriptions on benchmark environment, workload and metrics.
|
Loading…
x
Reference in New Issue
Block a user