vllm/source at baee28c46c242b72f90d6b1211ab9d7872ab05d3 - vllm - Luminance Code Repo

Yuan Tang 49d849b3ab

docs: Add tutorial on deploying vLLM model with KServe (#2586)

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

2024-03-01 11:04:14 -08:00

Update README.md (#1292)

2023-10-08 23:15:50 -07:00

[DOC] Add additional comments for LLMEngine and AsyncLLMEngine (#1011)

2024-01-11 19:26:49 -08:00

getting_started

[ROCm] support Radeon™ 7900 series (gfx1100) without using flash-attention (#2768)

2024-02-10 23:14:37 -08:00

multi-lora documentation fix (#3064)

2024-02-27 21:26:15 -08:00

[CI] Ensure documentation build is checked in CI (#2842)

2024-02-12 22:53:07 -08:00

docs: Add tutorial on deploying vLLM model with KServe (#2586)

2024-03-01 11:04:14 -08:00

conf.py

Port metrics from aioprometheus to prometheus_client (#2730)

2024-02-25 11:54:00 -08:00

index.rst

docs: Add tutorial on deploying vLLM model with KServe (#2586)

2024-03-01 11:04:14 -08:00