vllm/source at e7c46b9527c9a50253657fd0078a0b1f23560ce4 - vllm - Luminance Code Repo

SangBin Cho e7c46b9527

[Scheduler] Warning upon preemption and Swapping (#4647)

Co-authored-by: Robert Shaw <114415538+robertgshaw2-neuralmagic@users.noreply.github.com>

2024-05-13 23:50:44 +09:00

[Doc] add visualization for multi-stage dockerfile (#4456)

2024-04-30 17:41:59 +00:00

[Doc] add visualization for multi-stage dockerfile (#4456)

2024-04-30 17:41:59 +00:00

getting_started

Unable to find Punica extension issue during source code installation (#4494)

2024-05-01 00:42:09 +00:00

[Scheduler] Warning upon preemption and Swapping (#4647)

2024-05-13 23:50:44 +09:00

Enable scaled FP8 (e4m3fn) KV cache on ROCm (AMD GPU) (#3290)

2024-04-03 14:15:55 -07:00

[Bugfix] Fix CLI arguments in OpenAI server docs (#4709)

2024-05-09 09:53:14 -07:00

conf.py

[CI] Disable non-lazy string operation on logging (#4326)

2024-04-26 00:16:58 -07:00

generate_examples.py

Add example scripts to documentation (#4225)

2024-04-22 16:36:54 +00:00

index.rst

[Doc] Chunked Prefill Documentation (#4580)

2024-05-04 00:18:00 -07:00