vllm/source at a9bcc7afb23d208efaa1b47549fa93eaa1d9d6cf - vllm - Luminance Code Repo

20231088/vllm

History

Cyrus Leung a9bcc7afb2

[Doc] Use intersphinx and update entrypoints docs (#5125 )

2024-05-30 09:59:23 -07:00

..

[Doc] add visualization for multi-stage dockerfile (#4456 )

2024-04-30 17:41:59 +00:00

[Docs] Add Dropbox as sponsors (#5089 )

2024-05-28 10:29:09 -07:00

[Core] Consolidate prompt arguments to LLM engines (#4328 )

2024-05-28 13:29:31 -07:00

getting_started

[Doc] add ccache guide in doc (#5012 )

2024-05-23 23:21:54 +00:00

[Kernel][Backend][Model] Blocksparse flash attention kernel and Phi-3-Small model (#4799 )

2024-05-24 22:00:52 -07:00

Enable scaled FP8 (e4m3fn) KV cache on ROCm (AMD GPU) (#3290 )

2024-04-03 14:15:55 -07:00

[Doc][Build] update after removing vllm-nccl (#5103 )

2024-05-29 23:51:18 +00:00

conf.py

[Doc] Use intersphinx and update entrypoints docs (#5125 )

2024-05-30 09:59:23 -07:00

generate_examples.py

Add example scripts to documentation (#4225 )

2024-04-22 16:36:54 +00:00

index.rst

[Core] Consolidate prompt arguments to LLM engines (#4328 )

2024-05-28 13:29:31 -07:00