vllm/source at d03d64fd2e22f1a48e7b78c66d7644e6b6230fb7 - vllm - Luminance Code Repo

20231088/vllm

History

youkaichao d03d64fd2e

[CI/Build] refactor dockerfile & fix pip cache

[CI/Build] fix pip cache with vllm_nccl & refactor dockerfile to build wheels (#3859)

2024-04-04 21:53:16 -07:00

..

fix document error for value and v_vec illustration (#3421 )

2024-03-15 16:06:09 -07:00

[Doc] Add docs about OpenAI compatible server (#3288 )

2024-03-18 22:05:34 -07:00

getting_started

[Hardware][Intel] Add CPU inference backend (#3634 )

2024-04-01 22:07:30 -07:00

[Doc]Add asynchronous engine arguments to documentation. (#3810 )

2024-04-04 21:52:01 -07:00

Enable scaled FP8 (e4m3fn) KV cache on ROCm (AMD GPU) (#3290 )

2024-04-03 14:15:55 -07:00

Usage Stats Collection (#2852 )

2024-03-28 22:16:12 -07:00

conf.py

[CI/Build] refactor dockerfile & fix pip cache

2024-04-04 21:53:16 -07:00

index.rst

Enable scaled FP8 (e4m3fn) KV cache on ROCm (AMD GPU) (#3290 )

2024-04-03 14:15:55 -07:00