vllm/source at 35bd2151684ffb20cdad825abe33e0e6f0cc005a - vllm - Luminance Code Repo

20231088/vllm

History

whyiug e01ab595d8

[Model] support input embeddings for qwen2vl (#8856 )

2024-09-30 03:16:10 +00:00

..

[Docs] Add RunLLM chat widget (#6857 )

2024-07-27 09:24:46 -07:00

_templates/sections

[Doc] Guide for adding multi-modal plugins (#6205 )

2024-07-10 14:55:34 +08:00

[Doc] add visualization for multi-stage dockerfile (#4456 )

2024-04-30 17:41:59 +00:00

automatic_prefix_caching

[Doc] Add an automatic prefix caching section in vllm documentation (#5324 )

2024-06-11 10:24:59 -07:00

Add NVIDIA Meetup slides, announce AMD meetup, and add contact info (#8319 )

2024-09-09 23:21:00 -07:00

[Core] renamePromptInputs and inputs (#8876 )

2024-09-26 20:35:15 -07:00

getting_started

[doc] organize installation doc and expose per-commit docker (#8931 )

2024-09-28 17:48:41 -07:00

[Model] support input embeddings for qwen2vl (#8856 )

2024-09-30 03:16:10 +00:00

performance_benchmark

[Doc] fix 404 link (#7966 )

2024-08-28 13:54:23 -07:00

[[Misc]Upgrade bitsandbytes to the latest version 0.44.0 (#8768 )

2024-09-24 17:08:55 -07:00

[Feature] Add support for Llama 3.1 and 3.2 tool use (#8343 )

2024-09-26 17:01:42 -07:00

conf.py

[model] Support for Llava-Next-Video model (#7559 )

2024-09-10 22:21:36 -07:00

generate_examples.py

Add example scripts to documentation (#4225 )

2024-04-22 16:36:54 +00:00

index.rst

[Doc] neuron documentation update (#8671 )

2024-09-20 15:04:37 -07:00