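# Example: run the LLaVA-1.5 vision-language model on a single image with vLLM.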
from vllm import LLM
from vllm.assets.image import ImageAsset


def run_llava():
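    # Load the LLaVA-1.5 7B model; weights are fetched from Hugging Face on first use.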
    llm = LLM(model="llava-hf/llava-1.5-7b-hf")
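    # LLaVA-1.5 chat format; the "<image>" placeholder marks where the image is injected.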
    prompt = "USER: <image>\nWhat is the content of this image?\nASSISTANT:"
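    # Load vLLM's "stop_sign" test image asset as a PIL image.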
    image = ImageAsset("stop_sign").pil_image
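    # Generate a completion, passing the image alongside the prompt via multi_modal_data.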
    outputs = llm.generate({
        "prompt": prompt,
        "multi_modal_data": {
            "image": image
        },
    })
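    # Print the generated text of each output.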
    for o in outputs:
        generated_text = o.outputs[0].text
        print(generated_text)


if __name__ == "__main__":
    run_llava()