20231088/vllm

Benchmarking vLLM

Downloading the ShareGPT dataset

You can download the dataset by running:

wget https://huggingface.co/datasets/anon8231489123/ShareGPT_Vicuna_unfiltered/resolve/main/ShareGPT_V3_unfiltered_cleaned_split.json
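Once the file is downloaded, a quick sanity check can confirm that it parses and contains usable conversations. The sketch below is illustrative, not part of the benchmark scripts. It assumes the JSON is a list of records, each with a `conversations` list whose entries carry a `value` field, and that the file sits in the current directory. The helper names `load_sharegpt` and `first_turn_pairs` are hypothetical.

```python
import json
from pathlib import Path


def load_sharegpt(path):
    """Parse the downloaded JSON file (assumed to be a list of records)."""
    with Path(path).open(encoding="utf-8") as f:
        return json.load(f)


def first_turn_pairs(records):
    """Return (prompt, reply) pairs taken from the first two turns.

    Records with fewer than two turns are skipped, since they do not
    provide both a prompt and a reply.
    """
    pairs = []
    for record in records:
        turns = record.get("conversations", [])
        if len(turns) < 2:
            continue
        pairs.append((turns[0]["value"], turns[1]["value"]))
    return pairs


if __name__ == "__main__":
    # Assumed location: the directory where wget was run.
    path = Path("ShareGPT_V3_unfiltered_cleaned_split.json")
    if path.exists():
        pairs = first_turn_pairs(load_sharegpt(path))
        print(f"{len(pairs)} conversations with at least two turns")
    else:
        print(f"{path} not found; run the wget command first")
```

If the record layout differs from what is assumed here, adjust the field names in `first_turn_pairs` accordingly.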