| Author | Commit | Message | Date |
| --- | --- | --- | --- |
| Zhuohan Li | d6fa1be3a8 | [Quality] Add code formatter and linter (#326) | 2023-07-03 11:31:55 -07:00 |
| Woosuk Kwon | 14f0b39cda | [Bugfix] Fix a bug in RequestOutput.finished (#202) | 2023-06-22 00:17:24 -07:00 |
| Woosuk Kwon | 0b98ba15c7 | Change the name to vLLM (#150) | 2023-06-17 03:07:40 -07:00 |
| Zhuohan Li | e5464ee484 | Rename servers to engines (#152) | 2023-06-17 17:25:21 +08:00 |
| Zhuohan Li | eedb46bf03 | Rename servers and change port numbers to reduce confusion (#149) | 2023-06-17 00:13:02 +08:00 |
| Woosuk Kwon | 311490a720 | Add script for benchmarking serving throughput (#145) | 2023-06-14 19:55:38 -07:00 |
| Zhuohan Li | 5020e1e80c | Non-streaming simple fastapi server (#144) | 2023-06-10 10:43:07 -07:00 |
| Zhuohan Li | 4298374265 | Add docstrings for LLMServer and related classes and examples (#142) | 2023-06-07 18:25:20 +08:00 |
| Woosuk Kwon | 211318d44a | Add throughput benchmarking script (#133) | 2023-05-28 03:20:05 -07:00 |
| Zhuohan Li | 057daef778 | OpenAI Compatible Frontend (#116) | 2023-05-23 21:39:50 -07:00 |
| Woosuk Kwon | 655a5e48df | Introduce LLM class for offline inference (#115) | 2023-05-21 17:04:18 -07:00 |
| Woosuk Kwon | f746ced08d | Implement stop strings and best_of (#114) | 2023-05-21 11:18:00 -07:00 |
| Woosuk Kwon | c3442c1f6f | Refactor system architecture (#109) | 2023-05-20 13:06:59 -07:00 |