This website requires JavaScript.
Explore
Help
Sign In
20231088
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
1
Packages
Projects
Releases
Wiki
Activity
vllm
/
examples
History
shiyi.c_98
d10f8e1d43
[Experimental] Prefix Caching Support (
#1669
)
...
Co-authored-by: DouHappy <2278958187@qq.com> Co-authored-by: Zhuohan Li <zhuohan123@gmail.com>
2024-01-17 16:32:10 -08:00
..
api_client.py
[Quality] Add code formatter and linter (
#326
)
2023-07-03 11:31:55 -07:00
gradio_openai_chatbot_webserver.py
Add gradio chatbot for openai webserver (
#2307
)
2024-01-11 19:45:56 -08:00
gradio_webserver.py
Remove deprecated parameter: concurrency_count (
#2315
)
2024-01-03 09:56:21 -08:00
llm_engine_example.py
Refactor LLMEngine demo script for clarity and modularity (
#1413
)
2023-10-30 09:14:37 -07:00
offline_inference_with_prefix.py
[Experimental] Prefix Caching Support (
#1669
)
2024-01-17 16:32:10 -08:00
offline_inference.py
[Quality] Add code formatter and linter (
#326
)
2023-07-03 11:31:55 -07:00
openai_chatcompletion_client.py
chore(examples-docs): upgrade to OpenAI V1 (
#1785
)
2023-12-03 01:11:22 -08:00
openai_completion_client.py
chore(examples-docs): upgrade to OpenAI V1 (
#1785
)
2023-12-03 01:11:22 -08:00
template_alpaca.jinja
Support chat template and
echo
for chat API (
#1756
)
2023-11-30 16:43:13 -08:00
template_baichuan.jinja
Add baichuan chat template jinjia file (
#2390
)
2024-01-09 09:13:02 -08:00
template_chatml.jinja
Support chat template and
echo
for chat API (
#1756
)
2023-11-30 16:43:13 -08:00
template_inkbot.jinja
Support chat template and
echo
for chat API (
#1756
)
2023-11-30 16:43:13 -08:00