Logo
Explore Help
Sign In
20231088/vllm
1
0
Fork 0
You've already forked vllm
Code Issues Pull Requests Actions 1 Packages Projects Releases Wiki Activity
vllm/examples
History
shiyi.c_98 d10f8e1d43
[Experimental] Prefix Caching Support (#1669)
Co-authored-by: DouHappy <2278958187@qq.com>
Co-authored-by: Zhuohan Li <zhuohan123@gmail.com>
2024-01-17 16:32:10 -08:00
..
api_client.py
[Quality] Add code formatter and linter (#326)
2023-07-03 11:31:55 -07:00
gradio_openai_chatbot_webserver.py
Add gradio chatbot for openai webserver (#2307)
2024-01-11 19:45:56 -08:00
gradio_webserver.py
Remove deprecated parameter: concurrency_count (#2315)
2024-01-03 09:56:21 -08:00
llm_engine_example.py
Refactor LLMEngine demo script for clarity and modularity (#1413)
2023-10-30 09:14:37 -07:00
offline_inference_with_prefix.py
[Experimental] Prefix Caching Support (#1669)
2024-01-17 16:32:10 -08:00
offline_inference.py
[Quality] Add code formatter and linter (#326)
2023-07-03 11:31:55 -07:00
openai_chatcompletion_client.py
chore(examples-docs): upgrade to OpenAI V1 (#1785)
2023-12-03 01:11:22 -08:00
openai_completion_client.py
chore(examples-docs): upgrade to OpenAI V1 (#1785)
2023-12-03 01:11:22 -08:00
template_alpaca.jinja
Support chat template and echo for chat API (#1756)
2023-11-30 16:43:13 -08:00
template_baichuan.jinja
Add baichuan chat template jinjia file (#2390)
2024-01-09 09:13:02 -08:00
template_chatml.jinja
Support chat template and echo for chat API (#1756)
2023-11-30 16:43:13 -08:00
template_inkbot.jinja
Support chat template and echo for chat API (#1756)
2023-11-30 16:43:13 -08:00
Powered by Gitea Version: 23.0.0 Page: 47ms Template: 3ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API