Nick Hill | 99dac099ab | [Core][Doc] Default to multiprocessing for single-node distributed case (#5230); Co-authored-by: Antoni Baum <antoni.baum@protonmail.com> | 2024-06-11 11:10:41 -07:00 |
Roger Wang | 7a9cb294ae | [Frontend] Add OpenAI Vision API Support (#5237); Co-authored-by: DarkLight1337 <tlleungac@connect.ust.hk> | 2024-06-07 11:23:32 -07:00 |
Breno Faria | f775a07e30 | [FRONTEND] OpenAI tools support named functions (#5032) | 2024-06-03 18:25:29 -05:00 |
Chansung Park | 429d89720e | add doc about serving option on dstack (#3074); Co-authored-by: Roger Wang <ywang@roblox.com> | 2024-05-30 10:11:07 -07:00 |
youkaichao | 4fbcb0f27e | [Doc][Build] update after removing vllm-nccl (#5103); Co-authored-by: Roger Wang <136131678+ywang96@users.noreply.github.com> | 2024-05-29 23:51:18 +00:00 |
Cyrus Leung | 5ae5ed1e60 | [Core] Consolidate prompt arguments to LLM engines (#4328); Co-authored-by: Roger Wang <ywang@roblox.com> | 2024-05-28 13:29:31 -07:00 |
Kante Yin | 8e7fb5d43a | Support to serve vLLM on Kubernetes with LWS (#4829); Signed-off-by: kerthcet <kerthcet@gmail.com> | 2024-05-16 16:37:29 -07:00 |
Cyrus Leung | 4bfa7e7f75 | [Doc] Add API reference for offline inference (#4710) | 2024-05-13 17:47:42 -07:00 |
Cyrus Leung | a3c124570a | [Bugfix] Fix CLI arguments in OpenAI server docs (#4709) | 2024-05-09 09:53:14 -07:00 |
youkaichao | 2d7bce9cd5 | [Doc] add env vars to the doc (#4572) | 2024-05-03 05:13:49 +00:00 |
Frαnçois | e491c7e053 | [Doc] update(example model): for OpenAI compatible serving (#4503) | 2024-05-01 10:14:16 -07:00 |
youkaichao | 2768884ac4 | [Doc] Add note for docker user (#4340); Co-authored-by: Simon Mo <simon.mo@hey.com> | 2024-04-24 21:09:44 +00:00 |
Zhanghao Wu | ceaf4ed003 | [Doc] Update the SkyPilot doc with serving and Llama-3 (#4276) | 2024-04-22 15:34:31 -07:00 |
Frαnçois | 92cd2e2f21 | [Doc] Fix getting stared to use publicly available model (#3963) | 2024-04-10 18:05:52 +00:00 |
yhu422 | d8658c8cc1 | Usage Stats Collection (#2852) | 2024-03-28 22:16:12 -07:00 |
Simon Mo | ef65dcfa6f | [Doc] Add docs about OpenAI compatible server (#3288) | 2024-03-18 22:05:34 -07:00 |
Sherlock Xu | b0925b3878 | docs: Add BentoML deployment doc (#3336); Signed-off-by: Sherlock113 <sherlockxu07@gmail.com> | 2024-03-12 10:34:30 -07:00 |
Yuan Tang | 49d849b3ab | docs: Add tutorial on deploying vLLM model with KServe (#2586); Signed-off-by: Yuan Tang <terrytangyuan@gmail.com> | 2024-03-01 11:04:14 -08:00 |
Massimiliano Pronesti | 5ed704ec8c | docs: fix langchain (#2736) | 2024-02-03 18:17:55 -08:00 |
Ronen Schaffer | c17daa9f89 | [Docs] Fix broken links (#2222) | 2023-12-20 12:43:42 -08:00 |
Ikko Eltociear Ashimine | c0ce15dfb2 | Update run_on_sky.rst (#2025); sharable -> shareable | 2023-12-11 10:32:58 -08:00 |
Simon Mo | c85b80c2b6 | [Docker] Add cuda arch list as build option (#1950) | 2023-12-08 09:53:47 -08:00 |
AguirreNicolas | 24f60a54f4 | [Docker] Adding number of nvcc_threads during build as envar (#1893) | 2023-12-07 11:00:32 -08:00 |
Simon Mo | 5313c2cb8b | Add Production Metrics in Prometheus format (#1890) | 2023-12-02 16:37:44 -08:00 |
Massimiliano Pronesti | 05a38612b0 | docs: add instruction for langchain (#1162) | 2023-11-30 10:57:44 -08:00 |
Simon Mo | 0f621c2c7d | [Docs] Add information about using shared memory in docker (#1845) | 2023-11-29 18:33:56 -08:00 |
Wen Sun | 112627e8b2 | [Docs] Fix the code block's format in deploying_with_docker page (#1722) | 2023-11-20 01:22:39 -08:00 |
Simon Mo | 37c1e3c218 | Documentation about official docker image (#1709) | 2023-11-19 20:56:26 -08:00 |
Stephen Krider | 9cabcb7645 | Add Dockerfile (#1350) | 2023-10-31 12:36:47 -07:00 |
Tanmay Verma | 6f2dd6c37e | Add documentation to Triton server tutorial (#983) | 2023-09-20 10:32:40 -07:00 |
Zhanghao Wu | 58df2883cb | [Doc] Add doc for running vLLM on the cloud (#426); Co-authored-by: Zhuohan Li <zhuohan123@gmail.com> | 2023-07-16 13:37:14 -07:00 |
Zhuohan Li | 2cf1a333b6 | [Doc] Documentation for distributed inference (#261) | 2023-06-26 11:34:23 -07:00 |