Yuan Tang
|
4800339c62
|
Add docs on serving with Llama Stack (#10183)
Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>
Co-authored-by: Russell Bryant <rbryant@redhat.com>
|
2024-11-11 11:28:55 -08:00 |
|
Kameshwara Pavan Kumar Mantha
|
22b39e11f2
|
llama_index serving integration documentation (#6973)
Co-authored-by: pavanmantha <pavan.mantha@thevaslabs.io>
|
2024-08-14 15:38:37 -07:00 |
|
milo157
|
2bd231a7b7
|
[Doc] Added cerebrium as Integration option (#5553)
|
2024-06-18 15:56:59 -07:00 |
|
Chansung Park
|
429d89720e
|
add doc about serving option on dstack (#3074)
Co-authored-by: Roger Wang <ywang@roblox.com>
|
2024-05-30 10:11:07 -07:00 |
|
Kante Yin
|
8e7fb5d43a
|
Support to serve vLLM on Kubernetes with LWS (#4829)
Signed-off-by: kerthcet <kerthcet@gmail.com>
|
2024-05-16 16:37:29 -07:00 |
|
Simon Mo
|
ef65dcfa6f
|
[Doc] Add docs about OpenAI compatible server (#3288)
|
2024-03-18 22:05:34 -07:00 |
|