Cyrus Leung | 5bf35a91e4 | [Doc][CI/Build] Update docs and tests to use vllm serve (#6431) | 2024-07-17 07:43:21 +00:00
Cyrus Leung | 9389380015 | [Doc] Move guide for multimodal model and other improvements (#6168) | 2024-07-06 17:18:59 +08:00
Cyrus Leung | 5cbe8d155c | [Core] Registry for processing model inputs (#5214) (Co-authored-by: ywang96 <ywang@roblox.com>) | 2024-06-28 12:09:56 +00:00
xiaoji | 7f2593b164 | [Doc]: Update the doc of adding new models (#4236) | 2024-04-21 09:57:08 -07:00
youkaichao | 95baec828f | [Core] enable out-of-tree model register (#3871) | 2024-04-06 17:11:41 -07:00
Woosuk Kwon | e66b629c04 | [Misc] Minor fix in KVCache type (#3652) | 2024-03-26 23:14:06 -07:00
Zhuohan Li | fd4ea8ef5c | Use NCCL instead of ray for control-plane communication to remove serialization overhead (#2221) | 2024-01-03 11:30:22 -08:00
Peter Götz | d940ce497e | Fix typo in adding_model.rst (#1947): adpated -> adapted | 2023-12-06 10:04:26 -08:00
Simon Mo | 0f621c2c7d | [Docs] Add information about using shared memory in docker (#1845) | 2023-11-29 18:33:56 -08:00
Zhuohan Li | 0fc280b06c | Update the adding-model doc according to the new refactor (#1692) | 2023-11-16 18:46:26 -08:00
Zhuohan Li | 002800f081 | Align vLLM's beam search implementation with HF generate (#857) | 2023-09-04 17:29:42 -07:00
Woosuk Kwon | 794e578de0 | [Minor] Fix URLs (#166) | 2023-06-19 22:57:14 -07:00
Woosuk Kwon | b7e62d3454 | Fix repo & documentation URLs (#163) | 2023-06-19 20:03:40 -07:00
Woosuk Kwon | 0b98ba15c7 | Change the name to vLLM (#150) | 2023-06-17 03:07:40 -07:00
Woosuk Kwon | 456941cfe4 | [Docs] Write the Adding a New Model section (#138) | 2023-06-05 20:01:26 -07:00
Woosuk Kwon | 62ec38ea41 | Document supported models (#127) | 2023-06-02 22:35:17 -07:00