20231088/vllm
vllm/docs/source
Zhuohan Li  fd4ea8ef5c  Use NCCL instead of ray for control-plane communication to remove serialization overhead (#2221)  2024-01-03 11:30:22 -08:00
Name             Last updated                Last commit
assets/logos     2023-10-08 23:15:50 -07:00  Update README.md (#1292)
getting_started  2023-12-22 23:20:02 -08:00  [Docs] Update installation instructions to include CUDA 11.8 xFormers (#2246)
models           2024-01-03 11:30:22 -08:00  Use NCCL instead of ray for control-plane communication to remove serialization overhead (#2221)
quantization     2023-12-02 15:52:47 -08:00  [Docs] Update the AWQ documentation to highlight performance issue (#1883)
serving          2023-12-20 12:43:42 -08:00  [Docs] Fix broken links (#2222)
conf.py          2023-06-19 20:03:40 -07:00  Fix repo & documentation URLs (#163)
index.rst        2023-12-17 01:49:20 -08:00  [Docs] Add CUDA graph support to docs (#2148)