This website requires JavaScript.
Explore
Help
Sign In
20231088
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
1
Packages
Projects
Releases
Wiki
Activity
3,589
Commits
1
Branch
0
Tags
Commit Graph
3 Commits
Author
SHA1
Message
Date
Cody Yu
d11bf435a0
[MISC] Consolidate cleanup() and refactor offline_inference_with_prefix.py (
#9510
)
2024-10-18 14:30:55 -07:00
Joe Runde
de4008e2ab
[Bugfix][Core] Use torch.cuda.memory_stats() to profile peak memory usage (
#9352
)
...
Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
2024-10-17 22:47:27 -04:00
youkaichao
7d9ffa2ae1
[misc][core] lazy import outlines (
#7831
)
2024-08-24 00:51:38 -07:00