20231088/vllm
2,080 Commits, 1 Branch, 0 Tags
3 Commits
Author | SHA1 | Message | Date
Michael Goin | 978aed5300 | [Kernel][Attention] Separate Attention.kv_scale into k_scale and v_scale (#6081) | 2024-07-16 15:31:32 -07:00
Roger Wang | bd620b01fb | [Kernel][CPU] Add Quick gelu to CPU (#5717) | 2024-06-21 06:39:40 +00:00
bnellnm | 5467ac3196 | [Kernel][Misc] Use TORCH_LIBRARY instead of PYBIND11_MODULE for custom ops (#5047) | 2024-06-09 16:23:30 -04:00