This website requires JavaScript.
Explore
Help
Register
Sign In
20231088
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
1
Packages
Projects
Releases
Wiki
Activity
vllm
/
tests
/
v1
/
core
History
Woosuk Kwon
cd4a72a28d
[V1][Spec decode] Move drafter to model runner (
#13363
)
...
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
2025-02-17 15:40:12 -08:00
..
test_kv_cache_utils.py
[V1][Metrics] Add GPU prefix cache hit rate % gauge (
#12592
)
2025-02-11 08:27:25 +00:00
test_prefix_caching.py
[V1] Move KV block hashes from Request to KVCacheManager (
#12922
)
2025-02-07 19:14:10 -08:00
test_scheduler.py
[V1][Spec decode] Move drafter to model runner (
#13363
)
2025-02-17 15:40:12 -08:00