This website requires JavaScript.
Explore
Help
Register
Sign In
20231088
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
1
Packages
Projects
Releases
Wiki
Activity
1,288
Commits
1
Branch
0
Tags
Commit Graph
3 Commits
Author
SHA1
Message
Date
SangBin Cho
0d62fe58db
[Bug fix][Core] assert num_new_tokens == 1 fails when SamplingParams.n is not 1 and max_tokens is large & Add tests for preemption (
#4451
)
2024-05-01 19:24:13 -07:00
SangBin Cho
36729bac13
[Test] Test multiple attn backend for chunked prefill. (
#4023
)
2024-04-12 09:56:57 -07:00
SangBin Cho
67b4221a61
[Core][5/N] Fully working chunked prefill e2e (
#3884
)
2024-04-10 17:56:48 -07:00