This website requires JavaScript.
Explore
Help
Register
Sign In
20231088
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
1
Packages
Projects
Releases
Wiki
Activity
vllm
/
docs
/
source
/
models
History
ywfang
8a0cf1ddc3
[Model] support minicpm3 (
#8297
)
...
Co-authored-by: DarkLight1337 <tlleungac@connect.ust.hk>
2024-09-14 14:50:26 +00:00
..
adding_model.rst
[Doc][CI/Build] Update docs and tests to use
vllm serve
(
#6431
)
2024-07-17 07:43:21 +00:00
enabling_multimodal_inputs.rst
[VLM][Core] Support profiling with multiple multi-modal inputs per prompt (
#7126
)
2024-08-14 17:55:42 +00:00
engine_args.rst
[Doc][CI/Build] Update docs and tests to use
vllm serve
(
#6431
)
2024-07-17 07:43:21 +00:00
lora.rst
[Core] Support load and unload LoRA in api server (
#6566
)
2024-09-05 18:10:33 -07:00
performance.rst
[Scheduler] Warning upon preemption and Swapping (
#4647
)
2024-05-13 23:50:44 +09:00
spec_decode.rst
[Documentation][Spec Decode] Add documentation about lossless guarantees in Speculative Decoding in vLLM (
#7962
)
2024-09-05 16:25:29 -04:00
supported_models.rst
[Model] support minicpm3 (
#8297
)
2024-09-14 14:50:26 +00:00
vlm.rst
[Doc] Indicate more information about supported modalities (
#8181
)
2024-09-05 10:51:53 +00:00