vllm/cacheflow/__init__.py

from cacheflow.entrypoints.llm import LLM
from cacheflow.outputs import RequestOutput, CompletionOutput
from cacheflow.sampling_params import SamplingParams
from cacheflow.server.arg_utils import ServerArgs
from cacheflow.server.llm_server import LLMEngine
from cacheflow.server.ray_utils import initialize_cluster

__version__ = "0.1.0"

__all__ = [
    "LLM",
    "SamplingParams",
    "RequestOutput",
    "CompletionOutput",
    "LLMEngine",
    "ServerArgs",
    "initialize_cluster",
]
Introduce LLM class for offline inference (#115) 2023-05-21 17:04:18 -07:00			`from cacheflow.entrypoints.llm import LLM`
Add throughput benchmarking script (#133) 2023-05-28 03:20:05 -07:00			`from cacheflow.outputs import RequestOutput, CompletionOutput`
Refactor system architecture (#109) 2023-05-20 13:06:59 -07:00			`from cacheflow.sampling_params import SamplingParams`
Introduce LLM class for offline inference (#115) 2023-05-21 17:04:18 -07:00			`from cacheflow.server.arg_utils import ServerArgs`
Rename servers and change port numbers to reduce confusion (#149) 2023-06-17 00:13:02 +08:00			`from cacheflow.server.llm_server import LLMEngine`
Refactor system architecture (#109) 2023-05-20 13:06:59 -07:00			`from cacheflow.server.ray_utils import initialize_cluster`

[PyPI] Packaging for PyPI distribution (#140) 2023-06-05 20:03:14 -07:00			`__version__ = "0.1.0"`

Refactor system architecture (#109) 2023-05-20 13:06:59 -07:00			`__all__ = [`
Introduce LLM class for offline inference (#115) 2023-05-21 17:04:18 -07:00			`"LLM",`
Refactor system architecture (#109) 2023-05-20 13:06:59 -07:00			`"SamplingParams",`
Introduce LLM class for offline inference (#115) 2023-05-21 17:04:18 -07:00			`"RequestOutput",`
Add throughput benchmarking script (#133) 2023-05-28 03:20:05 -07:00			`"CompletionOutput",`
Rename servers and change port numbers to reduce confusion (#149) 2023-06-17 00:13:02 +08:00			`"LLMEngine",`
Introduce LLM class for offline inference (#115) 2023-05-21 17:04:18 -07:00			`"ServerArgs",`
Refactor system architecture (#109) 2023-05-20 13:06:59 -07:00			`"initialize_cluster",`
			`]`