vllm/cacheflow/parallel_utils

The files in this folder are ported from Megatron-LM. We keep only the code that is used for inference.