vllm/requirements-neuron.txt

12 lines
244 B
Plaintext
Raw Normal View History

sentencepiece # Required for LLaMA tokenizer.
numpy
transformers-neuronx >= 0.9.0
torch-neuronx >= 2.1.0
neuronx-cc
fastapi
uvicorn[standard]
2024-01-22 01:05:56 +01:00
pydantic >= 2.0 # Required for OpenAI server.
prometheus_client >= 0.18.0
2024-03-28 22:16:12 -07:00
requests
psutil
py-cpuinfo