2 Commits

Author SHA1 Message Date
Woosuk Kwon
3f942acfe1
Fix latency benchmark script (#118) 2023-05-22 17:03:40 -07:00
Woosuk Kwon
655a5e48df
Introduce LLM class for offline inference (#115) 2023-05-21 17:04:18 -07:00