5 Commits

Author SHA1 Message Date
Woosuk Kwon
62ec38ea41
Document supported models (#127) 2023-06-02 22:35:17 -07:00
Woosuk Kwon
211318d44a
Add throughput benchmarking script (#133) 2023-05-28 03:20:05 -07:00
Woosuk Kwon
4a151dd453
Add activation registry (#126) 2023-05-25 00:09:07 -07:00
Woosuk Kwon
3f942acfe1
Fix latency benchmark script (#118) 2023-05-22 17:03:40 -07:00
Woosuk Kwon
655a5e48df
Introduce LLM class for offline inference (#115) 2023-05-21 17:04:18 -07:00