Stephen Krider
|
1356df53bd
|
[Kernel] Use flash-attn for decoding (#3648)
Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
Co-authored-by: LiuXiaoxuanPKU <lilyliupku@gmail.com>
|
2024-05-13 15:50:33 -07:00 |
|
SangBin Cho
|
f6a593093a
|
[CI] Make mistral tests pass (#4596)
|
2024-05-08 08:44:35 -07:00 |
|
Jee Li
|
d6f4bd7cdd
|
[Misc]Add customized information for models (#4132)
|
2024-04-30 21:18:14 -07:00 |
|
SangBin Cho
|
26422e477b
|
[Test] Make model tests run again and remove --forked from pytest (#3631)
Co-authored-by: Simon Mo <simon.mo@hey.com>
|
2024-03-28 21:06:40 -07:00 |
|