Lucas Wilkinson
cabaf4eff3
[Attention] MLA decode optimizations (#12528)
Signed-off-by: Lucas Wilkinson <lwilkinson@neuralmagic.com>
Signed-off-by: simon-mo <xmo@berkeley.edu>
Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
Co-authored-by: simon-mo <simon.mo@hey.com>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
Co-authored-by: Zhuohan Li <zhuohan123@gmail.com>
Co-authored-by: Tyler Michael Smith <tysmith@redhat.com>
Co-authored-by: Alexander Matveev <59768536+alexm-neuralmagic@users.noreply.github.com>
Co-authored-by: simon-mo <xmo@berkeley.edu>
2025-01-30 23:49:37 -08:00
..
2025-01-23 18:04:03 +00:00
2025-01-30 18:33:00 -08:00
2025-01-23 18:04:03 +00:00
2025-01-30 18:33:00 -08:00
2025-01-05 10:20:34 +09:00
2025-01-27 17:23:08 -07:00
2025-01-15 12:47:49 +08:00
2025-01-30 18:33:00 -08:00
2025-01-27 17:23:08 -07:00
2025-01-05 10:20:34 +09:00
2025-01-15 02:29:53 +00:00
2025-01-30 23:49:37 -08:00
2025-01-30 23:49:37 -08:00
2024-06-02 14:13:26 -07:00
2024-06-09 16:23:30 -04:00
2024-08-20 07:09:33 -06:00
2025-01-22 14:39:32 +08:00
2024-11-06 23:50:47 -08:00
2024-11-06 23:50:47 -08:00
2025-01-27 17:23:08 -07:00
2024-12-13 03:19:23 +00:00
2024-11-08 21:20:08 +00:00
2024-11-08 21:20:08 +00:00
2025-01-23 18:04:03 +00:00
2024-09-23 13:46:26 -04:00
2024-06-09 16:23:30 -04:00
2025-01-30 23:49:37 -08:00
2024-11-08 21:20:08 +00:00