124 Commits

Author SHA1 Message Date
Cyrus Leung
8ceffbf315
[Doc][3/N] Reorganize Serving section (#11766)
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
2025-01-07 11:20:01 +08:00
Simon Mo
82d24f7aac
[Docs] Document Deepseek V3 support (#11535)
Signed-off-by: simon-mo <simon.mo@hey.com>
2024-12-26 16:21:56 -08:00
Simon Mo
8fb26dac61
[Docs] Add media kit (#11121) 2024-12-11 17:33:11 -08:00
Diego Marinho
bfd610430c
Update README.md (#11034) 2024-12-09 23:08:10 -08:00
Simon Mo
452a4e80c3
[Docs] Add Snowflake Slides (#10641)
Signed-off-by: simon-mo <simon.mo@hey.com>
2024-11-25 09:34:46 -08:00
Zhuohan Li
49628fe13e
[Doc] Update README.md with Ray Summit talk links (#10610) 2024-11-24 16:45:09 -08:00
Simon Mo
c76ac49d26
[Docs] Add Nebius as sponsors (#10371)
Signed-off-by: simon-mo <simon.mo@hey.com>
2024-11-15 12:47:40 -08:00
Woosuk Kwon
1dbae0329c
[Docs] Publish meetup slides (#10331)
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
2024-11-14 16:19:38 +00:00
Lily Liu
64cb1cdc3f
Update README.md (#9819) 2024-10-29 17:28:43 -07:00
Simon Mo
8d7724104a
[Docs] Add notes about Snowflake Meetup (#9814)
Signed-off-by: simon-mo <simon.mo@hey.com>
2024-10-29 15:19:02 -07:00
Yuan Tang
dbfa8d31d5
Add notes on the use of Slack (#9442) 2024-10-17 04:46:46 +00:00
youkaichao
e1faa2a598
[misc] improve ux on readme (#9147) 2024-10-07 22:26:25 -07:00
Simon Mo
8eeb857084
Add Slack to README (#9137) 2024-10-07 17:06:21 -07:00
Kuntai Du
c0d9a98d0c
[Doc] Include performance benchmark in README (#9135) 2024-10-07 15:04:06 -07:00
Zhuohan Li
a95354a36e
[Doc] Update README.md with Ray summit slides (#9088) 2024-10-05 02:54:45 +00:00
Simon Mo
36eecfbddb
Remove AMD Ray Summit Banner (#9075) 2024-10-04 10:17:16 -07:00
Simon Mo
a1d874224d
Add NVIDIA Meetup slides, announce AMD meetup, and add contact info (#8319) 2024-09-09 23:21:00 -07:00
Simon Mo
c5c7768264
Announce NVIDIA Meetup (#7483)
Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
2024-08-13 14:28:36 -07:00
Simon Mo
f020a6297e
[Docs] Update readme (#7316) 2024-08-11 17:13:37 -07:00
Simon Mo
5923532e15
Add Skywork AI as Sponsor (#7314) 2024-08-08 13:59:57 -07:00
Woosuk Kwon
b7215de2c5
[Docs] Publish 5th meetup slides (#6799) 2024-07-25 16:47:55 -07:00
Kuntai Du
6a1e25b151
[Doc] Add documentations for nightly benchmarks (#6412) 2024-07-25 11:57:16 -07:00
Woosuk Kwon
cb1362a889
[Docs] Announce llama3.1 support (#6688) 2024-07-23 08:18:15 -07:00
Woosuk Kwon
37d776606f
[Docs] Announce 5th meetup (#6458) 2024-07-15 21:04:58 -07:00
Woosuk Kwon
3dee97b05f
[Docs] Add Google Cloud to sponsor list (#6450) 2024-07-15 11:58:10 -07:00
Woosuk Kwon
d80aef3776
[Docs] Clean up latest news (#6401) 2024-07-12 19:36:53 -07:00
Saliya Ekanayake
a27f87da34
[Doc] Fix Typo in Doc (#6392)
Co-authored-by: Saliya Ekanayake <esaliya@d-matrix.ai>
2024-07-13 00:48:23 +00:00
Kuntai Du
a4feba929b
[CI/Build] Add nightly benchmarking for tgi, tensorrt-llm and lmdeploy (#5362) 2024-07-11 13:28:38 -07:00
youkaichao
2d23b42d92
[doc] update pipeline parallel in readme (#6347) 2024-07-11 11:38:40 -07:00
Jie Fu (傅杰)
439c84581a
[Doc] Update description of vLLM support for CPUs (#6003) 2024-07-10 21:15:29 -07:00
Kunshang Ji
cf90ae0123
[CI][Hardware][Intel GPU] add Intel GPU(XPU) ci pipeline (#5616) 2024-06-21 17:09:34 -07:00
Simon Mo
cdab68dcdb
[Docs] Add ZhenFund as a Sponsor (#5548) 2024-06-14 11:17:21 -07:00
Woosuk Kwon
a65634d3ae
[Docs] Add 4th meetup slides (#5509) 2024-06-13 10:18:26 -07:00
Li, Jiang
80aa7e91fc
[Hardware][Intel] Optimize CPU backend and add more performance tips (#4971)
Co-authored-by: Jianan Gu <jianan.gu@intel.com>
2024-06-13 09:33:14 -07:00
Woosuk Kwon
cb77ad836f
[Docs] Alphabetically sort sponsors (#5386) 2024-06-10 15:17:19 -05:00
Simon Mo
8f1729b829
[Docs] Add Ray Summit CFP (#5295) 2024-06-05 15:25:18 -07:00
Simon Mo
f270a39537
[Docs] Add Sequoia as sponsors (#5287) 2024-06-05 18:02:56 +00:00
Simon Mo
290f4ada2b
[Docs] Add Dropbox as sponsors (#5089) 2024-05-28 10:29:09 -07:00
Simon Mo
e941f88584
[Docs] Add acknowledgment for sponsors (#4925) 2024-05-21 00:17:25 -07:00
Zhuohan Li
361c461a12
[Doc] Highlight the fourth meetup in the README (#4842) 2024-05-15 11:38:49 -07:00
Simon Mo
29bc01bf3b
Add 4th meetup announcement to readme (#4817) 2024-05-14 18:33:06 -04:00
Zhuohan Li
ac1fbf7fd2
[Doc] Shorten README by removing supported model list (#4796) 2024-05-13 16:23:54 -07:00
Caio Mendes
bd7a8eef25
[Doc] README Phi-3 name fix. (#4372)
Co-authored-by: Caio Mendes <caiocesart@microsoft.com>
2024-04-25 10:32:00 -07:00
Isotr0py
fbf152d976
[Bugfix][Model] Refactor OLMo model to support new HF format in transformers 4.40.0 (#4324)
Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
2024-04-25 09:35:56 -07:00
Caio Mendes
96e90fdeb3
[Model] Adds Phi-3 support (#4298) 2024-04-25 03:06:57 +00:00
Simon Mo
705578ae14
[Docs] document that Meta Llama 3 is supported (#4175) 2024-04-18 10:55:48 -07:00
Simon Mo
aceb17cf2d
[Docs] document that mixtral 8x22b is supported (#4073) 2024-04-14 14:35:55 -07:00
ywfang
b4543c8f6b
[Model] add minicpm (#3893) 2024-04-08 18:28:36 +08:00
Woosuk Kwon
b95047f2da
[Misc] Publish 3rd meetup slides (#3835) 2024-04-03 15:46:10 -07:00
Robert Shaw
76b889bf1d
[Doc] Update README.md (#3806) 2024-04-02 23:11:10 -07:00