20231088/vllm - vllm - Luminance Code Repo

20231088/vllm

Author	SHA1	Message	Date
Harry Mellor	e78587a64c	Improve-mm-and-pooler-and-decoding-configs (#16789 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-04-17 22:13:32 -07:00
Cyrus Leung	61a44a0b22	[Doc] Add more tips to avoid OOM (#16765 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-04-17 09:54:34 +00:00
Cyrus Leung	facbe2a114	[Doc] Improve OOM troubleshooting (#16704 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-04-16 18:29:48 +08:00
Cyrus Leung	d9fc8cd9da	[V1] Enable multi-input by default (#15799 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-04-12 08:52:39 +00:00
Christian Sears	c09632a66c	Update openai_compatible_server.md (#16507 ) Signed-off-by: Christian Sears <csears@redhat.com>	2025-04-11 22:54:58 +00:00
Simon Mo	7acd539cd7	[Docs] update usage stats language (#15898 ) Signed-off-by: simon-mo <simon.mo@hey.com>	2025-04-01 12:54:13 -07:00
Wei Zeng	30d6a015e0	[Feature] specify model in config.yaml (#15798 ) Signed-off-by: weizeng <weizeng@roblox.com>	2025-04-01 01:20:06 -07:00
Reid	2914006fe0	[doc] add missing imports (#15699 ) Signed-off-by: reidliu41 <reid201711@gmail.com> Co-authored-by: reidliu41 <reid201711@gmail.com>	2025-03-28 15:56:48 +00:00
Cyrus Leung	6dd55af6c9	[Doc] Update docs on handling OOM (#15357 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by: Roger Wang <ywang@roblox.com> Co-authored-by: Roger Wang <ywang@roblox.com>	2025-03-24 14:29:34 -07:00
Roger Wang	9c5c81b0da	[Misc][Doc] Add note regarding loading `generation_config` by default (#15281 ) Signed-off-by: Roger Wang <ywang@roblox.com>	2025-03-23 14:00:55 -07:00
Cyrus Leung	baec0d4de9	Revert "[Feature] specify model in config.yaml (#14855 )" (#15293 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-03-21 08:30:23 -07:00
Wei Zeng	0fa3970deb	[Feature] specify model in config.yaml (#14855 ) Signed-off-by: weizeng <weizeng@roblox.com>	2025-03-21 00:26:03 -07:00
Harry Mellor	6edbfa924d	Mention `extra_body` as a way top pass vLLM only parameters using the OpenAI client (#15240 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-03-20 19:18:36 -07:00
Rui Qiao	4cb1c05c9e	[Doc] Clarify run vllm only on one node in distributed inference (#15148 ) Signed-off-by: Rui Qiao <ruisearch42@gmail.com>	2025-03-20 09:55:59 +08:00
Mark McLoughlin	9d2b4a70f4	[V1][Metrics] Updated list of deprecated metrics in v0.8 (#14695 ) Signed-off-by: Mark McLoughlin <markmc@redhat.com>	2025-03-15 00:45:25 +08:00
yasu52	3fb17d26c8	[Doc] Fix typo in documentation (#14783 ) Signed-off-by: yasu52 <tsuguro4649@gmail.com>	2025-03-13 20:33:09 -07:00
Chauncey	b0746fae3d	[Frontend] support image embeds (#13955 ) Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>	2025-03-10 12:36:03 +00:00
Nicolò Lucchesi	fa82b93853	[Frontend][Docs] Transcription API streaming (#13301 ) Signed-off-by: NickLucche <nlucches@redhat.com>	2025-03-06 10:39:35 +00:00
Rui Qiao	abcc61e0af	[misc] Mention `ray list nodes` command to troubleshoot ray issues (#14318 ) Signed-off-by: Rui Qiao <ruisearch42@gmail.com>	2025-03-06 02:00:36 +00:00
Cyrus Leung	1088f06242	[Doc] Move multimodal Embedding API example to Online Serving page (#14017 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-02-28 07:12:04 +00:00
Mark McLoughlin	2cb8c1540e	[Metrics] Add `--show-hidden-metrics-for-version` CLI arg (#13295 )	2025-02-22 00:20:45 -08:00
Gabriel Marinho	1c3c975766	[FEATURE] Enables /score endpoint for embedding models (#12846 )	2025-02-20 22:09:47 -08:00
youkaichao	ad5a35c21b	[doc] clarify multi-node serving doc (#13558 ) Signed-off-by: youkaichao <youkaichao@gmail.com>	2025-02-19 22:32:17 +08:00
Cyrus Leung	7b623fca0b	[VLM] Check required fields before initializing field config in `DictEmbeddingItems` (#13380 )	2025-02-17 01:36:07 -08:00
Nicolò Lucchesi	d84cef76eb	[Frontend] Add `/v1/audio/transcriptions` OpenAI API endpoint (#12909 )	2025-02-13 07:23:45 -08:00
Farzad Abdolhosseini	08b2d845d6	[Model] Ultravox Model: Support v0.5 Release (#12912 ) Signed-off-by: Farzad Abdolhosseini <farzad@fixie.ai>	2025-02-10 22:02:48 +00:00
Cyrus Leung	8a69e0e20e	[CI/Build] Auto-fix Markdown files (#12941 )	2025-02-08 04:25:15 -08:00
youkaichao	e64330910b	[doc][misc] clarify VLLM_HOST_IP for multi-node inference (#12667 ) As more and more people are trying deepseek models with multi-node inference, https://github.com/vllm-project/vllm/issues/7815 becomes more frequent. Let's give clear message to users. Signed-off-by: youkaichao <youkaichao@gmail.com>	2025-02-03 09:32:18 +08:00
Harry Mellor	dd6a3a02cb	[Doc] Convert docs to use colon fences (#12471 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-01-29 11:38:29 +08:00
Kyle Mistele	0034b09ceb	[Frontend] Rerank API (Jina- and Cohere-compatible API) (#12376 ) Signed-off-by: Kyle Mistele <kyle@mistele.com>	2025-01-26 19:58:45 -07:00
Cyrus Leung	d07efb31c5	[Doc] Troubleshooting errors during model inspection (#12351 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-01-23 22:46:58 +08:00
Cyrus Leung	f8ef146f03	[Doc] Add documentation for specifying model architecture (#12105 )	2025-01-16 15:53:43 +08:00
Rafael Vasquez	43f3d9e699	[CI/Build] Add markdown linter (#11857 ) Signed-off-by: Rafael Vasquez <rafvasq21@gmail.com>	2025-01-12 00:17:13 -08:00
Harry Mellor	482cdc494e	[Doc] Rename offline inference examples (#11927 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-01-10 23:50:29 +08:00
Cyrus Leung	12664ddda5	[Doc] [1/N] Initial guide for merged multi-modal processor (#11925 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-01-10 14:30:25 +00:00
Harry Mellor	d85c47d6ad	Replace "online inference" with "online serving" (#11923 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-01-10 12:05:56 +00:00
Cyrus Leung	6cd40a5bfe	[Doc][4/N] Reorganize API Reference (#11843 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-01-08 21:34:44 +08:00
Harry Mellor	aba8d6ee00	[Doc] Move examples into categories (#11840 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-01-08 13:09:53 +00:00
Cyrus Leung	8ceffbf315	[Doc][3/N] Reorganize Serving section (#11766 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-01-07 11:20:01 +08:00
Cyrus Leung	ee77fdb5de	[Doc][2/N] Reorganize Models and Usage sections (#11755 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-01-06 21:40:31 +08:00
Suraj Deshmukh	2a622d704a	k8s-config: Update the secret to use stringData (#11679 ) Signed-off-by: Suraj Deshmukh <surajd.service@gmail.com>	2025-01-06 08:01:22 +00:00
Cyrus Leung	402d378360	[Doc] [1/N] Reorganize Getting Started section (#11645 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-01-06 02:18:33 +00:00
Chunyang Wen	84c35c374a	According to vllm.EngineArgs, the name should be distributed_executor_backend (#11689 )	2025-01-02 18:14:16 +00:00
Cyrus Leung	32b4c63f02	[Doc] Convert list tables to MyST (#11594 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2024-12-29 15:56:22 +08:00
Cyrus Leung	d427e5cfda	[Doc] Minor documentation fixes (#11580 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2024-12-28 21:53:59 +08:00
AlexHe99	d003f3ea39	Update deploying_with_k8s.md with AMD ROCm GPU example (#11465 ) Signed-off-by: Alex He <alehe@amd.com> Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>	2024-12-27 10:00:04 +00:00
Robert Shaw	0c0c2015c5	Update openai_compatible_server.md (#11536 ) Co-authored-by: Simon Mo <simon.mo@hey.com>	2024-12-26 16:26:18 -08:00
Cyrus Leung	6ad909fdda	[Doc] Improve GitHub links (#11491 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2024-12-25 14:49:26 -08:00
Cyrus Leung	9edca6bf8f	[Frontend] Online Pooling API (#11457 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2024-12-24 17:54:30 +08:00
Rafael Vasquez	32aa2059ad	[Docs] Convert rST to MyST (Markdown) (#11145 ) Signed-off-by: Rafael Vasquez <rafvasq21@gmail.com>	2024-12-23 22:35:38 +00:00

1 2 3

149 Commits