[BugFix] Ensure worker model loop is always stopped at the right time (#5987)
This commit is contained in:
parent
c4059ea54f
commit
c87ebc3ef9
@ -838,7 +838,7 @@ class LLMEngine:
|
|||||||
# Tracing
|
# Tracing
|
||||||
self.do_tracing(scheduler_outputs)
|
self.do_tracing(scheduler_outputs)
|
||||||
|
|
||||||
if not request_outputs:
|
if not self.has_unfinished_requests():
|
||||||
# Stop the execute model loop in parallel workers until there are
|
# Stop the execute model loop in parallel workers until there are
|
||||||
# more requests to process. This avoids waiting indefinitely in
|
# more requests to process. This avoids waiting indefinitely in
|
||||||
# torch.distributed ops which may otherwise timeout, and unblocks
|
# torch.distributed ops which may otherwise timeout, and unblocks
|
||||||
|
Loading…
x
Reference in New Issue
Block a user