Why should no pod do any work for a parallel job once the first pod has exited with success?

inzanez · March 3, 2023, 10:21am

Hi

The documentation (Jobs | Kubernetes) states, for parallel Jobs with a work queue, that:

once any Pod has exited with success, no other Pod should still be doing any work for this task or writing any output. They should all be in the process of exiting.

Why is that? Suppose I have a Job that has to process 100’000 PDF documents in some way.

Assume I create a Job with a parallelism of 50. Further assume that almost all of the documents have a single page, so they are processed quickly; only the last document to be processed has 1’000’000 pages.
Now 49 pods would stay alive and block resources until that last pod finishes its work, which might take ages. Why would that be desired? Is there a Kubernetes-internal reason why no pod should exit with success while other pods are still doing work?
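
To put rough numbers on that scenario, here is a minimal sketch in Python. It does not model Kubernetes itself; it only simulates a greedy work queue in which each idle worker takes the next document. The function name `simulate` and the processing speed of one second per page are my own assumptions for illustration.

```python
import heapq

def simulate(page_counts, parallelism, seconds_per_page=1.0):
    """Greedy work-queue simulation (hypothetical helper, not a Kubernetes API).

    Each idle worker takes the next document from the queue.
    Returns the time at which each worker finishes its last document.
    """
    # Min-heap of (time at which the worker becomes free, worker id).
    free_at = [(0.0, w) for w in range(parallelism)]
    heapq.heapify(free_at)
    finish = [0.0] * parallelism
    for pages in page_counts:
        t, w = heapq.heappop(free_at)      # earliest free worker
        t += pages * seconds_per_page      # assumed cost: 1 second per page
        finish[w] = t
        heapq.heappush(free_at, (t, w))
    return finish

# 99,999 single-page documents, then one document with 1,000,000 pages.
docs = [1] * 99_999 + [1_000_000]
finish = simulate(docs, parallelism=50)

last = max(finish)
# Pod-seconds wasted if every pod had to stay alive until the last one finished.
idle = sum(last - t for t in finish)

print(f"last pod finishes after {last:.0f} s")
print(f"idle time across the other pods: {idle:.0f} pod-seconds")
```

Under these assumptions, 49 workers run out of work after about 2,000 seconds, while the one that picked up the large document needs roughly a million seconds more. That gap is the resource blocking I am asking about.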
