Subprocess Killed with a 137 error

jvc22 · November 25, 2022, 2:06pm

Yeah so this was an OOM because the firing up of the bash (as a sub-process) tipped the pod over its total allocation and the node terminated the sub-process. I watched dmesg with

sudo dmesg -wH

and then varied the resource limits on our dev cluster and observed that

if the main process exceeds the limits.memory. the pod is restarted
if a subprocess pushes the resources over the limits.memory the subprocess is killed but the pod remains running.
if the limits.memory is high enough the pod is not restarted and the sub process executes ok

Thanks again

Topic		Replies	Views
POD shows return code 137 - what are the defaults? General Discussions	3	1654	March 12, 2024
How can we tell if the OOMKilled in k8s is because the node is running out of memory and thus killing the pod, or if the pod itself is being killed because the memory it has requested exceeds the limt declaration limit? General Discussions development	1	2368	December 13, 2024
Kubelet doesn't recognize child process oom General Discussions	3	2264	June 16, 2019
Kube-apiserver being restarted more than 400 times (each) with exit code 137 (non OOM killed) General Discussions	3	1206	April 4, 2023
Correctly handle OOM killed job General Discussions	1	1787	June 13, 2019

Subprocess Killed with a 137 error

Related topics