How to shut down and restart Kubernetes clusters?


#1

I have used Kubespray to install Kubernetes on a three-node lab infrastructure. I need to power off these hosts at night.

I have searched and read the docs, and I can't figure out the proper way of shutting down the cluster on all nodes and then restarting it the next day. Can someone point me to the proper process?
It would be nice if we had an Ansible playbook to shut things down. (I do realize it's my responsibility to shut down what's running in the containers themselves.)

Thanks


#2

I have never tried this, but if I were about to do it, I would try it in this order:

  1. Take a backup of the cluster in case things go south when you try to bring it back online (I would use Heptio Velero for that).
  2. On the master node, stop the following services:
  • kube-apiserver
  • kube-scheduler
  • kube-controller-manager
  3. On the worker nodes, stop the following services:
  • kubelet
  • kube-proxy
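Assuming systemd-managed components, the stop order above could be sketched as a small helper. This is untested against a real cluster, and the unit names are assumptions that vary by installer (on a kubespray install the control-plane pieces may run as containers instead, see the later replies). The `RUN=echo` default is just a dry-run convention for illustration:

```shell
#!/usr/bin/env bash
# Untested sketch of the stop order above. Unit names are assumptions;
# check `systemctl list-units 'kube*'` on your own hosts first.
set -euo pipefail
RUN="${RUN:-echo}"   # RUN=echo prints commands; RUN="" runs them for real

stop_control_plane() {   # run on each master (step 2)
  for svc in kube-apiserver kube-scheduler kube-controller-manager; do
    $RUN systemctl stop "$svc"
  done
}

stop_node() {            # run on each worker (step 3)
  for svc in kubelet kube-proxy; do
    $RUN systemctl stop "$svc"
  done
}

stop_control_plane
stop_node
```

The next morning you would `systemctl start` the same units in reverse: control plane first, then kubelet/kube-proxy on the workers.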

good luck :see_no_evil:


#3

lalem :

  1. Which directories need to be backed up?
  2. How would you stop control-plane processes like kube-apiserver, the controller manager, etc.? I mean, do you just issue a kill PID, or is there a nicer way to stop them?

Thanks


#4

I have the same question. How is it possible that something as basic as stopping and starting the platform is not documented as one of the first things? People should not have to search forums for the answer to such a simple question; it is not acceptable.


#5
  1. I meant that you back up your entire cluster, not specific directories, as a first choice. I mentioned you can use Velero for that; it is a great open-source utility that can back up your K8s state in case of disaster. You can check it out yourself at https://github.com/heptio/velero/ . If you do not want to use Velero to back up your cluster state, then back up your etcd (https://github.com/etcd-io/etcd/blob/master/Documentation/op-guide/recovery.md) and the root certificate files as well.
  2. Your master components are running inside pods, so you will have to stop them using “docker stop”. Then on the nodes just stop the services using “systemctl stop”.
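A rough sketch of those two points. Everything here is hedged: the etcd endpoint, certificate paths, snapshot path, and container-name filter are guesses for a typical kubespray/Docker setup, and must be checked against your own masters (`docker ps`, your etcd config) before use:

```shell
#!/usr/bin/env bash
# Untested sketch: snapshot etcd, then stop the containerised
# control-plane components. All paths and names here are assumptions.
set -euo pipefail
RUN="${RUN:-echo}"   # RUN=echo prints commands; RUN="" runs them for real

backup_etcd() {
  # etcd v3 snapshot; endpoint and cert paths depend on your install
  $RUN env ETCDCTL_API=3 etcdctl snapshot save /var/backups/etcd-snap.db \
    --endpoints=https://127.0.0.1:2379 \
    --cacert=/etc/ssl/etcd/ssl/ca.pem \
    --cert=/etc/ssl/etcd/ssl/node.pem \
    --key=/etc/ssl/etcd/ssl/node-key.pem
}

stop_control_plane_containers() {
  # kubelet-managed containers are usually named k8s_<component>_...;
  # verify the actual names with `docker ps` first
  for c in kube-apiserver kube-scheduler kube-controller-manager; do
    $RUN sh -c "docker ps -q --filter name=k8s_${c} | xargs -r docker stop"
  done
}
```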

Again, I have not tried the steps above, but here is what I do to terminate the cluster at the end of the day and bring it back up when I need it, and it works every time:

  1. Automate the shutdown to take a Velero backup of the entire cluster.
  2. Destroy the cluster fully by running the reset playbook.

Then, to bring it back up again, automate:

  1. Bring up the cluster using the same playbook.
  2. Restore from the backup taken prior to shutdown.
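Assuming the Velero CLI and a kubespray checkout with the usual inventory layout (the inventory path and backup names below are illustrative, not from the original posts), that cycle could be scripted roughly as:

```shell
#!/usr/bin/env bash
# Untested sketch of the nightly destroy / morning rebuild cycle above.
set -euo pipefail
RUN="${RUN:-echo}"   # RUN=echo prints commands; RUN="" runs them for real
INVENTORY="inventory/mycluster/hosts.yml"   # adjust to your checkout

evening_shutdown() {
  $RUN velero backup create "nightly-$(date +%F)" --wait
  $RUN ansible-playbook -i "$INVENTORY" reset.yml -e reset_confirmation=yes
}

morning_startup() {
  $RUN ansible-playbook -i "$INVENTORY" cluster.yml
  # restore from the previous evening's backup; pick the right name/date
  $RUN velero restore create --from-backup "nightly-$(date +%F)"
}
```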

#6

We have had hard DC crashes and when bringing things back up, we just made sure the control plane was up before our nodes and things have come back fine.

We have also powered off our cluster(s) before and for the most part did this:

  1. Scale all applications down to 0, excluding cluster services, e.g. CNI DaemonSets, DNS, etc.
  2. Drain all nodes excluding the control plane.
  3. Shut down the nodes.
  4. Shut down all components except kube-apiserver and etcd. If kubelet is managing the components (kubeadm), just move the manifests out of the /etc/kubernetes/manifests dir and kubelet will stop the containers gracefully.
  5. Shut down kube-apiserver.
  6. Stop kubelet on the control plane; just ensure the etcd leader is the last one to be stopped.
  7. Back up dirs/etcd if needed.

Bringing it back up is essentially the same steps in the opposite order.
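For a kubeadm-style control plane (static pods under /etc/kubernetes/manifests), steps 2, 4 and 5 of the list above could be sketched as below. The node names and the "parked" directory name are made up for illustration, and this has not been run against the poster's cluster:

```shell
#!/usr/bin/env bash
# Untested sketch of steps 2, 4 and 5 above for a kubeadm-style setup.
set -euo pipefail
RUN="${RUN:-echo}"   # RUN=echo prints commands; RUN="" runs them for real

drain_workers() {                    # step 2: pass worker names by hand
  for node in "$@"; do
    $RUN kubectl drain "$node" --ignore-daemonsets --delete-local-data
  done
}

park_control_plane_manifests() {     # step 4: kubelet stops static pods
  $RUN mkdir -p /etc/kubernetes/manifests.parked
  for m in kube-controller-manager kube-scheduler; do
    $RUN mv "/etc/kubernetes/manifests/$m.yaml" /etc/kubernetes/manifests.parked/
  done
  # step 5: move kube-apiserver.yaml last, once everything else is down
  $RUN mv /etc/kubernetes/manifests/kube-apiserver.yaml /etc/kubernetes/manifests.parked/
}

drain_workers worker-1 worker-2      # example node names
park_control_plane_manifests
```

Bringing it back up would be the manifest moves in reverse, plus `kubectl uncordon` on each drained worker.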


#7

Thanks very much for all the responses. I am just surprised there is no shutdown playbook or similar. For sure the pattern differs for each person/shop, but there are common tasks that could be automated via a playbook we could run after the containers are drained.

I will try to write something and contribute it back to the repo to get some thoughts.

Thanks all