What is the UnbalancedIRQs Kubernetes event?

We are seeing a new Kubernetes event - UnbalancedIRQs - with the message "5 IRQs with affinity to 1 CPUs".
Could you please share some insights on this event: its root cause, its impact on the cluster, and any solutions to resolve this warning?

Thanks for your time.

IRQs (interrupt requests) are how the kernel is notified of hardware events. This has nothing in particular to do with Kubernetes.
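For context: on Linux, each IRQ's CPU affinity is exposed under /proc/irq/. Below is a minimal Python sketch of the kind of check a monitor could perform to count IRQs pinned to a single CPU; it is an illustration of the concept, not node-problem-detector's actual implementation, and the real monitor may define "unbalanced" differently.

```python
import glob

def irqs_pinned_to_one_cpu():
    """Count IRQs whose affinity list names exactly one CPU.

    Reads /proc/irq/<n>/smp_affinity_list, where the kernel exposes
    each IRQ's allowed CPUs as a list like "0-3", "0,4", or "2".
    """
    pinned = 0
    total = 0
    for path in glob.glob("/proc/irq/*/smp_affinity_list"):
        try:
            with open(path) as f:
                affinity = f.read().strip()
        except OSError:
            continue  # some IRQs refuse reads; skip them
        total += 1
        # A single-CPU affinity is a bare number (no ranges or commas).
        if affinity.isdigit():
            pinned += 1
    return pinned, total

if __name__ == "__main__":
    pinned, total = irqs_pinned_to_one_cpu()
    print(f"{pinned} of {total} IRQs have affinity to exactly 1 CPU")
```

Normally the irqbalance daemon spreads IRQs across CPUs; when it is broken or misconfigured, many IRQs end up pinned to one CPU, which is what this event is flagging.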

We started getting this event a few days ago. I tried searching the keywords online (Google, the GitHub org, the node-problem-detector repo, etc.) and got no results other than this thread. Any idea what this means? We are running v1.26.3 on AKS.

Same here, starting 1 or 2 weeks ago:
Reason: UnbalancedIRQs
Message: 9 IRQs with affinity to 1 CPUs
Source: irqbalance-problem-monitor NODENAME
Object: Node/aks-NODENAME
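To check how widespread these events are across your nodes, you can filter events by reason. A sketch using the official `kubernetes` Python client, assuming a working local kubeconfig:

```python
from kubernetes import client, config

# Load credentials from the local kubeconfig (same as kubectl uses).
config.load_kube_config()
v1 = client.CoreV1Api()

# Node-level events are typically recorded in the default namespace,
# so scan all namespaces and filter on the event reason.
events = v1.list_event_for_all_namespaces(
    field_selector="reason=UnbalancedIRQs"
)
for ev in events.items:
    print(ev.involved_object.name, ev.count, ev.message)
```

The kubectl equivalent is `kubectl get events --all-namespaces --field-selector reason=UnbalancedIRQs`.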

Possibly NPD started reporting this recently? Or maybe your OS was updated and the kernel is routing all IRQs to a single CPU?

We also started receiving these events in the node logs after upgrading AKS from 1.25.6 to 1.26.6.


Events:

| Type    | Reason         | Age                     | From                       | Message                        |
|---------|----------------|-------------------------|----------------------------|--------------------------------|
| Warning | UnbalancedIRQs | 100s (x1930 over 6d16h) | irqbalance-problem-monitor | 9 IRQs with affinity to 1 CPUs |

We faced the same issue with AKS.
In the cluster resource, under "Resource Health" and then "Diagnose and solve problems", an advisory for this problem was listed.

So it seems to be an AKS issue that can be fixed by updating the node images.

The solution is to upgrade the node image to the latest available version.

The fix for irqbalance (#275) has been incorporated; a node image upgrade to 202310.19.2 or later will resolve the unbalanced IRQs.
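To verify which node image version your nodes are currently running, you can read the node labels. A sketch that assumes the `kubernetes.azure.com/node-image-version` label AKS applies to its nodes and the official `kubernetes` Python client:

```python
from kubernetes import client, config

config.load_kube_config()
v1 = client.CoreV1Api()

# AKS stamps each node with its image version as a label,
# e.g. "AKSUbuntu-2204gen2containerd-202310.19.2".
LABEL = "kubernetes.azure.com/node-image-version"

for node in v1.list_node().items:
    labels = node.metadata.labels or {}
    version = labels.get(LABEL, "<label not present>")
    print(f"{node.metadata.name}: {version}")
```

The upgrade itself can be done with `az aks nodepool upgrade --node-image-only` (plus the usual resource group, cluster, and nodepool arguments), which refreshes the node image without changing the Kubernetes version.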

The warnings are displayed because node-problem-detector inspects kernel state on each node to determine the availability of resources that can or cannot be assigned to pods.

“node-problem-detector aims to make various node problems visible to the upstream layers in the cluster management stack. It is a daemon that runs on each node, detects node problems and reports them to apiserver. node-problem-detector can either run as a DaemonSet or run standalone. Now it is running as a Kubernetes Addon enabled by default in the GKE cluster. It is also enabled by default in AKS as part of the AKS Linux Extension.”