In a multi-node Kubernetes cluster, when a Service call is translated to a Pod IP (via iptables DNAT), how is the correct node chosen to forward the request?

In a service call, when a Service is translated to a backend Pod IP by iptables (DNAT), how does the request reach the right node when it first arrives on a node where that Pod is not running?


The request for a service call is made to the ClusterIP, and kube-proxy on the node intercepts it. kube-proxy manages rules in iptables or IPVS; these rules perform destination NAT (DNAT) to translate the Service IP into one of the backend Pod IPs.
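As a rough sketch, the chains kube-proxy installs in iptables mode look something like this (trimmed, iptables-save style output; the chain names, IPs and ports are made-up placeholders):

    # KUBE-SERVICES: match traffic to the ClusterIP:port and jump to the per-Service chain
    -A KUBE-SERVICES -d 10.96.0.1/32 -p tcp --dport 443 -j KUBE-SVC-EXAMPLE
    # per-Service chain: choose one endpoint (Pod) chain
    -A KUBE-SVC-EXAMPLE -j KUBE-SEP-EXAMPLE1
    # per-endpoint chain: DNAT the packet to that Pod's IP and port
    -A KUBE-SEP-EXAMPLE1 -p tcp -j DNAT --to-destination 192.168.1.2:443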

I wrote about it recently if you want to take a look here


Yes, the Service IP is translated to a backend Pod IP using iptables. But suppose the node where the iptables rule is executed doesn't host that Pod, and the Pod is running on another node. There must be an extra step that decides which node hosts the Pod and forwards the request there. So the question is: how is that destination node selected? Is there a pod-to-node mapping, and if so, where can those details be found?

The CNI knows the mapping between Pods and nodes. Once the Service traffic has been DNAT'ed to a Pod IP by the iptables chains, the routing set up by the CNI forwards it to the node that owns that Pod.
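To make that concrete: most CNIs give every node its own Pod CIDR and program routes (or an overlay) so any Pod IP is reachable from any node. A minimal sketch of a node's routing table with a routed CNI (all CIDRs and node IPs below are made-up examples):

    # on Node A: Pods local to this node, delivered via the local bridge
    192.168.1.0/24 dev cni0 proto kernel scope link
    # Pods that live on Node B are reached through Node B's address
    192.168.2.0/24 via 10.0.0.12 dev eth0
    # so a packet that was DNAT'ed to 192.168.2.7 is routed straight to Node B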


Traffic Flow Explanation

  1. Client Sends Traffic to a Kubernetes Service:
  • The client sends a request to a Service's ClusterIP (e.g., 10.96.0.1), or to a node IP/port or load balancer IP when using NodePort or LoadBalancer.
  2. Service-Level Load Balancing in kube-proxy:
  • The kube-proxy on the receiving node intercepts this traffic.
  • It uses iptables rules (or IPVS in some configurations) to match the destination IP and port.
  • An iptables rule matches the Service IP and jumps to a service-specific chain.
  3. Selection of a Pod:
  • Within the service-specific chain, a rule selects a backend pod based on probability (random load balancing).
  • This rule jumps to an endpoint-specific chain.
  4. DNAT to Pod IP:
  • In the endpoint-specific chain, the traffic is DNAT'ed:
    • The destination IP is replaced with the pod's IP (e.g., 192.168.1.2), and the destination port is replaced with the pod's port (if necessary).
    • The traffic is directed toward the selected pod.
  5. Traffic to a Pod on a Different Node:
  • If the selected pod is on a different node, the traffic is forwarded to the pod's node via an overlay network or host routing, depending on the CNI (e.g., Flannel, Calico).
  6. Arrival at the Target Node:
  • On the target node, the traffic enters through the node's network interface.
  • It bypasses kube-proxy and reaches the pod directly via the CNI, since the DNAT'ed destination already matches the pod IP.
  7. Pod Processes the Request:
  • The pod receives the request and processes it.
  • The pod sends a response back to the original client using the same DNAT/IP translation rules to maintain connection state.

Traffic Flow Diagram

Here's a simple diagram to visualize the flow:

Client -> Service (ClusterIP) -> kube-proxy -> iptables chain:
     - Match ServiceIP:Port
     - Select Pod using probability
     - DNAT to PodIP:Port

Node A (kube-proxy):
   - DNAT traffic to PodIP (on Node B)

Node B:
   - Receive DNAT'ed traffic
   - Route to Pod
   - Pod processes request and sends response back

Key Points about DNAT:

  1. iptables and Chains: The DNAT step in kube-proxy's iptables mode rewrites the destination IP and port to the pod's IP and port.
  2. Probability Matching: The service's iptables chain uses random probabilities so traffic is spread roughly evenly across all endpoints (pods); a three-endpoint example follows below.
  3. Cross-Node Traffic: When the pod is on a different node, traffic is routed to that node via the overlay network or routes provided by the Kubernetes CNI plugin.
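For example, with three endpoints, kube-proxy in iptables mode emits statistic-module rules roughly like the following (chain names are placeholders), so each pod ends up with about a 1/3 share:

    -A KUBE-SVC-EXAMPLE -m statistic --mode random --probability 0.33333 -j KUBE-SEP-POD1
    -A KUBE-SVC-EXAMPLE -m statistic --mode random --probability 0.50000 -j KUBE-SEP-POD2
    -A KUBE-SVC-EXAMPLE -j KUBE-SEP-POD3
    # 1/3 of connections hit POD1; half of the remaining 2/3 (another 1/3) hit POD2; the rest fall through to POD3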