Client early failover in case of the node failure for long lived TCP connections

thisisbhaskar · June 22, 2026, 4:59am

Looking for guidance on TCP connection failover behavior with Kubernetes Services + MetalLB.

Setup:

Kubernetes cluster with 3 worker nodes
DaemonSet running one pod per node
Service type = LoadBalancer
MetalLB in L2 mode
externalTrafficPolicy: Cluster
Clients establish long-lived TCP connections to the Service VIP
Clients are external and not under our control

Example flow:

Client
  |
  v
VIP (owned by Node1)
  |
  v
kube-proxy
  |
  v
Pod on Node3

Failure scenario:

Client establishes a TCP connection to the Service VIP.
MetalLB advertises the VIP from Node1.
kube-proxy selects a backend pod running on Node3.
Node3 crashes (or becomes unreachable).
The existing TCP connection becomes unusable.
The client does not establish a new connection for ~60 seconds (appears to be waiting on TCP timeout/retransmission behavior).

Question:

Is there any Kubernetes networking mechanism (Service, kube-proxy, conntrack tuning, MetalLB configuration, etc.) that can reduce the failure detection time for an already-established TCP connection when the selected backend node disappears?

More specifically:

Can Kubernetes/MetalLB cause the client to receive a faster TCP failure indication (RST/ICMP/etc.) when the backend node hosting the selected endpoint dies?
Is the ~60 second wait fundamentally a client TCP behavior once the backend connection state is lost?
Would moving to MetalLB BGP mode with externalTrafficPolicy: Local change anything for existing TCP sessions, or only improve routing of new connections after node failure?

My current understanding is that Kubernetes can help steer new connections away from failed endpoints, but cannot accelerate failure detection of an already-established TCP session when the endpoint node hard-crashes. Looking to confirm whether that’s correct or if I’m missing any networking-level options.

Topic		Replies	Views
Traffic Still Routed to a Hung Node for Minutes — Is This a Kubernetes Design Limitation? General Discussions	0	33	March 30, 2026
TCP SYN_SENT conntrack flow later becomes ESTABLISHED after backend Pod termination / IP reuse — expected kube-proxy behavior or gap? General Discussions network	0	60	March 28, 2026
Bare Metal K8S Load Balancer Service Routing Delay General Discussions	5	2413	March 14, 2020
Traffic to a Pod located in a Dead Node General Discussions	2	1879	August 23, 2019
Issues with KubeProxy Network Programming Duration General Discussions	1	2305	September 22, 2020

Client early failover in case of the node failure for long lived TCP connections

Related topics