I am trying to figure out what forum to ask this on, and this is for general discussion, and so it seems ok to ask here.
Is AWS EKS Windows Ready For Production? We are now on EKS 1.17. We continuously run into obscure problems. Most recently those problems surround DNS. Examples:
- Sometimes Windows nodes get the wrong MAC address of pods in the cluster. This causes an impact when coredns is one of those pods. Pods running on the impacted Windows nodes cannot resolve DNS whenever requests go to the coredns pod with the wrong MAC address (by wrong MAC address i mean the node has the wrong MAC address for that pod)
- Sometimes Pods that start on a Windows Node cannot resolve DNS at all. ie- DNS is broken for the entire lifetime of the pod, and the Pod cannot connect to any internal Cluster Ip. However, whenever a fresh Pod is started, the problem goes away.
It’s hard to tell if this is a Windows Container problem or an EKS Problem. But, I am wondering whether other people are successfully running important workflows on Windows EKS. Any comments would be helpful as I assess this offering.