Ambassador qotm route shows "no healthy upstream"

Sony_Joseph · March 25, 2019, 9:56am

I am having issues with a qotm service deployed behind the Ambassador gateway. Whatever I do, I get a response saying "no healthy upstream".

mrbobbytables · March 25, 2019, 11:37am

I can’t really offer any advice, but you might get a better answer by asking in the Ambassador community Slack.

rata · March 25, 2019, 11:27pm

Or please provide more logs and debug info.

Although I have never used Ambassador myself.

Sony_Joseph · March 26, 2019, 8:14am

The Ambassador log while accessing the API shows:
ACCESS [2019-03-26T06:01:18.215Z] "GET /qotm/ HTTP/1.1" 503 UH 0 19 0 - "10.244.1.1" "PostmanRuntime/7.6.0" "708e70f6-5acd-4c8b-8f0c-d4e66e7b67db" "192.168.99.101:32686" "-"
The Ambassador internal routes are all working fine, and for those the ACCESS logs show the target IP address correctly.
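
A note on reading that log line: in Envoy's default access log format, the two fields after the quoted request are the response code and the response flags, and the last quoted field is the upstream host. `UH` is Envoy's flag for "no healthy upstream hosts", and an upstream host of `-` suggests that no backend was ever selected, so the 503 most likely came from the proxy rather than from qotm. The rough Python sketch below pulls those fields out. It assumes Ambassador is using Envoy's default log format; `parse_access_log` and the small flag table are illustrative names, not part of Ambassador or Envoy.

```python
import re

# Small subset of Envoy response flags; "UH" is the one seen in this thread.
RESPONSE_FLAGS = {
    "UH": "no healthy upstream hosts in the upstream cluster",
    "UF": "upstream connection failure",
    "NR": "no route configured for the request",
}

# Leading fields of Envoy's default access log format (assumed here):
# [START_TIME] "METHOD PATH PROTOCOL" RESPONSE_CODE RESPONSE_FLAGS ...
LOG_PATTERN = re.compile(
    r'\[(?P<time>[^\]]+)\] "(?P<request>[^"]*)" (?P<status>\d{3}) (?P<flags>\S+)'
)

SAMPLE = (
    'ACCESS [2019-03-26T06:01:18.215Z] "GET /qotm/ HTTP/1.1" 503 UH 0 19 0 - '
    '"10.244.1.1" "PostmanRuntime/7.6.0" '
    '"708e70f6-5acd-4c8b-8f0c-d4e66e7b67db" "192.168.99.101:32686" "-"'
)


def parse_access_log(line):
    """Return status, response flags and upstream host from one log line."""
    match = LOG_PATTERN.search(line)
    if match is None:
        raise ValueError("not an Envoy-style access log line")
    raw_flags = match.group("flags")
    # "-" means no flags were set; several flags are comma-separated.
    flags = [] if raw_flags == "-" else raw_flags.split(",")
    # The default format ends with the quoted upstream host ("-" if none).
    upstream_host = line.rstrip().rsplit('"', 2)[-2]
    return {
        "request": match.group("request"),
        "status": int(match.group("status")),
        "flags": flags,
        "meanings": [RESPONSE_FLAGS.get(f, "unknown flag") for f in flags],
        "upstream_host": upstream_host,
    }
```

Calling `parse_access_log(SAMPLE)` on the line from the log gives status 503, the single flag `UH`, and `-` as the upstream host.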

rata · March 28, 2019, 3:39am

And did that request arrive at the target IP? Who is returning the 503 you showed: the backend, or Ambassador because there was some problem with the backend?

Sony_Joseph · March 28, 2019, 6:45am

I have observed that it is not a problem with Ambassador. Instead, it could be a problem with access to the DNS service (10.96.0.10) from any of the pods. I tried this from a busybox pod: nslookup of services like qotm, or of its IP address, failed. I also tried to ping and netcat to the DNS service, but neither worked.
So I understand the basic problem to be that DNS is not accessible.
I am adding a few more details on how I did the setup.

Kubernetes version: v1.13.3
CNI: Flannel (quay.io/coreos/flannel:v0.11.0-amd64)
The cluster has one master and a single node.

Sony_Joseph · March 28, 2019, 6:46am

Thanks for replying. I am adding some more data to the discussion thread.

rata · March 29, 2019, 3:28am

Are you using Kubernetes service discovery? If you are not, I’d try setting the pod spec attribute dnsPolicy to Default.

If that works fine, then you are hitting a bug (I don’t have it handy, I’m on my phone).

Please check whether this still happens with dnsPolicy: Default.

Sony_Joseph · March 29, 2019, 4:55am

I just tried it with dnsPolicy: Default and the result is still a failure.
I made this change in the Ambassador pod spec, as the routing, DNS lookup and forwarding have to happen from these pods.
Also, the entire setup is running on two CentOS installations in Oracle VirtualBox with a Host-Only network.

rata · March 29, 2019, 3:30pm

But you did see the problem by running a pod and doing nslookup inside it, right? Can you try that with dnsPolicy: Default set on that pod and report whether it fails or not?

Sony_Joseph · March 31, 2019, 3:56am

I will do so, apologies for the weekend blues…

rata · March 31, 2019, 11:51pm

Nothing to apologise for

Sony_Joseph · April 2, 2019, 8:46am

@rata
I tried the following.
I started qotm with:

apiVersion: v1
kind: Service
metadata:
  name: qotm
  annotations:
    getambassador.io/config: |
      ---
      apiVersion: ambassador/v1
      kind: Mapping
      name: qotm_mapping
      prefix: /qotm/
      service: qotm
spec:
  selector:
    app: qotm
  ports:
  - port: 80
    name: http-qotm
    targetPort: http-api
---
apiVersion: extensions/v1beta1
kind: Deployment
metadata:
  name: qotm
spec:
  replicas: 1
  strategy:
    type: RollingUpdate
  template:
    metadata:
      labels:
        app: qotm
    spec:
      dnsPolicy: Default
      containers:
      - name: qotm
        image: datawire/qotm:1.2
        ports:
        - name: http-api
          containerPort: 5000
        readinessProbe:
          httpGet:
            path: /health
            port: 5000
          initialDelaySeconds: 30
          periodSeconds: 3
        resources:
          limits:
            cpu: "0.1"
            memory: 100Mi

Then I tried executing the commands below:

[root@k8s-master sony]# kubectl exec qotm-5f7f56569d-nkp7b -- cat /etc/resolv.conf
nameserver 10.91.59.137
nameserver 10.165.108.1
nameserver 10.165.108.2
search
[root@k8s-master sony]# kubectl exec qotm-5f7f56569d-nkp7b -- nslookup 10.108.83.129    # cluster IP of the qotm service
nslookup: can't resolve '(null)': Name does not resolve
Name: 10.108.83.129
Address 1: 10.108.83.129
[root@k8s-master sony]# kubectl exec qotm-5f7f56569d-nkp7b -- nslookup 10.108.83.129 10.96.0.1
Server: 10.96.0.1
Address 1: 10.96.0.1

Name: 10.108.83.129
Address 1: 10.108.83.129

Sony_Joseph · April 4, 2019, 5:17am

@rata
Below is the result of the nc command to the DNS service from a busybox pod:

/ # cat /etc/resolv.conf
nameserver 10.96.0.10
search default.svc.cluster.local svc.cluster.local cluster.local ent.bhicorp.com
options ndots:5
/ # nc 10.96.0.10 53
/ # nc 10.96.0.10 53 -v
nc: 10.96.0.10 (10.96.0.10:53): No route to host
/ #
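
For repeating this check from an image that has Python but no nc, a rough equivalent of `nc -v 10.96.0.10 53` can be sketched with the standard socket module. `tcp_port_open` is a made-up helper name. It only tests TCP reachability; DNS normally also uses UDP port 53, which this does not cover.

```python
import socket


def tcp_port_open(host, port, timeout=3.0):
    """Try a TCP connect; return (True, "open") or (False, reason)."""
    try:
        # create_connection resolves the host and performs the TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return True, "open"
    except OSError as exc:
        # Covers refused connections, timeouts and "No route to host".
        return False, str(exc) or type(exc).__name__
```

Running `tcp_port_open("10.96.0.10", 53)` from a pod on each node would show whether the failure depends on where the pod is scheduled.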

Sony_Joseph · April 5, 2019, 5:56am

I force-redeployed kube-dns. After that, a lookup gives the result below:

/ # nslookup kubernetes
Server: 10.96.0.10
Address: 10.96.0.10:53

Name: kubernetes.default.svc.cluster.local
Address: 10.96.0.1

*** Can't find kubernetes.svc.cluster.local: No answer
*** Can't find kubernetes.cluster.local: No answer
*** Can't find kubernetes.: No answer
*** Can't find kubernetes.default.svc.cluster.local: No answer
*** Can't find kubernetes.svc.cluster.local: No answer
*** Can't find kubernetes.cluster.local: No answer
*** Can't find kubernetes.: No answer

rata · April 6, 2019, 12:31am

Your CNI or something is broken, I think :-/

That smells like a network configuration problem to me.

But I’m not sure I can help; I don’t have experience with CNIs.

Sony_Joseph · April 8, 2019, 3:28am

I am trying my level best. From my understanding so far, all that is required is access to DNS from the pods. Is there a good document on how DNS lookup happens in Kubernetes? I would like to dig into more details. I am sure it is some petty issue.

Sony_Joseph · April 8, 2019, 10:28am

One more finding. I examined the iptables rules and ran a watch on them to see how the flow happens. My first impression is that iptables works properly.
But a recent breakthrough in my understanding is that nc 10.96.0.10 53 does work from a pod that is deployed on the MASTER node:
kubectl exec etcd-k8s-master -n kube-system -- nc 10.96.0.10 53 -v
10.96.0.10 (10.96.0.10:53) open

Sony_Joseph · April 8, 2019, 12:00pm

Well, this pod has a different resolv.conf than the usual pods.

rata · April 9, 2019, 2:01am

You can be using kube-dns or CoreDNS as the resolver (they run as pods in the kube-system namespace). They resolve Kubernetes service names and forward anything else to another DNS server, usually called the upstream (for example, a lookup for google.com is forwarded).

If you change the setting we discussed, dnsPolicy IIRC, you don’t use CoreDNS/kube-dns, and just use the resolvers specified in the /etc/resolv.conf of the host the pod is running on.

So, the weird thing is that you saw the problem when using IPs too, not just DNS names. That would point to a network problem. And if you are using a network overlay, that is probably the most likely culprit.

I think you should continue debugging the network problem at the network overlay level. But I’m not sure what advice to give, as I usually don’t use a network overlay. Sorry.
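
On the earlier question of how a lookup happens inside a pod: with the busybox resolv.conf posted above (`ndots:5` plus a `search` list), a short name such as `kubernetes` has fewer than five dots, so each search domain is appended first and the bare name is tried last. The Python sketch below is a simplified model of that glibc-style ordering, assuming the resolv.conf contents from this thread. It is not what any particular nslookup build does internally, and `parse_resolv_conf` and `candidate_names` are hypothetical helper names.

```python
# resolv.conf as seen in the busybox pod earlier in this thread.
POD_RESOLV_CONF = """\
nameserver 10.96.0.10
search default.svc.cluster.local svc.cluster.local cluster.local ent.bhicorp.com
options ndots:5
"""


def parse_resolv_conf(text):
    """Extract nameservers, search domains and ndots from resolv.conf text."""
    config = {"nameservers": [], "search": [], "ndots": 1}  # ndots defaults to 1
    for raw in text.splitlines():
        fields = raw.split("#", 1)[0].split()
        if not fields:
            continue
        if fields[0] == "nameserver" and len(fields) > 1:
            config["nameservers"].append(fields[1])
        elif fields[0] == "search":
            config["search"] = fields[1:]
        elif fields[0] == "options":
            for option in fields[1:]:
                if option.startswith("ndots:"):
                    config["ndots"] = int(option.split(":", 1)[1])
    return config


def candidate_names(name, config):
    """List the fully qualified names a resolver would try, in order."""
    if name.endswith("."):
        return [name]  # already absolute, the search list is skipped
    searched = [f"{name}.{domain}." for domain in config["search"]]
    absolute = [name + "."]
    if name.count(".") >= config["ndots"]:
        return absolute + searched  # enough dots: try the name as-is first
    return searched + absolute      # too few dots: search domains first
```

For `kubernetes` this yields `kubernetes.default.svc.cluster.local.` first and `kubernetes.` last. All of those queries still go to the single nameserver 10.96.0.10, so if that IP is unreachable from a node, every lookup from pods on that node fails regardless of the name.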
