Hi,

I believe I've identified a couple of bugs, but I'd like to ask for other opinions before raising them officially. They may also be more related to Kubernetes than to OpenShift.


First one:

We have a cluster running OpenShift Origin v1.2.1 with Kubernetes v1.2.0-36-g4a3f9c5. This cluster has 3 masters and 4+ nodes.

We've noticed that if we take master01 offline for maintenance, DNS lookups inside the cluster are affected. It only affects lookups against the 172.30.0.1 cluster IP, which is translated to port 53 tcp/udp on the three masters. Because the iptables rules use the statistic (probability) module to spread traffic across the three endpoints, roughly 1 in 3 DNS lookups fails because it is directed to the offline master. What I think is really needed is for the offline master to be removed from the service's endpoints, so that kube-proxy regenerates the iptables rules and traffic stops being sent to a master that isn't serving. Health checks on these endpoints would be welcome too.
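To illustrate what I mean, this is roughly what we see on a node (the chain names, endpoint address and exact probabilities below are placeholders for illustration, not copied from the cluster):

    # the offline master is still listed as an endpoint of the kubernetes service
    oc get endpoints kubernetes -n default

    # dump the rules kube-proxy generated for the DNS cluster IP
    iptables-save | grep 172.30.0.1

    # the service chain picks one of the three endpoint chains at random,
    # so about a third of packets are still DNAT'd to the offline master:
    -A KUBE-SVC-EXAMPLE -m statistic --mode random --probability 0.33333 -j KUBE-SEP-MASTER01
    -A KUBE-SVC-EXAMPLE -m statistic --mode random --probability 0.50000 -j KUBE-SEP-MASTER02
    -A KUBE-SVC-EXAMPLE -j KUBE-SEP-MASTER03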


Second one:

This cluster is running OpenShift Origin v1.3.0-alpha.2+3da4944 with Kubernetes v1.3.0+57fb9ac. Again, 3 masters and roughly 20 nodes.

The first bug also applies to this cluster, but the behaviour is somewhat different due to the introduction of iptables "recent" module rules. I'm not 100% clear on how these behave, but what seems to happen is that the first "recent" rule is always matched, so all traffic from pods to the cluster IP always hits the first endpoint. The result is that close to 100% of DNS lookups against service names fail from pods while master01 is down, rather than 1 in 3.
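For reference, the rules in question look something like the following (again, chain names and the destination address are illustrative placeholders; the point is that the "recent" rules sit above the probability rules in the service chain, and each endpoint chain records the source address with --set before the DNAT):

    # session affinity: if this source was seen recently for an endpoint,
    # jump straight back to that endpoint, bypassing the random selection below
    -A KUBE-SVC-EXAMPLE -m recent --rcheck --seconds 10800 --reap --name KUBE-SEP-MASTER01 -j KUBE-SEP-MASTER01
    -A KUBE-SVC-EXAMPLE -m recent --rcheck --seconds 10800 --reap --name KUBE-SEP-MASTER02 -j KUBE-SEP-MASTER02
    -A KUBE-SVC-EXAMPLE -m recent --rcheck --seconds 10800 --reap --name KUBE-SEP-MASTER03 -j KUBE-SEP-MASTER03
    -A KUBE-SVC-EXAMPLE -m statistic --mode random --probability 0.33333 -j KUBE-SEP-MASTER01
    ...
    # each endpoint chain marks the source before translating the destination
    -A KUBE-SEP-MASTER01 -p udp -m recent --set --name KUBE-SEP-MASTER01 -m udp -j DNAT --to-destination 10.0.0.1:8053

If I'm reading this right, once a pod's source address has been recorded against master01's chain, its lookups keep being sent back there for the affinity timeout, which would explain why we keep hitting the offline master every time rather than 1 in 3.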


If anyone has any information to share on this, I'd be grateful. I can also provide further details if required.


Thanks


J
