Among other things, etcd does not scale that well, even if you do the recommended “put events on their own separate etcd cluster” configuration for large clusters.
You will likely have to re-think this one. You might wind up with multiple clusters, via e.g. federation-v2 or something like kube-applier. The former lets you manage them all in a way that they are aware of each other, whereas the second is essentially “one git repo per cluster, they all pull changes from their own repo periodically and just apply it”.
Microsoft wants you to wind up with something like this, which is based on virtual-kubelet and their own IoT services for the heavy lifting.
Please update this thread with your thoughts on the matter, as they may be of interest to others, as well.
What is your workload exactly? It may be a very real realization that Kubernetes may not be the answer for (some, if not all) of your workload(s) requirement(s) at this level.
If you do wind up running a huge amount of cluster nodes in a rather large federation, please be aware that you will run into strange issues that you never had to think of before. For instance, the v2 federation control plane uses a lot of etcd storage, due to how Overrides are implemented. You might run into issues when a certain etcd object takes up more than (the default) 1MB of space.
Make sure you are very comfortable with operating and managing etcd, as it will be a limiting factor for you when you make a deployment of this size.
Like @yomateod, I would love to learn more about your workload here. Massive scalability of Kubernetes is something I am very interested in, and it would be very interesting to learn about your thoughts and challenges in making this come true.
Just to chime in, I would suggest hopping into both the sig-multicluster and sig-scalability channels and/or calls. They both like to get actual use-cases fleshed and want to hear more from end-users as to what they’re looking for.
The main reason I ask is that etcd is designed to work well in a relatively low-latency environment. So even if you manage to make a cluster with 5,000 nodes in it, those are typically expected to be within a single data center, not “1 per customer” as it kind of sounds like you want to do in some kind of IoT fashion.
You may also be interested in a guide for tuning etcd (as I said before, I hope you are really good with managing and operating etcd, because it will be crucial for supporting any huge environment).
Actually what we are aiming at is IoT devices that will be at US Intersections.
We may start small, with only a certain section of this city, but we want to be prepared to scale.
Imagine Austin, Atlanta, San Francisco, etc backed by PostGis Databases.
Each city might use it’s own datacenter.
Low latency is definitely key.
I’m guessing well be using a 5 node etcd cluster with SAN storage with 500+ Sequential IOPS; per cluster.