Best storage design for kube cluster

Philip_Brown · September 16, 2019, 12:04am

Let’s say you were looking to build your first kubernetes cluster in your own personal data center.

You were going to be using 12-20 blade servers, and spreading them across multiple racks.
There would be multiple separate groups of applications, and even customers, running jobs across them.
You might even have more than one kube cluster.

You wanted to have some kind of standardised storage that would let you be able to write data apparently “locally” to each instance ( so it would appear as a file system most of the time. if not always)

But most importantly…

you wanted the ability to have the data store be transparently mirrored across racks, synchronously.
local raid1 data stores are not sufficient.

How would you choose to implement the common data storage, and why?

points for :

price
performance
ease of maintenance

riking · September 16, 2019, 12:51am

First, at a conceptual level: Assume networked storage, not local. It’ll always be mounted transparently to the pod, but it’ll actually be remote.

I’d take a look at Ceph+Rook, to manage all your non-boot drives though I admit I haven’t used it yet!

Philip_Brown · September 16, 2019, 1:20am

Thanks for the reply.
was hoping for some replies with concrete production at-scale experience though.

sylflo · September 16, 2019, 9:30am

I used Rook on homelab, it works pretty well with Ceph. It also works with Storage Provider but most of them are in alpha https://rook.io/docs/rook/v1.1/quickstart-toc.html.

The other problem I can see, it’s about share filesystem, you can only create one which can be a problem if you want to seperate your projects in differents file system. https://rook.io/docs/rook/v1.1/ceph-filesystem.html

riking · September 17, 2019, 5:21am

i forget, can you mount a subdirectory to a pod or do you need to give the application long paths?

Kyouuma · May 7, 2020, 12:32pm

I installed an NFS server on a seperate machine and used nfs client provisonner in kubernetes to point it to the storage ( as a default storage ). now that i have my volumes in a seperate machine, i made a daily backup to an external disk.
I used Debian 10 for the nfs server since it’s stable and needs very few updates. We have the same thing in production and it’s been working fine for more than a year now.
as for performance, i had no issues or latency, the nfs is quite fast if you use ssd

Topic		Replies	Views
Good storage provisioner for on-premises (non-cloud) cluster? General Discussions	7	8918	March 4, 2021
Recommendations for sharing storage across a cluster General Discussions	3	1032	August 4, 2019
On premises k8s PV General Discussions development , network	1	478	November 22, 2023
Persistent Storage Providers General Discussions	8	3625	January 23, 2019
Rook.io production use General Discussions	3	2285	May 15, 2018

Best storage design for kube cluster

Related topics