On 2/06/2016 6:10 AM, Jeremy McCoy wrote:
I am new here and am working on designing a Proxmox cluster. Just wondering if anyone has tried doing Ceph on ZFS (instead of XFS on LVM or whatever pveceph sets up) and, if so, how you went about implementing it. I have 4 hosts that each have 1 spinner and 1 SSD to offer to the cluster.
Been there ... Ceph does not like ZFS; apparently COW filesystems perform badly under Ceph's workload. It's recommended against (as is ext4, btw).
Ceph does not do well on small setups; with only one OSD/disk per node you will get *terrible* performance.
The 9.x versions of Ceph have a latency bug that triggers under memory pressure (common on combined compute/storage nodes), and 10.x is, IMO, even less friendly to small setups. Also, as you've probably noticed, it's a maintenance headache for small shops, especially when things go wrong.
Are there any pitfalls to be aware of here? My goal is to mainly run LXC containers (plus a few KVM VMs) on distributed storage, and I was hoping to take advantage of ZFS's caching, compression, and data integrity features. I am also open to doing GlusterFS or something else, but it looked like Proxmox does not support LXC containers running on that yet.
Probably because LXC doesn't support the native Gluster API (gfapi); I imagine the same problem applies to Ceph/RBD.
However, there is also the Gluster FUSE mount that Proxmox automatically creates (/mnt/pve/<glusterid>); you should be able to set that up as shared directory storage and use it with LXC. I'll test that myself later today.
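As a sketch, registering that FUSE mount as directory storage might look like the following. The storage IDs ("glustervol", "gluster-dir") are placeholders, not anything Proxmox creates for you:

```shell
# Hypothetical example: "glustervol" is a placeholder Gluster storage ID,
# "gluster-dir" a placeholder name for the new directory storage.
# Register the Proxmox-created FUSE mount as shared directory storage,
# allowing container root disks (rootdir) as well as VM images.
pvesm add dir gluster-dir \
    --path /mnt/pve/glustervol \
    --shared 1 \
    --content rootdir,images

# Verify the new storage shows up as active on the node.
pvesm status
```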
Gluster works pretty well with ZFS; I get excellent performance, maxing out my network for writes and much better IOPS than I was getting with Ceph. Enabling lz4 compression gave me a 33% saving on space with no noticeable impact on performance.
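Turning on lz4 is a one-liner; "tank/brick1" below is a placeholder for whatever dataset backs your Gluster brick:

```shell
# Enable lz4 compression on the dataset backing the Gluster brick
# ("tank/brick1" is a placeholder dataset name).
zfs set compression=lz4 tank/brick1

# Check the achieved ratio later; a value around 1.33x corresponds
# to the ~33% space saving mentioned above.
zfs get compression,compressratio tank/brick1
```

Note this only affects newly written data; existing blocks stay uncompressed until rewritten.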
If you can afford it, I recommend using ZFS RAID1 or, better yet, ZFS RAID10 per node. Apart from the extra redundancy, it has much better read/write performance than a single disk. And it's much easier to replace a failed disk in a ZFS mirror than it is to replace a failed Gluster or Ceph node.
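For reference, a per-node RAID10-style pool and a mirror disk replacement look roughly like this (pool name and device paths are placeholders; in practice you'd use /dev/disk/by-id/ names):

```shell
# Create a RAID10-style pool from two mirrored pairs
# ("tank" and the device names are placeholders).
zpool create tank \
    mirror /dev/sda /dev/sdb \
    mirror /dev/sdc /dev/sdd

# Replacing a failed disk in a mirror is a single command;
# ZFS resilvers only the allocated data, not the whole disk.
zpool replace tank /dev/sdb /dev/sde

# Watch resilver progress and pool health.
zpool status tank
```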
--
Lindsay Mathieson
_______________________________________________
pve-user mailing list
pve-user@pve.proxmox.com
http://pve.proxmox.com/cgi-bin/mailman/listinfo/pve-user