Reviving an old thread:

On Sunday, July 12, 2015, Lionel Bouton <lionel+c...@bouton.name> wrote:

> On 07/12/15 05:55, Alex Gorbachev wrote:
> > FWIW. Based on the excellent research by Mark Nelson
> > (
> http://ceph.com/community/ceph-performance-part-2-write-throughput-without-ssd-journals/
> )
> > we have dropped SSD journals altogether, and instead went for the
> > battery protected controller writeback cache.
>
> Note that this has limitations (and the research is nearly 2 years old):
> - the controller writeback caches are relatively small (often less than
> 4GB, 2GB is common on the controller, a small portion is not usable, and
> 10% of the rest is often used for readahead/read cache) and this is
> shared by all of your drives. If your workload is not "write spikes"
> oriented, but nearly constant writes this won't help as you will be
> limited on each OSD by roughly half of the disk IOPS. With journals on
> SSDs when you hit their limit (which is ~5GB of buffer for 10GB journals
> and not <2GB divided by the amount of OSDs per controller), the limit is
> the raw disk IOPS.
> - you *must* make sure the controller is configured to switch to
> write-through when the battery/capacitor fails (or a power failure on
> hardware from the same generation could make you lose all of the OSDs
> connected to them in a single event which means data loss),
> - you should monitor the battery/capacitor status to trigger maintenance
> (and your cluster will slow down while the battery/capacitor is waiting
> for a replacement, you might want to down the associated OSDs depending
> on your cluster configuration). We mostly eliminated this problem by
> replacing the whole chassis of the servers we lease for new generations
> every 2 or 3 years: if you time the hardware replacement to match a
> fresh chassis generation this means fresh capacitors and they shouldn't
> fail you (ours are rated for 3 years).
>
> We just ordered Intel S3710 SSDs even though we have battery/capacitor
> backed caches on the controllers: the latencies have started to rise
> nevertheless when there are long periods of write intensive activity.
> I'm currently pondering if we should bypass the write-cache for the
> SSDs. The cache is obviously less effective on them and might be more
> useful overall if it is dedicated to the rotating disks. Does anyone
> have test results with cache active/inactive on SSD journals with HP
> Smart Array p420 or p840 controllers?


We have come to the same conclusion once we started seeing some more
constant write loads. Thank you for the great info - question: have you
tried SSD journals with and without additional controller cache?  Any
benefit?

Thank you,
Alex



>
> Lionel
>


-- 
--
Alex Gorbachev
Storcium
_______________________________________________
ceph-users mailing list
ceph-users@lists.ceph.com
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com

Reply via email to