We use XFS for our data drives, and we've had somewhat mixed results. One of the biggest pros is that XFS has more free space than ext3, even with the reserved space settings turned all the way to 0. Another is that you can format a 1TB drive as XFS in about 0 seconds, versus minutes for ext3. This makes it really fast to kickstart our worker nodes.

We have seen some weird stuff happen though when machines run out of memory, apparently because the XFS driver does something odd with kernel memory. When this happens, we end up having to do some fscking before we can get that node back online.

As far as outright performance, I actually *did* do some tests of xfs vs ext3 performance on our cluster. If you just look at a single machine's local disk speed, you can write and read noticeably faster when using XFS instead of ext3. However, the reality is that this extra disk performance won't have much of an effect on your overall job completion performance, since you will find yourself network bottlenecked well in advance of even ext3's performance.

The long and short of it is that we use XFS to speed up our new machine deployment, and that's it.

-Bryan

On May 18, 2009, at 10:31 AM, Alex Loddengaard wrote:

I believe Yahoo! uses ext3, though I know other people have said that XFS has performed better in various benchmarks. We use ext3, though we haven't
done any benchmarks to prove its worth.

This question has come up a lot, so I think it'd be worth doing a benchmark and writing up the results. I haven't been able to find a detailed analysis
/ benchmark writeup comparing various filesystems, unfortunately.

Hope this helps,

Alex

On Mon, May 18, 2009 at 8:54 AM, Bob Schulze <b.schu...@ecircle.com> wrote:

We are currently rebuilding our cluster - has anybody recommendations on
the underlaying file system? Just standard Ext3?

I could imagine that the block size could be larger than its default...

Thx for any tips,

       Bob



Reply via email to