Hi,

Just curious - did you try ext3? Could it be faster than ext4? The Hadoop
wiki suggests ext3, as it's what is mostly used for Hadoop clusters:

http://wiki.apache.org/hadoop/DiskSetup

On Thu, Apr 22, 2010 at 12:02 PM, stephen mulcahy
<stephen.mulc...@deri.org> wrote:

> Hi,
>
> I've been tweaking our cluster roll-out process to refine it. While doing
> so, I decided to check if XFS gives any performance benefit over EXT4.
>
> As per a comment I read somewhere on the hbase wiki - XFS makes for faster
> formatting of filesystems (it takes us 5.5 minutes to rebuild a datanode
> from bare metal to a full Hadoop config on top of Debian Squeeze using XFS)
> versus EXT4 (same bare metal restore takes 9 minutes).
>
> However, TeraSort performance on a cluster of 45 of these data-nodes shows
> XFS is slower (same configuration settings on both installs other than
> changed filesystem), specifically,
>
> mkfs.xfs -f -l size=64m DEV
> (mounted with noatime,nodiratime,logbufs=8)
> gives me a cluster which runs TeraSort in about 23 minutes
>
> mkfs.ext4 -T largefile4 DEV
> (mounted with noatime)
> gives me a cluster which runs TeraSort in about 18.5 minutes
>
> So I'll be rolling our cluster back to EXT4, but thought the information
> might be useful/interesting to others.
>
> -stephen
>
>
> XFS config chosen from notes at
> http://everything2.com/index.pl?node_id=1479435
>
> --
> Stephen Mulcahy, DI2, Digital Enterprise Research Institute,
> NUI Galway, IDA Business Park, Lower Dangan, Galway, Ireland
> http://di2.deri.ie    http://webstar.deri.ie    http://sindice.com
>
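
For a sense of scale, the timings quoted above can be turned into relative
differences. This is only a minimal sketch in Python: the variable and
function names are illustrative, and the inputs are the approximate figures
from the quoted message, not new measurements.

```python
# Timings in minutes, copied from the quoted message (approximate values).
terasort = {"xfs": 23.0, "ext4": 18.5}   # TeraSort on the 45-node cluster
rebuild = {"xfs": 5.5, "ext4": 9.0}      # bare-metal datanode rebuild


def percent_slower(slow, fast):
    """Return how much longer `slow` takes than `fast`, as a percentage."""
    return (slow - fast) / fast * 100.0


terasort_gap = percent_slower(terasort["xfs"], terasort["ext4"])
rebuild_gap = percent_slower(rebuild["ext4"], rebuild["xfs"])

print(f"TeraSort: XFS takes {terasort_gap:.1f}% longer than EXT4")
print(f"Rebuild:  EXT4 takes {rebuild_gap:.1f}% longer than XFS")
```

The rebuild is a one-off cost per node while TeraSort stands in for the
recurring workload, so the two percentages are not directly comparable.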



-- 
Andrew Klochkov
