I was having some success with PVFS2. The jobtracker and tasktrackers were set up to use the 'local' file system.

mapred.local.dir was on the hard drive of each machine, e.g. /tmp/hadoop.
mapred.system.dir was on the pvfs2 mount, with the same path for all tasktrackers and the jobtracker, e.g. /mnt/pvfs2/hadoop/system.
mapred.temp.dir was also on the pvfs2 mount, with the same path everywhere, e.g. /mnt/pvfs2/hadoop/temp.
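For the record, the layout above would look roughly like this in a hadoop-site.xml (a sketch only; the property names are the mapred.* keys mentioned above, and the paths are the examples from this mail, not a tested config):

```xml
<!-- Sketch of a hadoop-site.xml matching the layout described above.
     Paths are illustrative; adjust them to your own mounts. -->
<configuration>
  <property>
    <name>mapred.local.dir</name>
    <!-- per-node scratch space on the local disk -->
    <value>/tmp/hadoop</value>
  </property>
  <property>
    <name>mapred.system.dir</name>
    <!-- shared path, identical on the jobtracker and every tasktracker -->
    <value>/mnt/pvfs2/hadoop/system</value>
  </property>
  <property>
    <name>mapred.temp.dir</name>
    <!-- also shared, same path everywhere -->
    <value>/mnt/pvfs2/hadoop/temp</value>
  </property>
</configuration>
```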

It worked out pretty well except for the performance of the pvfs2 cluster. When I decided to switch to the hadoop dfs, I noticed that things were more stable (tasktrackers stopped timing out) and that my reduce tasks completed more quickly.

There may have been some things I could have done to the storage cluster to increase its performance, but I decided it was quicker to try out the hadoop dfs.

Jeff

Doug Cutting wrote:

Stefan Groschupf wrote:

in general hadoop's tasktrackers and jobtrackers require a running dfs.


Stefan: that should not be the case. One should be able to run things entirely out of the "local" filesystem. Absolute pathnames may be required for input and output directories, but that's a bug that we can fix.
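Running entirely out of the local filesystem as Doug describes would amount to something like the following (a hypothetical sketch: in Hadoop of this vintage, setting fs.default.name to 'local' selected the local filesystem instead of a DFS namenode — treat the exact key, value, and path as assumptions):

```xml
<!-- Hypothetical hadoop-site.xml for running MapReduce with no DFS daemons. -->
<configuration>
  <property>
    <name>fs.default.name</name>
    <!-- 'local' selects the local filesystem rather than a DFS namenode -->
    <value>local</value>
  </property>
  <property>
    <name>mapred.system.dir</name>
    <!-- absolute path (per Doug's note) on a mount visible to all nodes -->
    <value>/mnt/shared/hadoop/system</value>
  </property>
</configuration>
```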

Doug




_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general
