On 05/30/2011 11:02 AM, Panayotis Antonopoulos wrote:
Hello,
I just noticed that the files that are created using MultipleOutputs remain in the temporary folder into attempt sub-folders when there is no normal output (using context.write(...)).

Has anyone else noticed that?
Is there any way to change that and make the files appear in the output directory?

Thank you in advance!
Panagiotis.


       |mapred.local.dir|

This lets the MapReduce servers know where to store intermediate files. This may be a comma-separated list of directories to spread the load. Make sure there’s enough space here for all your intermediate files. We share the same disks for MapReduce and HDFS.


       |mapred.system.dir|

This is a folder in the|defaultFS|where MapReduce stores some control files. In our case that would be a directory in HDFS. If you have|dfs.permissions|(which it is by default) enabled make sure that this directory exists and is owned by mapred:hadoop.


       |mapred.temp.dir|

This is a folder to store temporary files in. It is hardly -- if at all used. If I understand the description correctly this is supposed to be in HDFS but I’m not entirely sure by reading the source code. So we set this to a directory that exists on the local filesystem as well as in HDFS.



--
Marcos Luis Ortiz Valmaseda
 Software Engineer (Distributed Systems)
 http://uncubanitolinuxero.blogspot.com

Reply via email to