Hello,

I have a 4-node cluster: one namenode and 3 datanodes. I want to
explicitly set dfs.replication to 1 in order to run some
experiments. I tried setting it both in the hdfs-site.xml file and from
the command line (hadoop dfs -setrep -R -w 1 /), but I suspect
that HDFS is still using a replication factor of 3: it seems
to be writing the temporary mapper outputs to all 3 datanodes. Is
this the default configuration for MR jobs? If not, how can I set it
to 1?

Thanks,
Aishwarya
