Hello, I have a 4-node cluster. One namenode and 3 other datanodes. I want to explicitly set the dfs.replication factor to 1 inorder to run some experiments. I tried setting this via the hdfs-site.xml file and via the command line as well (hadoop dfs -setrep -R -w 1 /). But I have a feeling that the replication factor that hdfs is seeing is 3. It seems to be writing the temporary mapper outputs to all the 3 datanodes. Is this the default configuration for MR jobs ? If no, how can I set this to 1 ?
Thanks, Aishwarya