Hi, I am testing ItemSimilarityJob with Netflix data (2.6 GB) and I have just run out of disk space (160 GB) in my mapred.local.dir when running RowSimilarityJob.
Is this normal behaviour? How can I improve this? Thanks! Charly