Hi,
I know that Nutch uses Hadoop to run the map reduce tasks. I want to know where it stores the data. I see Nutch creating the indexes and segments in 'crawl' directory. But shouldn't it be created in the HDFS instead?
Hi,
I know that Nutch uses Hadoop to run the map reduce tasks. I want to know where it stores the data. I see Nutch creating the indexes and segments in 'crawl' directory. But shouldn't it be created in the HDFS instead?