Re: Understanding of the hadoop distribution system (tuning)

2012-09-10 Thread Jagat Singh
Hello Elaine, You did not mention your cluster size: the number of nodes and the cores in each node. What sort of work are you doing? 6 hours for 518 MB of data is a very long time. The number of map tasks would be about 518/64, so that many map tasks need to run to process your data. Now they can run on single node or
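The 518/64 figure in the reply above can be checked with a rough sketch like the one below. It assumes a 64 MB split size (the divisor used in the reply) and a hypothetical helper name, estimate_map_tasks; it is only a back-of-envelope estimate, since the real number of map tasks depends on the input format, the number of input files, and the job's split settings, and may differ from this figure.

```python
import math


def estimate_map_tasks(input_mb: float, split_mb: float = 64) -> int:
    """Rough estimate: one map task per split, rounding up.

    split_mb=64 is an assumption taken from the reply (518/64);
    the actual split size depends on the cluster and job configuration.
    """
    if split_mb <= 0:
        raise ValueError("split_mb must be positive")
    if input_mb <= 0:
        return 0
    return math.ceil(input_mb / split_mb)


if __name__ == "__main__":
    input_mb = 518  # input size mentioned in the thread
    print(f"{input_mb} / 64 = {input_mb / 64:.2f}")
    print("estimated map tasks:", estimate_map_tasks(input_mb))
```

With only a handful of map tasks, the job can use at most that many map slots at once, which is why the cluster size matters when judging the 6-hour runtime.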

Re: Understanding of the hadoop distribution system (tuning)

2012-09-10 Thread Hemanth Yamijala
Hi, Responses inline to some points. On Tue, Sep 11, 2012 at 7:26 AM, Elaine Gan elaine-...@gmo.jp wrote: Hi, I'm new to Hadoop and I've just played around with MapReduce. I would like to check whether my understanding of Hadoop is correct, and I would appreciate it if anyone could correct me if