Re: good way to debug map reduce code

2012-12-26 Thread Harsh J
For Java MR jobs, there is Apache MRUnit that provides a good way of writing test cases. See http://mrunit.apache.org On Wed, Dec 26, 2012 at 7:26 AM, jamal sasha jamalsha...@gmail.com wrote: Hi, I have been using python hadoop streaming framework to write the code and now I am slowly moving

Re: good way to debug map reduce code

2012-12-26 Thread SUJIT PAL
Hi Jamal, A missing semi-colon should get flagged by the Java compiler, but one way to keep you debug cycles short is to (1) use local mode and (2) small data sets which you can run through under a minute. Once you are happy that your stuff works, move to distributed and target data sets. HTH

good way to debug map reduce code

2012-12-25 Thread jamal sasha
Hi, I have been using python hadoop streaming framework to write the code and now I am slowly moving towards the core java api's. And I am getting comfortable with it but what is the quickest way to debug the map reduce native code.. like in hadoop streaming this worked great. % cat input.txt |

Re: good way to debug map reduce code

2012-12-25 Thread Rishi Yadav
pseudo-distributed mode is the only way you can test your code as far as I know. This means that you are running a single node cluster. Are you using eclipse? Thanks and Regards, Rishi Yadav (o) 408.988.2000x113 || (f) 408.716.2726 InfoObjects Inc || http://www.infoobjects.com *(Big Data