For Java MR jobs, Apache MRUnit provides a good way of writing unit tests
for mappers and reducers. See http://mrunit.apache.org
On Wed, Dec 26, 2012 at 7:26 AM, jamal sasha jamalsha...@gmail.com wrote:
Hi,
I have been using python hadoop streaming framework to write the code and
now I am slowly moving
Hi Jamal,
A missing semicolon should get flagged by the Java compiler, but one way to
keep your debug cycles short is to (1) use local mode and (2) use small data
sets that you can run through in under a minute. Once you are happy that your
stuff works, move to distributed mode and the target data sets.
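
As a rough illustration of the small-data-set idea, here is a minimal sketch
that runs word-count style map and reduce logic entirely in memory, with no
Hadoop classes at all. The class name LocalWordCount, the word-count logic and
the sample lines are all hypothetical, and it assumes Java 8 or later. It is
not a substitute for local mode, just a quick way to check the core logic
before wiring it into a real Mapper and Reducer.

```java
import java.util.AbstractMap.SimpleEntry;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical stand-in for a word-count job; no Hadoop classes are used.
class LocalWordCount {

    // "Map" step: emit (word, 1) for every whitespace-separated token.
    static List<Map.Entry<String, Integer>> map(String line) {
        List<Map.Entry<String, Integer>> out = new ArrayList<>();
        for (String token : line.trim().split("\\s+")) {
            if (!token.isEmpty()) {
                out.add(new SimpleEntry<>(token, 1));
            }
        }
        return out;
    }

    // "Reduce" step: sum the values collected for one key.
    static int reduce(String key, List<Integer> values) {
        int sum = 0;
        for (int v : values) {
            sum += v;
        }
        return sum;
    }

    // Driver: map every line, group values by key in sorted order
    // (a crude stand-in for the shuffle), then reduce each group.
    static Map<String, Integer> run(List<String> lines) {
        Map<String, List<Integer>> grouped = new TreeMap<>();
        for (String line : lines) {
            for (Map.Entry<String, Integer> kv : map(line)) {
                grouped.computeIfAbsent(kv.getKey(), k -> new ArrayList<>())
                       .add(kv.getValue());
            }
        }
        Map<String, Integer> result = new TreeMap<>();
        for (Map.Entry<String, List<Integer>> e : grouped.entrySet()) {
            result.put(e.getKey(), reduce(e.getKey(), e.getValue()));
        }
        return result;
    }

    public static void main(String[] args) {
        // A tiny made-up data set that runs in well under a second.
        List<String> sample = Arrays.asList(
                "the quick brown fox", "the lazy dog", "the fox");
        for (Map.Entry<String, Integer> e : run(sample).entrySet()) {
            System.out.println(e.getKey() + "\t" + e.getValue());
        }
    }
}
```

Keeping map and reduce as plain static methods means they can be stepped
through in a debugger or called from a unit test without starting a cluster.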
HTH
Hi,
I have been using the Python Hadoop streaming framework to write the code and
now I am slowly moving towards the core Java APIs.
I am getting comfortable with it, but what is the quickest way to debug
native MapReduce code?
In Hadoop streaming, this worked great:
% cat input.txt |
Pseudo-distributed mode is the only way you can test your code, as far as I
know. This means that you are running a single-node cluster. Are you using
Eclipse?
Thanks and Regards,
Rishi Yadav