Re: Multi user Hadoop 0.20.205 ?

2011-12-28 Thread praveenesh kumar
Guys, Did someone try this thing ? Thanks On Tue, Dec 27, 2011 at 4:36 PM, praveenesh kumar wrote: > Hey guys, > > How we can make hadoop as multiuser ? > > One way to think as whatever group we currently assigned to use hadoop, > add users to same group and change permissions to hadoop.tmp.dir

Re: Debugging MR jobs using Eclipse

2011-12-28 Thread Harsh J
There are a few options. Most of the times, using logging and a LocalJobRunner turns out to be the easiest way. There is also Apache MRUnit you can use to test your M/R code (which is a much better approach, IMO) - https://cwiki.apache.org/confluence/display/MRUNIT Remote debugging of tasks is

Debugging MR jobs using Eclipse

2011-12-28 Thread Shreya.Pal
HI, I want to debug my MR Job on Ubuntu machine using eclipse. How can I do that, can anyone point me to the steps required? Regards, Shreya This e-mail and any files transmitted with it are for the sole use of the intended recipient(s) and may contain confidential and privileged informa

Re: mapreduce matrix multiplication on hadoop

2011-12-28 Thread Lance Norskog
Are you running on Windows? There are quirks about closing and opening files in read/write mode. You may be getting a sequencing error. On Wed, Nov 30, 2011 at 9:09 AM, J. Rottinghuis wrote: > The error is that you cannot open /tmp/MatrixMultiply/out/_logs > Does the directory exist? > Do you hav

Re: Hadoop MySQL database access

2011-12-28 Thread Prashant Kommireddi
By design reduce would start only after all the maps finish. There is no way for the reduce to begin grouping/merging by key unless all the maps have finished. Sent from my iPhone On Dec 28, 2011, at 8:53 AM, JAGANADH G wrote: > Hi All, > > I wrote a map reduce program to fetch data from MySQL

Hadoop MySQL database access

2011-12-28 Thread JAGANADH G
Hi All, I wrote a map reduce program to fetch data from MySQL and process the data(word count). The program executes successfully . But I noticed that the reduce task starts after finishing the map task only . Is there any way to run the map and reduce in parallel. The program fetches data from M