Have a look at JobClient; it should suffice.

Cheers!
Amogh
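
A minimal sketch of the hand-off pattern behind this suggestion, written in plain Java with no Hadoop classes so that it runs on its own. The class name CascadeSketch, the methods stageOne, stageTwo and runCascade, and the word-count logic are all hypothetical and only stand in for MRjob1 and MRjob2. The idea it illustrates: the first job writes its results to a directory, and the second job is started only after the first has finished and reads that directory as its input. With the old org.apache.hadoop.mapred API, JobClient.runJob(conf) blocks until the job completes, so calling it once per job, in order, should give the same sequencing.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Plain-Java model of two cascading jobs. No Hadoop classes are used;
// every name here is illustrative only.
final class CascadeSketch {

    // Stage 1 (stands in for MRjob1): count words and write
    // "word<TAB>count" lines to outDir/part-00000.
    static void stageOne(List<String> inputLines, Path outDir) {
        Map<String, Integer> counts = new TreeMap<>();
        for (String line : inputLines) {
            for (String word : line.trim().split("\\s+")) {
                if (!word.isEmpty()) {
                    counts.merge(word, 1, Integer::sum);
                }
            }
        }
        List<String> out = new ArrayList<>();
        for (Map.Entry<String, Integer> e : counts.entrySet()) {
            out.add(e.getKey() + "\t" + e.getValue());
        }
        try {
            Files.createDirectories(outDir);
            Files.write(outDir.resolve("part-00000"), out);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    // Stage 2 (stands in for MRjob2): read stage 1's output file and
    // keep only the words that occurred more than once.
    static List<String> stageTwo(Path inDir) {
        List<String> repeated = new ArrayList<>();
        try {
            for (String line : Files.readAllLines(inDir.resolve("part-00000"))) {
                String[] kv = line.split("\t");
                if (Integer.parseInt(kv[1]) > 1) {
                    repeated.add(kv[0]);
                }
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return repeated;
    }

    // Driver: stage 2 starts only after stage 1 has returned, and its
    // input directory is stage 1's output directory.
    static List<String> runCascade(List<String> inputLines) {
        try {
            Path intermediate = Files.createTempDirectory("mrjob1-out");
            stageOne(inputLines, intermediate);
            return stageTwo(intermediate);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println(runCascade(Arrays.asList("a b a", "c b a")));
    }
}
```

In a real driver, each stage would be its own JobConf, with the second job's input path set to the first job's output path before it is submitted.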

-----Original Message-----
From: bharath vissapragada [mailto:bharathvissapragada1...@gmail.com]
Sent: Friday, September 04, 2009 9:15 PM
To: common-user@hadoop.apache.org
Subject: Re: Some issues!

Hey,

I have one more doubt. Suppose I have some cascading MapReduce jobs, and some data collected in MRjob1 is to be used in MRjob2. Is there any way to do that?

Thanks

On Fri, Sep 4, 2009 at 1:54 PM, Amandeep Khurana <ama...@gmail.com> wrote:

> Or you can output the data in the keys and NullWritable as the value.
> That way you'll get only unique data...
>
> On 9/4/09, zhang jianfeng <zjf...@gmail.com> wrote:
> > Hi Sugandha,
> >
> > If you only want the value, you need to set the key as NullWritable in
> > reduce.
> >
> > e.g.
> > output.collect(NullWritable.get(), value);
> >
> > On Fri, Sep 4, 2009 at 12:46 AM, Sugandha Naolekar
> > <sugandha....@gmail.com> wrote:
> >
> >> Hello!
> >>
> >> I am running a simple MR job with a replication factor of 2. After it
> >> executes, the output is split into files named part-00000 and so on.
> >> What I want to ask is: can't we keep these keys from being printed in
> >> the output files? I am getting the output in the files as key-value
> >> pairs. I want just the data and not the keys (integers) in it.
> >>
> >> --
> >> Regards!
> >> Sugandha
> >
>
> --
> Amandeep Khurana
> Computer Science Graduate Student
> University of California, Santa Cruz