Job config before read fields

2013-08-30 Thread Adrian CAPDEFIER
Howdy, I apologise for the lack of code in this message, but the code is fairly convoluted and it would obscure my problem. That being said, I can put together some sample code if really needed. I am trying to pass some metadata between the map & reduce steps. This metadata is read and generated

Re: Job config before read fields

2013-08-30 Thread Adrian CAPDEFIER
Aug 30, 2013 at 9:01 PM, Adrian CAPDEFIER > wrote: > >> Howdy, >> >> I apologise for the lack of code in this message, but the code is fairly >> convoluted and it would obscure my problem. That being said, I can put >> together some sample code if really nee

Re: Job config before read fields

2013-08-31 Thread Adrian CAPDEFIER
here that is. Cheers, Adi On Sat, Aug 31, 2013 at 3:42 AM, Shahab Yunus wrote: > What I meant was that you might have to split or redesign your logic or > your usecase (which we don't know about)? > > Regards, > Shahab > > > On Fri, Aug 30, 2013 at 10:31 PM, Adrian

Re: Job config before read fields

2013-09-09 Thread Adrian CAPDEFIER
mapwritable%20in%20hadoop&f=false > > and then this: > > http://www.chrisstucchio.com/blog/2011/mapwritable_sometimes_a_performance_hog.html > > and add your own custom types here (note that you are restricted by size > of byte): > > http://hadoop.sourcearchive.com/doc

chaining (the output of) jobs/ reducers

2013-09-12 Thread Adrian CAPDEFIER
Howdy, My application requires 2 distinct processing steps (reducers) to be performed on the input data. The first operation generates changes the key values and, records that had different keys in step 1 can end up having the same key in step 2. The heavy lifting of the operation is in step1 and

Re: chaining (the output of) jobs/ reducers

2013-09-12 Thread Adrian CAPDEFIER
reduce' you write > logic and they map it to the various mapper/reducer steps automatically. > > Hope this helps, > > Chris > > > On Thu, Sep 12, 2013 at 9:36 AM, Adrian CAPDEFIER > wrote: > >> Howdy, >> >> My application requires 2 distin

Re: chaining (the output of) jobs/ reducers

2013-09-12 Thread Adrian CAPDEFIER
FS will do > the work to ensure that the mappers in the second job do as good a job as > possible to be data or rack-local. > > > On Thu, Sep 12, 2013 at 12:35 PM, Adrian CAPDEFIER > wrote: > >> Thank you, Chris. I will look at Cascading and Pig, but for starters I'd

Re: chaining (the output of) jobs/ reducers

2013-09-17 Thread Adrian CAPDEFIER
apper on the same host as the data, but was at least able to > keep it within the same rack. > > All of this is dependent a proper topology configuration, both in your > NameNode and JobTracker. > > > On Thu, Sep 12, 2013 at 3:02 PM, Adrian CAPDEFIER > wrote: > >>

Re: chaining (the output of) jobs/ reducers

2013-09-17 Thread Adrian CAPDEFIER
et to happen. > > You can learn about it more here: http://tez.incubator.apache.org/ > > HTH, > +Vinod > > On Sep 12, 2013, at 6:36 AM, Adrian CAPDEFIER wrote: > > Howdy, > > My application requires 2 distinct processing steps (reducers) to be > performed on the inp