Hierarchal Map call

2011-04-17 Thread mohak
hi I have a project on hadoop where I need to have hierarchal map functions. Essentially, I have a map function which would take the input and apart from emitting results(for reducer) it would also create new inputs. I now need to have map threads working on these new inputs and they keep on

Re: Estimating Time required to compute M/Rjob

2011-04-17 Thread real great..
Thanks a lot guys..will go throught it all. On Sun, Apr 17, 2011 at 3:33 AM, Ted Dunning tdunn...@maprtech.com wrote: Sounds like this paper might help you: Predicting Multiple Performance Metrics for Queries: Better Decisions Enabled by Machine Learning by Ganapathi, Archana, Harumi Kuno,

Re: Estimating Time required to compute M/Rjob

2011-04-17 Thread Matthew Foley
Since general M/R jobs vary over a huge (Turing problem equivalent!) range of behaviors, a more tractable problem might be to characterize the descriptive parameters needed to answer the question: If the following problem P runs in T0 amount of time on a certain benchmark platform B0, how long

Re: Estimating Time required to compute M/Rjob

2011-04-17 Thread Lance Norskog
ROC Convex Hull is an analysis technique for optimizing parameters for given outputs. For example, if a classification technique has tuning knobs, ROCCH will find the settings that give a desired failure rate. On Sun, Apr 17, 2011 at 12:07 PM, Matthew Foley ma...@yahoo-inc.com wrote: Since

Re: Estimating Time required to compute M/Rjob

2011-04-17 Thread Ted Dunning
Turing completion isn't the central question here, really. The truth is, map-reduce programs have considerably pressure to be written in a scalable fashion which limits them to fairly simple behaviors that result in pretty linear dependence of run-time on input size for a given program. The cool

Re: Estimating Time required to compute M/Rjob

2011-04-17 Thread James Seigel Tynt
Yup. I'm boring On 2011-04-17, at 6:07 PM, Ted Dunning tdunn...@maprtech.com wrote: Turing completion isn't the central question here, really. The truth is, map-reduce programs have considerably pressure to be written in a scalable fashion which limits them to fairly simple behaviors that

Re: Estimating Time required to compute M/Rjob

2011-04-17 Thread real great..
@mathew: initially i wanted to concentrate on generic class of applications..wouldnt mind to stick on to one now..can i know something more about the descriptive parameters? @all: any results of anybody having done something similar? On Mon, Apr 18, 2011 at 5:55 AM, James Seigel Tynt