R.V.,
I was only suggesting one way to tackle the problem; I don't have a list of
appropriate parameters.
I think Ted has much more experience in this area, and he is encouraging you to
stay with the generic approach. You should study that paper he recommended,
the approach looks really powerfu
@mathew: initially i wanted to concentrate on generic class of
applications..wouldnt mind to stick on to one now..can i know something more
about the descriptive parameters?
@all: any results of anybody having done something similar?
On Mon, Apr 18, 2011 at 5:55 AM, James Seigel Tynt wrote:
> Y
Yup. I'm boring
On 2011-04-17, at 6:07 PM, Ted Dunning wrote:
> Turing completion isn't the central question here, really. The truth
> is, map-reduce programs have considerably pressure to be written in a
> scalable fashion which limits them to fairly simple behaviors that
> result in pretty
Turing completion isn't the central question here, really. The truth
is, map-reduce programs have considerably pressure to be written in a
scalable fashion which limits them to fairly simple behaviors that
result in pretty linear dependence of run-time on input size for a
given program.
The cool
ROC Convex Hull is an analysis technique for optimizing parameters for
given outputs.
For example, if a classification technique has tuning knobs, ROCCH
will find the settings that give a desired failure rate.
On Sun, Apr 17, 2011 at 12:07 PM, Matthew Foley wrote:
> Since general M/R jobs vary o
Since general M/R jobs vary over a huge (Turing problem equivalent!) range of
behaviors, a more tractable problem might be to characterize the descriptive
parameters needed to answer the question: "If the following problem P runs in
T0 amount of time on a certain benchmark platform B0, how long
Thanks a lot guys..will go throught it all.
On Sun, Apr 17, 2011 at 3:33 AM, Ted Dunning wrote:
> Sounds like this paper might help you:
>
> Predicting Multiple Performance Metrics for Queries: Better Decisions
> Enabled by Machine Learning by Ganapathi, Archana, Harumi Kuno,
> Umeshwar Daval, J
hi
I have a project on hadoop where I need to have hierarchal map functions.
Essentially, I have a map function which would take the input and apart from
emitting results(for reducer) it would also create new inputs.
I now need to have map threads working on these new inputs and they keep on
gen