Hi,
I examined a little bit about pig's roadmap page and I'm interested in some
of them. I found that you might be working on in this items. But I couldn't
find the issue on jira about them. İs anyone working on them and if it's
not, how can I contribute it? I mean should I create issues about them
woops, wrong listserv :)
2013/4/5 Jonathan Coveney
> The following gist illustrates my question:
>
> https://gist.github.com/jcoveney/5320422
>
> It seems pretty surprising to me that all of these cases all return 1.0,
> at least in python (I will now do this in Java, it's just more verbose). I
The following gist illustrates my question:
https://gist.github.com/jcoveney/5320422
It seems pretty surprising to me that all of these cases all return 1.0, at
least in python (I will now do this in Java, it's just more verbose). Is
this an issue with python? Is this an issue period? Is this une
now that I've turned off noSplitCombination we have 640 mappers.
the relation being ranked is likely in the billions or 1+ trillion records.
On Fri, Apr 5, 2013 at 10:47 AM, Bill Graham wrote:
> How many mappers and reducers do you have? Skimming the Rank code it looks
> like it creates at lea
How many mappers and reducers do you have? Skimming the Rank code it looks
like it creates at least N counters per task which would be a scalability
bug.
On Friday, April 5, 2013, Lauren Blau wrote:
> this is defintely caused by the RANK operator. Is there some way to reduce
> the number of count
this is defintely caused by the RANK operator. Is there some way to reduce
the number of counters generated by this operator when working with large
data?
thanks
On Thu, Apr 4, 2013 at 7:01 PM, Lauren Blau <
lauren.b...@digitalreasoning.com> wrote:
> I can think of only 2 things that have changed