Oh...oh.
The current developing version is not appliable to my service.
My pig job failed by another problem..
I want to see the reason why the job failed... but I have no much time.
It looks like more serious problem.
Right now, pig-0.12.0-h2 version is more stable in my case.
So, I decide to us
I think I figured it out using replicated join.
My initial understanding of the pig M/R plan was incorrect. It was
performing a reduce side join like so:
Map1.1 (LOAD A)
Map1.2 (LOAD B) -> Reduce1 (CROSS, FILTER) -> Map2 (seemingly useless)
->
Reduce2 (COUNT)
Since one of m
I am trying to aggregate on the cross product of two relations. It can be
done using a single M/R job but pig is using two. The pig code looks like
this:
C = cross A, B;
C = filter C by Š;
G = group C by x;
G = foreach G generate group, COUNT(G);
The resulting M/
Hello again,
any comments on the subject?
Cheers,
Anastasis
On 4 Φεβ 2014, at 5:36 μ.μ., Anastasis Andronidis
wrote:
> Hello,
>
> I am using Apache Pig version 0.11.0-cdh4.5.0 and when I want to know if
> there is a way to overwrite a partition in a table with Hcat. Up until know I
> am ge