Re: InputSizeReducerEstimator cannot get PhysicalOperators, so My pig job always make one reducer in Hadoop 2.2.0

2014-02-09 Thread 최종원
Oh...oh. The current developing version is not appliable to my service. My pig job failed by another problem.. I want to see the reason why the job failed... but I have no much time. It looks like more serious problem. Right now, pig-0.12.0-h2 version is more stable in my case. So, I decide to us

Re: CROSS optimization

2014-02-09 Thread Enns, Steven
I think I figured it out using replicated join. My initial understanding of the pig M/R plan was incorrect. It was performing a reduce side join like so: Map1.1 (LOAD A) Map1.2 (LOAD B) -> Reduce1 (CROSS, FILTER) -> Map2 (seemingly useless) -> Reduce2 (COUNT) Since one of m

CROSS optimization

2014-02-09 Thread Enns, Steven
I am trying to aggregate on the cross product of two relations. It can be done using a single M/R job but pig is using two. The pig code looks like this: C = cross A, B; C = filter C by Š; G = group C by x; G = foreach G generate group, COUNT(G); The resulting M/

Re: Pig and Hcat

2014-02-09 Thread Anastasis Andronidis
Hello again, any comments on the subject? Cheers, Anastasis On 4 Φεβ 2014, at 5:36 μ.μ., Anastasis Andronidis wrote: > Hello, > > I am using Apache Pig version 0.11.0-cdh4.5.0 and when I want to know if > there is a way to overwrite a partition in a table with Hcat. Up until know I > am ge