[ANNOUNCE] Pig 0.11.1 has been released!

2013-04-01 Thread Bill Graham
The Pig team is happy to announce the Pig 0.11.1 release. Apache Pig provides a high-level data-flow language and execution framework for parallel computation on Hadoop clusters. More details about Pig can be found at http://pig.apache.org/. This is a maintenance release of Pig 0.11 and contains

Re: Join question

2013-04-01 Thread Mehmet Tepedelenlioglu
I am not sure if I understand you correctly, but you seem to want to find the average per id. For that all you need to do is group by id, and then take the avg for every group. You don't need to count anything. On 4/1/13 3:44 PM, "jamal sasha" wrote: >Hi, > Yeah, there was a bug in my "stats" d

Re: Join question

2013-04-01 Thread jamal sasha
Hi, Yeah, there was a bug in my "stats" data. I was wondering how can I calcualte average in pig.. Something like : http://stackoverflow.com/questions/12593527/finding-mean-using-pig-or-hadoop But in top response.. it seems that the user wanted to calculate across average across all data.. as c

Re: Join question

2013-04-01 Thread Mehmet Tepedelenlioglu
Are your ids unique? On 4/1/13 2:06 PM, "jamal sasha" wrote: >Hi, > I have a simple join question. >base = load 'input1' USING PigStorage( ',' ) as (id1, field1, field2); >stats = load 'input2' USING PigStorage(',') as (id1, mean, median); >joined = JOIN base BY id1, stats BY id1; >final = F

Join question

2013-04-01 Thread jamal sasha
Hi, I have a simple join question. base = load 'input1' USING PigStorage( ',' ) as (id1, field1, field2); stats = load 'input2' USING PigStorage(',') as (id1, mean, median); joined = JOIN base BY id1, stats BY id1; final = FOREACH joined GENERATE base::id1, base::field1,base::field2, stats::me

Re: GSoC 2013

2013-04-01 Thread burakkk
I'm using only WTF graph representation to fit the memory. By the way I haven't seen any explanation from the pig 0.11 release page about WTF or graph models. I don't wanna use Cassovary. I believe it can be done with pig. I implement a graph representation using WTF paper to pig and then I'll use

Re: GSoC 2013

2013-04-01 Thread Dmitriy Ryaboy
I'm somewhat familiar with WTF code (my day job is managing the analytics infrastructure team at Twitter). WTF is implemented using Pig 0.11 (in fact some of the Pig 11 features/improvements are directly due to this project...), and mostly has to do with clever algorithms implemented in Pig (an ear