Thanks Alan, I got the sum to work based on your example. Here is the
script with the sum code now working. I am sure there is a better way to do
what I am after.
The script takes key-value log data and rolls it up by several of the
keys. Being a Perl and not a Java coder, I stream the da
A general sum with group all can be done as:
A = load 'file' as (x, y);
B = group A all;
C = foreach B generate SUM(A.x);
This will give you the sum of all x. But from the schema you show
below I'm not sure this is what you're trying to do. Can you attach
your script and an example record
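
If what is wanted is a total per group rather than one grand total, the same pattern works with a group by on the keys instead of group all. The following is only a minimal sketch: the field names site and tf are borrowed from the schema quoted in this thread, while the hits field, its type, and the 'file' path are hypothetical stand-ins for whatever the real data contains.

```pig
-- Hypothetical input: one record per line with fields (site, tf, hits).
A = load 'file' as (site:chararray, tf:chararray, hits:long);

-- Group on the composite key instead of "all".
B = group A by (site, tf);

-- One output row per (site, tf): the record count and the summed hits.
-- Built-in function names such as COUNT and SUM are case sensitive.
C = foreach B generate flatten(group) as (site, tf),
                       COUNT(A) as n,
                       SUM(A.hits) as total;

dump C;
```

Flattening the group tuple turns the key back into ordinary columns, which makes it easier to group or sum again in a later step.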
Hello fellow Pig users. I am brand new to Pig/Hadoop, and am having
trouble with something that I am guessing is very basic. I have a
relation where I did a group by several values, then counted the
groups. Here is a description of the relation:
count_grouped: {g1: (site: chararray,tf: char