Hi Thejas,
Thank you for your reply!
I agree that accumulator mode can be used if you only use built-in UDFs. :)
I had already noticed the point you mentioned in your reply. In my script,
PAGE_COUNT is an eval function that implements only the Accumulator
interface. I also check built-in U
Hi Yen,
Does the function also implement Algebraic? In that case it might end
up using the Algebraic interface of the UDF.
If your foreach statement has functions that don't implement the Accumulator
interface, then the reduce task won't run in accumulative mode. This is
because you are anyway going t
Hi Jon,
Thanks for your response! I use Pig 0.9.1-snapshot.
I've used FLATTEN instead of $0 and $1, but ACCUM_CALL is still not fired.
I also tried removing the generic type in the accumulator, but it did not help. :(
Is it easy for you to get the accumulator to fire?
Yen
On Tue, Mar 13, 2012 at 3:06 PM, Jonathan C
What version of Pig are you using?
Just as an experiment in the simple case, can you try doing
GENERATE flatten(group) as (domain,host), ...(the rest)...
It shouldn't make a difference, but I think I remember that in some older
versions it did.
2012/3/13 Yen SYU
> Hi all,
>
> I just test a very s
Hi all,
I just tested a very simple Pig script, as follows:
records = LOAD '$input' AS (hash:chararray, domain:chararray,
host:chararray, page:chararray, freq:int);
grpd = GROUP records BY (domain, host);
stats = FOREACH grpd {
hashes = records.hash;